A Neural Procedure for Gene Function Prediction

Frasca, Marco; Bertoni, Alberto; Sion, Andrea

doi:10.1007/978-3-642-35467-0_19

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 19)

Abstract

The graph classification problem consists in extending a partial node labeling of a weighted graph to all of its nodes. In many real-world contexts, such as Gene Function Prediction, the partial labeling is unbalanced: positive labels are far fewer than negative ones. In this paper we present a new neural algorithm for predicting labels in the presence of label imbalance. The algorithm is based on a family of Hopfield networks described by two continuous parameters and one discrete parameter, and it consists of two main steps: 1) the network parameters are learned through a cost-sensitive optimization procedure based on local search; 2) a suitable Hopfield network restricted to the unlabeled nodes is considered and simulated. The equilibrium point reached induces the classification of the unlabeled nodes. An experimental analysis on real-world unbalanced data in the context of genome-wide prediction of gene functions shows the effectiveness of the proposed approach.
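
To make step 2 concrete, the following is a minimal sketch of asynchronous Hopfield dynamics restricted to the unlabeled nodes. It is an illustration only and not the authors' implementation: the abstract does not specify the network parameters, so the function name `simulate_restricted_hopfield`, the two activation values `pos_val` and `neg_val`, the `threshold`, and the zero initial state of unlabeled neurons are all assumptions, and the parameter learning of step 1 is taken as already done.

```python
import numpy as np

def simulate_restricted_hopfield(W, labels, pos_val=1.0, neg_val=-1.0,
                                 threshold=0.0, max_sweeps=100):
    """Hypothetical sketch: run Hopfield updates on unlabeled nodes only.

    W      : symmetric weight matrix with zero diagonal (graph edge weights)
    labels : +1 (positive), -1 (negative), 0 (unlabeled) for each node
    Returns the predicted labels (+1/-1) and the indices of the unlabeled nodes.
    """
    W = np.asarray(W, dtype=float)
    labels = np.asarray(labels)
    unlabeled = np.flatnonzero(labels == 0)

    # Labeled neurons are clamped to their activation value and never updated.
    state = np.where(labels > 0, pos_val, neg_val).astype(float)
    # Assumption: unlabeled neurons start from a neutral state of 0.
    state[unlabeled] = 0.0

    for _ in range(max_sweeps):
        changed = False
        for i in unlabeled:
            # Local field: weighted sum of neighbor states minus the threshold.
            field = W[i] @ state - threshold
            new_value = pos_val if field > 0 else neg_val
            if new_value != state[i]:
                state[i] = new_value
                changed = True
        if not changed:  # no neuron flipped in a full sweep: equilibrium
            break

    predictions = np.where(state[unlabeled] == pos_val, 1, -1)
    return predictions, unlabeled

# Toy graph: node 2 is tied to the positive node 0, node 3 to the negative node 1.
W = np.array([[0.0, 0.0, 1.0, 0.0],
              [0.0, 0.0, 0.0, 1.0],
              [1.0, 0.0, 0.0, 0.1],
              [0.0, 1.0, 0.1, 0.0]])
labels = np.array([1, -1, 0, 0])
predictions, unlabeled = simulate_restricted_hopfield(W, labels)
```

In this sketch the labeled nodes act as a fixed external input to the restricted network. Under label imbalance, the choice of activation values and threshold would matter considerably, which is presumably what the cost-sensitive learning in step 1 addresses.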

Author information

Authors and Affiliations

Dipartimento di Scienze dell’Informazione, Università degli Studi di Milano, via Comelico 39, 20135, Milano, Italy
Marco Frasca, Alberto Bertoni & Andrea Sion

Corresponding author

Correspondence to Marco Frasca.

Editor information

Editors and Affiliations

Dept of Computer Science, Milano University, Via Comelico 39, Milano, 20135, Italy
Bruno Apolloni
Dept of Computer Science, Milano University, Via Comelico 39, Milano, 20135, Italy
Simone Bassis
Dept. of Psychology, Second University of Naples, Via Vivaldi 43, Caserta, 81100, Salerno, Italy
Anna Esposito
Dipartimento di Meccanica e Materiali, University Mediterranea of Reggio, Via Graziella - Feo di Vito, Reggio Calabria, 89124, Reggio Calabria, Italy
Francesco Carlo Morabito

About this chapter

Cite this chapter

Frasca, M., Bertoni, A., Sion, A. (2013). A Neural Procedure for Gene Function Prediction. In: Apolloni, B., Bassis, S., Esposito, A., Morabito, F. (eds) Neural Nets and Surroundings. Smart Innovation, Systems and Technologies, vol 19. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35467-0_19

DOI: https://doi.org/10.1007/978-3-642-35467-0_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35466-3
Online ISBN: 978-3-642-35467-0
eBook Packages: Engineering, Engineering (R0)
