Abstract
In this paper, we will apply, to the task of detecting web spam, a combination of the best of its breed algorithms for processing graph domain input data, namely, probability mapping graph self organizing maps and graph neural networks. The two connectionist models are organized into a layered architecture, consisting of a mixture of unsupervised and supervised learning methods. It is found that the results of this layered architecture approach are comparable to the best results obtained so far by others using very different approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Gyöngyi, Z., Garcia-Molina, H., Pedersen, J.: Combating web spam with trustrank. In: Proceedings of the Thirtieth international conference on Very large data bases, vol. 30, p. 587. VLDB Endowment (2004)
Manning, C., Raghavan, P., Schütze, H.: An introduction to information retrieval. Cambridge University Press, Cambridge (2008)
Brin, S., Page, L., Motwani, R., Winograd, T.: The PageRank citation ranking: Bringing order to the Web. Technical Report 1999-66, Stanford University (1999), http://dbpubs.stanford.edu:8090/pub/1999-66
Bianchini, M., Gori, M., Scarselli, F.: Inside pagerank. ACM Transactions on Internet Technology (TOIT) 5(1), 92–128 (2005)
Gyöngyi, Z., Garcia-Molina, H.: Web spam taxonomy. In: Adversarial Information Retrieval on the Web (2005)
Scarselli, F., Gori, M., Tsoi, A., Hagenbuchner, M., Monfardini, G.: The graph neural network model. IEEE Transactions on Neural Networks 20(1), 61–80 (2009)
Hagenbuchner, M., Zhang, S., Tsoi, A., Sperduti, A.: Projection of undirected and nonpositional graphs using self organizing maps. In: European Symposium on Artificial Neural Networks-Advances in Computational Intelligence and Learning, pp. 22–24 (April 2009)
Scarselli, F., Gori, M., Tsoi, A., Hagenbuchner, M., Monfardini, G.: Computational capabilities of graph neural networks. IEEE Transactions on Neural Networks 20(1), 81–102 (2009)
Frasconi, P., Gori, M., Sperduti, A.: A general framework for adaptive processing of data structures. IEEE Transactions on Neural Networks 9(5), 768–786 (1998)
Hagenbuchner, M., Sperduti, A., Tsoi, A.: A self-organizing map for adaptive processing of structured data. IEEE Transactions on Neural Networks 14(3), 491–505 (2003)
Kohonen, T.: Self-organization and associative memory. Springer Information Sciences Series (1989)
Khamsi, M.A.: An Introduction to Metric Spaces and Fixed Point Theory. John Wiley & Sons Inc., Chichester (2001)
Almeida, L.: A learning rule for asynchronous perceptrons with feedback in a combinatorial environment. In: Caudill, M., Butler, C. (eds.) IEEE International Conference on Neural Networks, San Diego, vol. 2, pp. 609–618. IEEE, New York (1987)
Pineda, F.: Generalization of back–propagation to recurrent neural networks. Physical Review Letters 59, 2229–2232 (1987)
Riedmiller, M., Braun, H.: RPROP-A fast adaptive learning algorithm. In: Proc. of ISCIS VII, Universitat (1992)
Castillo, C., Donato, D., Gionis, A., Murdock, V., Silvestri, F.: Know your neighbors: web spam detection using the web topology. In: SIGIR 2007: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 423–430. ACM, New York (2007)
Gyöngyi, Z., Garcia-Molina, H., Pedersen, J.: Combating web spam with trustrank. In: VLDB Conference (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Di Noi, L., Hagenbuchner, M., Scarselli, F., Tsoi, A.C. (2010). Web Spam Detection by Probability Mapping GraphSOMs and Graph Neural Networks. In: Diamantaras, K., Duch, W., Iliadis, L.S. (eds) Artificial Neural Networks – ICANN 2010. ICANN 2010. Lecture Notes in Computer Science, vol 6353. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15822-3_45
Download citation
DOI: https://doi.org/10.1007/978-3-642-15822-3_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15821-6
Online ISBN: 978-3-642-15822-3
eBook Packages: Computer ScienceComputer Science (R0)