Abstract
In this paper, we present a link oriented measuring method to discriminate the manipulated web pages effectively. We define the label of an edge as having a link context and a similarity measure between link context and target page. By suggesting an assessing measure based on singular value decomposition, it is explained that our proposed method can effectively detect the manipulated web pages. We, however, extend the SVD as an assessment measure to detect the rank-manipulated pages. In the experiment, the LOD method reduced about 17% amount of the rank that is minimum 209.4% higher than not manipulated web pages. Using this proposed approach, the chance of manipulated web pages getting high ranks than deserved can be discriminated effectively.
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Gyöngyi, Z., Garcia-Molina, H., Pedersen, J.: Combating web Spam with TrustRank. In: Proc. VLDB, pp. 576–587 (2004)
Lee, W., Shin, K., Kang, S.: Structuring Web with Semantic Hypertext Algorithm. In: Proc. CITSA, Florida, pp. 257–262 (2004)
Halkida, M., Nguyen, B., Varlamis, I., Vazirgiannis, M.: THESUS: Organizing web document collections based on link semantics. The VLDB Journal (12), 320–332 (2003)
Arasu, A., Cho, J., Garcia-Molina, H., Paepcke, A., Rachavan, S.: Searching the web. ACM Transactions on Internet Technology 1(1), 2–43 (2001)
Miller, J., Rae, G., Schaefer, F.: Modifications of Kleinberg’s HITS Algorithm Using Matrix Exponentiation and web Log Records. In: Proc. ACM SIGIR, pp. 444–445 (2001)
Haveliwala, T.: Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for web Search. IEEE TKDE 15(4), 784–796 (2003)
Brin, S., Page, L.: The Anatomy of a Large-Scale Hypertextual web Search Engine. In: Proc. WWW, pp. 107–117 (1998)
Wang, Y., DeWitt, D.: Computing PageRank in a Distributed Internet Search Engine System. In: Proc. VLDB, pp. 420–431 (2004)
Caldo, P., Ribeiro-Neto, B., Ziviani, N.: Local versus Global Link Information in the web. ACM TOIS 21(1), 42–63 (2003)
Phelps, T., Wilensky, R.: Robust Hyperlinks: Cheap, Everywhere, Now. In: Proc. DDEP/PODDP, pp. 28–43 (2000)
Lu, W., Chien, L., Lee, H.: Anchor Text Mining for Translation of web Queries. In: Proc. ICDM, pp. 401–408 (2001)
Gentle, J.: Singular Value Factorization. In: Numerical Linear Algebra for Applications in Statistics, pp. 102–103. Springer, Heidelberg (1998)
Castelli, V., Thomasian, A., Li, C.: CSVD: Clustering and Singular Value De-composition for Approximate Similarity Search in High-Dimensional Spaces. IEEE TKDE 15(3), 671–685 (2003)
Elmroth, E., Gustavson, F.: Applying recursion to serial and parallel QR factorization leads to better performance. IBM Journal of R&D 44(4), 605–624 (2000)
Pandurangan, G., Raghavan, P., Upfal, E.: Using PageRank to Characterize Web Structure. In: Ibarra, O.H., Zhang, L. (eds.) COCOON 2002. LNCS, vol. 2387, pp. 330–339. Springer, Heidelberg (2002)
Lee, W., Kim, J.: Structuring the Web to Cope with Dynamic Changes. In: ICWS (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, W. (2005). Discriminating Biased Web Manipulations in Terms of Link Oriented Measures. In: Yolum, p., Güngör, T., Gürgen, F., Özturan, C. (eds) Computer and Information Sciences - ISCIS 2005. ISCIS 2005. Lecture Notes in Computer Science, vol 3733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11569596_61
Download citation
DOI: https://doi.org/10.1007/11569596_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29414-6
Online ISBN: 978-3-540-32085-2
eBook Packages: Computer ScienceComputer Science (R0)