Abstract
We study the PageRank mass of principal components in a bow-tie Web Graph, as a function of the damping factor c. Using a singular perturbation approach, we show that the PageRank share of IN and SCC components remains high even for very large values of the damping factor, in spite of the fact that it drops to zero when c→1. However, a detailed study of the OUT component reveals the presence of “dead-ends” (small groups of pages linking only to each other) that receive an unfairly high ranking when c is close to one. We argue that this problem can be mitigated by choosing c as small as 1/2.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: Bringing order to the Web. Technical report, Stanford University (1998)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. Journal of the ACM 46(5), 604–632 (1999)
Lempel, R., Moran, S.: The stochastic approach for link-structure analysis (SALSA) and the TKC effect. Comput. Networks 33(1-6), 387–401 (2000)
Langville, A.N., Meyer, C.D.: Deeper inside PageRank. Internet Math. 1, 335–380 (2003)
Langville, A.N., Meyer, C.D.: Google’s PageRank and beyond: the science of search engine rankings. Princeton University Press, Princeton, NJ (2006)
Boldi, P., Santini, M., Vigna, S.: PageRank as a function of the damping factor. In: Proc. of the Fourteenth International World Wide Web Conference, Chiba, Japan, ACM Press, New York (2005)
Avrachenkov, K., Litvak, N.: The effect of new links on Google PageRank. Stoch. Models 22(2), 319–331 (2006)
Bianchini, M., Gori, M., Scarselli, F.: Inside PageRank. ACM Trans. Inter. Tech. 5(1), 92–128 (2005)
Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Statac, R., Tomkins, A., Wiener, J.: Graph structure in the Web. Computer Networks 33, 309–320 (2000)
Kumar, R., Raghavan, P., Rajagopalan, S., Sivakumar, D., Tomkins, A., Upfal, E.: The Web as a graph. In: PODS 2000. Proc. 19th ACM SIGACT-SIGMOD-AIGART Symp. Principles of Database Systems, pp. 1–10. ACM Press, New York (2000)
Avrachenkov, K.: Analytic Perturbation Theory and its Applications. PhD thesis, University of South Australia (1999)
Korolyuk, V.S., Turbin, A.F.: Mathematical foundations of the state lumping of large systems. Mathematics and its Applications, vol. 264. Kluwer Academic Publishers, Dordrecht (1993)
Pervozvanskii, A.A., Gaitsgori, V.G.: Theory of Suboptimal Decisions. Mathematics and its Applications (Soviet Series), vol. 12. Kluwer Academic Publishers, Dordrecht (1988)
Yin, G.G., Zhang, Q.: Discrete-time Markov chains. Applications of Mathematics (New York), vol. 55. Springer, New York (2005)
Chen, P., Xie, H., Maslov, S., Redner, S.: Finding scientific gems with Google. Arxiv preprint Physics 0604130 (2006)
Avrachenkov, K., Litvak, N., Pham, K.: Distribution of PageRank mass among principle components of the Web. Arxiv preprint CS 0709.2016 (2007)
Dill, S., Kumar, R., McCurley, K.S., Rajagopalan, S., Sivakumar, D., Tomkins, A.: Self-similarity in the Web. ACM Trans. Inter. Tech. 2(3), 205–223 (2002)
Haveliwala, T.: Topic-sensitive PageRank: A context-sensitive ranking algorithm for Web search. IEEE Transactions on Knowledge and Data Engineering 15(4), 784–796 (2003)
Moler, C., Moler, K.: Numerical Computing with MATLAB. In: SIAM (2003)
Eiron, N., McCurley, K., Tomlin, J.: Ranking the Web frontier. In: WWW 2004: Proceedings of the 13th international conference on World Wide Web, pp. 309–318. ACM Press, New York (2004)
Seneta, E.: Non-negative Matrices and Markov Chains. Springer Series in Statistics. Springer, New York, Revised reprint of the second (1981) edition [Springer-Verlag, New York MR0719544] (2006)
Fortunato, S., Flammini, A.: Random walks on directed networks: the case of PageRank. Arxiv preprint Physics 0604203 (2006)
Avrachenkov, K., Litvak, N., Nemirovsky, D., Osipova, N.: Monte Carlo methods in PageRank computation: When one iteration is sufficient. SIAM J. Numer. Anal. 45(2), 890–904 (2007)
Berkhin, P.: A survey on PageRank computing. Internet Math. 2, 73–120 (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Avrachenkov, K., Litvak, N., Pham, K.S. (2007). Distribution of PageRank Mass Among Principle Components of the Web. In: Bonato, A., Chung, F.R.K. (eds) Algorithms and Models for the Web-Graph. WAW 2007. Lecture Notes in Computer Science, vol 4863. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77004-6_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-77004-6_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77003-9
Online ISBN: 978-3-540-77004-6
eBook Packages: Computer ScienceComputer Science (R0)