Skip to main content

Hierarchical All-Pairs SimRank Calculation

  • Conference paper
  • First Online:
Database Systems for Advanced Applications (DASFAA 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13945))

Included in the following conference series:

Abstract

All-pairs SimRank calculation is a classic SimRank problem. However, all-pairs algorithms suffer from efficiency issues and accuracy issues. In this paper, we convert the non-linear simrank calculation into a new simple closed formulation of linear system. And we come up with a sequence of novel algorithms to efficiently solve the linear system with accuracy guarantees. To reduce the memory consumption and improve the computational efficiency, we build a hierarchical framework to calculate the all-pairs SimRank scores, which includes locally coarse calculation and globally refine calculation. We first solve the local linear systems generated from the subgraphs, then we refine the SimRank scores on the full graph from the residuals of the local structures. We also show that our algorithms outperform the state-of-the-art all-pairs SimRank computation algorithms on real graphs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://glaros.dtc.umn.edu/gkhome/views/metis.

  2. 2.

    http://snap.stanford.edu/data/index.html.

References

  1. Abbassi, Z., Mirrokni, V.S.: A recommender system based on local random walks and spectral methods. In: WebKDD, pp. 102–108. ACM (2007)

    Google Scholar 

  2. Fogaras, D., Rácz, B.: Scaling link-based similarity search. In: Proceedings of the 14th International Conference on World Wide Web, pp. 641–650 (2005)

    Google Scholar 

  3. Fujiwara, Y., Nakatsuji, M., Shiokawa, H., Onizuka, M.: Efficient search algorithm for simrank. In: ICDE 2013, Brisbane, Australia, 8–12 April 2013, pp. 589–600 (2013)

    Google Scholar 

  4. He, G., Feng, H., Li, C., Chen, H.: Parallel SimRank computation on large graphs with iterative aggregation. In: SIGKDD, Washington, DC, USA, 25–28 July 2010, pp. 543–552 (2010)

    Google Scholar 

  5. He, G., Li, C., Chen, H., Du, X., Feng, H.: Using graphics processors for high performance SimRank computation. IEEE Trans. Knowl. Data Eng. 24(9), 1711–1725 (2012)

    Article  Google Scholar 

  6. Jeh, G., Widom, J.: Simrank: a measure of structural-context similarity. In: Proceedings of the Eighth ACM SIGKDD, 23–26 July 2002, Edmonton, Alberta, Canada, pp. 538–543 (2002)

    Google Scholar 

  7. Jiang, M., Fu, A.W.C., Wong, R.C.W.: Reads: a random walk approach for efficient and accurate dynamic SimRank. PVLDB 10(9), 937–948 (2017)

    Google Scholar 

  8. Jin, R., Lee, V.E., Hong, H.: Axiomatic ranking of network role similarity. In: Proceedings of the 17th ACM SIGKDD, pp. 922–930. ACM (2011)

    Google Scholar 

  9. Li, C., et al.: Fast computation of SimRank for static and dynamic information networks. In: EDBT 2010, 13th International Conference on Extending Database Technology, Lausanne, Switzerland, 22–26 March 2010, Proceedings, pp. 465–476 (2010)

    Google Scholar 

  10. Lin, Z., Lyu, M.R., King, I.: MatchSim: a novel similarity measure based on maximum neighborhood matching. Knowl. Inf. Syst. 32(1), 141–166 (2012)

    Article  Google Scholar 

  11. Liu, Y., et al.: ProbeSim: scalable single-source and top-k SimRank computations on dynamic graphs. Proceed. VLDB Endow. 11(1), 14–26 (2017)

    Article  Google Scholar 

  12. Lizorkin, D., Velikhov, P., Grinev, M.N., Turdakov, D.: Accuracy estimate and optimization techniques for SimRank computation. PVLDB 1(1), 422–433 (2008)

    Google Scholar 

  13. Lizorkin, D., Velikhov, P., Grinev, M.N., Turdakov, D.: Accuracy estimate and optimization techniques for SimRank computation. VLDB J. 19(1), 45–66 (2010)

    Article  Google Scholar 

  14. Lu, J., Gong, Z., Lin, X.: A novel and fast SimRank algorithm. IEEE Trans. Knowl. Data Eng. 29(3), 572–585 (2017)

    Article  Google Scholar 

  15. Luo, X., Gao, J., Zhou, C., Yu, J.X.: UniWalk: unidirectional random walk based scalable SimRank computation over large graph. In: 2017 IEEE 33rd International Conference on Data Engineering (ICDE), pp. 325–336. IEEE (2017)

    Google Scholar 

  16. Shao, Y., Cui, B., Chen, L., Liu, M., Xie, X.: An efficient similarity search framework for SimRank over large dynamic graphs. Proceed. VLDB Endow. 8(8), 838–849 (2015)

    Article  Google Scholar 

  17. Spirin, N., Han, J.: Survey on web spam detection: principles and algorithms. ACM SIGKDD Explorations Newsl 13(2), 50–64 (2012)

    Article  Google Scholar 

  18. Tian, B., Xiao, X.: SLING: a near-optimal index structure for SimRank. In: Proceedings of the 2016 International Conference on Management of Data, SIGMOD Conference 2016, San Francisco, CA, USA, 26 June - 01 July 2016, pp. 1859–1874 (2016)

    Google Scholar 

  19. Wang, Y., Lian, X., Chen, L.: Efficient SimRank tracking in dynamic graphs. In: 2018 IEEE 34th International Conference on Data Engineering (ICDE), pp. 545–556. IEEE (2018)

    Google Scholar 

  20. Wei, Z., et al.: PRSim: Sublinear time SimRank computation on large power-law graphs. In: Proceedings of the 2019 International Conference on Management of Data, pp. 1042–1059 (2019)

    Google Scholar 

  21. Yu, W., Lin, X., Le, J.: A space and time efficient algorithm for SimRank computation. In: Advances in Web Technologies and Applications, Proceedings of the 12th Asia-Pacific Web Conference, APWeb 2010, Busan, Korea, 6–8 April 2010, pp. 164–170 (2010)

    Google Scholar 

  22. Yu, W., Lin, X., Zhang, W.: Towards efficient SimRank computation on large networks. In: 29th IEEE International Conference on Data Engineering, ICDE 2013, Brisbane, Australia, 8–12 April 2013, pp. 601–612 (2013)

    Google Scholar 

  23. Yu, W., Lin, X., Zhang, W.: Fast incremental SimRank on link-evolving graphs. In: IEEE 30th International Conference on Data Engineering, Chicago, ICDE 2014, IL, USA, 31 March - 4 April 2014, pp. 304–315 (2014)

    Google Scholar 

  24. Yu, W., Lin, X., Zhang, W., McCann, J.A.: Dynamical SimRank search on time-varying networks. VLDB J.-Int. J. Very Large Data Bases 27(1), 79–104 (2018)

    Article  Google Scholar 

  25. Yu, W., McCann, J.A.: Sig-SR: SimRank search over singular graphs. In: The 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2014, Gold Coast, QLD, Australia, 06-11 July 2014, pp. 859–862 (2014)

    Google Scholar 

  26. Yu, W., McCann, J.A.: High quality graph-based similarity search. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, Santiago, Chile, 9–13 August 2015, pp. 83–92 (2015)

    Google Scholar 

  27. Yu, W., Zhang, W., Lin, X., Zhang, Q., Le, J.: A space and time efficient algorithm for SimRank computation. World Wide Web 15(3), 327–353 (2012)

    Article  Google Scholar 

  28. Zhang, Y., Li, C., Chen, H., Sheng, L.: Fast SimRank computation over disk-resident graphs. In: Meng, W., Feng, L., Bressan, S., Winiwarter, W., Song, W. (eds.) DASFAA 2013. LNCS, vol. 7826, pp. 16–30. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37450-0_2

    Chapter  Google Scholar 

Download references

Acknowledgements

This work is supported by National Natural Science Foundation of China under the grant No. 62072460, 62076245, 62172424, 62276270, and Beijing Natural Science Foundation (4212022).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cuiping Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, L., Li, C., Zhang, X., Chen, H. (2023). Hierarchical All-Pairs SimRank Calculation. In: Wang, X., et al. Database Systems for Advanced Applications. DASFAA 2023. Lecture Notes in Computer Science, vol 13945. Springer, Cham. https://doi.org/10.1007/978-3-031-30675-4_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-30675-4_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-30674-7

  • Online ISBN: 978-3-031-30675-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics