An Efficient Algorithm for Solving Pseudo Clique Enumeration Problem

Uno, Takeaki

doi:10.1007/s00453-008-9238-3

An Efficient Algorithm for Solving Pseudo Clique Enumeration Problem

Published: 18 October 2008

Volume 56, pages 3–16, (2010)
Cite this article

Algorithmica Aims and scope Submit manuscript

Takeaki Uno¹

517 Accesses
53 Citations
Explore all metrics

Abstract

The problem of finding dense structures in a given graph is quite basic in informatics including data mining and data engineering. Clique is a popular model to represent dense structures, and widely used because of its simplicity and ease in handling. Pseudo cliques are natural extension of cliques which are subgraphs obtained by removing small number of edges from cliques. We here define a pseudo clique by a subgraph such that the ratio of the number of its edges compared to that of the clique with the same number of vertices is no less than a given threshold value. In this paper, we address the problem of enumerating all pseudo cliques for a given graph and a threshold value. We first show that it seems to be difficult to obtain polynomial time algorithms using straightforward divide and conquer approaches. Then, we propose a polynomial time, polynomial delay in precise, algorithm based on reverse search. The time complexity for each pseudo clique is O(Δlog |V|+min {Δ ²,|V|+|E|}). Computational experiments show the efficiency of our algorithm for both randomly generated graphs and practical graphs.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. In: Advances in Knowledge Discovery and Data Mining, pp. 307–328. MIT Press, Cambridge (1996)
Google Scholar
Aslam, J., Pelekhov, K., Rus, D.: A practical clustering algorithms for static and dynamic information organization. In: Symposium on Discrete Algorithms (SODA), vol. 99, pp. 51–60. Assoc. Comput. Mach., New York (1999)
Google Scholar
Arora, S., Karger, D., Karpinski, M.: Polynomial time approximation schemes for dense instances of NP-hard problems. In: Proceedings of ACM Symposium on Theory of Computing, pp. 284–293 (1995)
Asai, T., Abe, K., Kawasoe, S., Arimura, H., Sakamoto, H., Arikawa, S.: Efficient substructure discovery from large semi-structured data. In: Proceedings of SDM 2002 (2002)
Avis, D., Fukuda, K.: Reverse search for enumeration. Discrete Appl. Math. 65, 21–46 (1996)
Article MATH MathSciNet Google Scholar
Feige, U., Peleg, D., Kortsarz, G.: The dense k-subgraph problem. Algorithmica 29, 410–421 (2001)
Article MATH MathSciNet Google Scholar
Fukuda, K., Matsui, T.: Finding all minimum-cost perfect matchings in bipartite graphs. Networks 22, 461–468 (1992)
Article MATH MathSciNet Google Scholar
Fujisawa, K., Hamuro, Y., Katoh, N., Tokuyama, T., Yada, K.: Approximation of optimal two-dimensional association rules for categorical attributes using semidefinite programming. Lect. Notes Comput. Sci. 1721, 148–159 (1999)
Article Google Scholar
Gallo, G., Grigoriadis, M.D., Tarjan, R.E.: A fast parametric maximum flow algorithm and applications. SIAM J. Comput. 18, 30–55 (1989)
Article MATH MathSciNet Google Scholar
Garey, M.R., Johnson, D.S., Stockmeyer, L.: Some simplified NP-complete problems, In: Proceedings of ACM Symposium on Theory of Computing, pp. 47–63 (1974)
Gibson, D., Kumar, R., Tomkins, A.: Discovering large dense subgraphs in massive graphs. In: Proceedings of Very Large Data Bases Conference, pp. 721–732 (2005)
Haraguchi, M., Okubo, Y.: A method for clustering of web pages with pseudo-clique search. Lect. Notes Artif. Intell. 3847, 59–78 (2006)
Google Scholar
Hu, H., Yan, X., Huang, Y., Han, J., Zhou, X.J.: Mining coherent dense subgraphs across massive biological networks for functional discovery. Bioinformatics 21, 213–221 (2005)
Article Google Scholar
Kumar, R., Raghavan, P., Rajagopalan, S., Tomkins, A.: Extracting large-scale knowledge bases from the Web. In: Proceedings of Very Large Data Bases Conference, pp. 639–650 (1999)
Kumar, S.R., Raphavan, P., Rajagopalan, S., Tomkins, A.: Trawling the Web for emerging cyber communities. In: Proceedings of 8th International WWW Conference, pp. 1481–1493 (1999)
Makino, K., Uno, T.: New algorithms for enumerating all maximal cliques. Lect. Notes Comput. Sci. 3111, 260–272 (2004)
MathSciNet Google Scholar
Nakano, S., Uno, T.: Constant time generation of trees with specified diameter. Lect. Notes Comput. Sci. 3353, 33–45 (2004)
Article MathSciNet Google Scholar
Palla, G., Derenyi, I., Farkas, I., Vicsek, T.: Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043), 814–818 (2005)
Article Google Scholar
Read, R.C., Tarjan, R.E.: Bounds on backtrack algorithms for listing cycles, paths, and spanning trees. Networks 5, 237–252 (1975)
MATH MathSciNet Google Scholar
Tomita, E., Tanaka, A., Takahashi, H.: The worst-case time complexity for generating all maximal cliques and computational experiments. Theor. Comput. Sci. 363, 28–42 (2006)
Article MATH MathSciNet Google Scholar
Uno, T.: Two general methods to reduce delay and change of enumeration algorithms. National Institute of Informatics (in Japan) Technical Report, 004E (2003)
Uno, T.: An efficient algorithm for enumerating pseudo cliques. Lect. Notes Comput. Sci. 4835, 402–414 (2007)
Article MathSciNet Google Scholar
Uno, T., Kiyomi, M., Arimura, H.: LCM ver. 2: efficient mining algorithms for frequent/closed/maximal itemsets. In: Proceedings of IEEE ICDM’04 Workshop FIMI’04. Available at http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS//Vol-126/ (2004)
Warner, S.: E-prints and the open archives initiative. Libr. Hi Tech 21, 151–158 (2003)
Article Google Scholar
Zhang, Y., Chu, C.H., Ji, X., Zha, H.: Correlating summarization of multisource news with k way graph biclustering. SIGKDD Explor. 6, 34–42 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

National Institute of Informatics, 2-1-2, Hitotsubashi, Chiyoda-ku, Tokyo, 101-8430, Japan
Takeaki Uno

Authors

Takeaki Uno
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Takeaki Uno.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Uno, T. An Efficient Algorithm for Solving Pseudo Clique Enumeration Problem. Algorithmica 56, 3–16 (2010). https://doi.org/10.1007/s00453-008-9238-3

Download citation

Received: 30 November 2007
Accepted: 08 October 2008
Published: 18 October 2008
Issue Date: January 2010
DOI: https://doi.org/10.1007/s00453-008-9238-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Efficient Algorithm for Solving Pseudo Clique Enumeration Problem

Abstract

Access this article

Similar content being viewed by others

K-Clique-Graphs for Dense Subgraph Discovery

Fast Maximal Clique Enumeration for Real-World Graphs

Solving Clique Covering in Very Large Sparse Random Graphs by a Technique Based on k-Fixed Coloring Tabu Search

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An Efficient Algorithm for Solving Pseudo Clique Enumeration Problem

Abstract

Access this article

Similar content being viewed by others

K-Clique-Graphs for Dense Subgraph Discovery

Fast Maximal Clique Enumeration for Real-World Graphs

Solving Clique Covering in Very Large Sparse Random Graphs by a Technique Based on k-Fixed Coloring Tabu Search

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation