Abstract
Bipartite data is common in data engineering and brings unique challenges, particularly when it comes to clustering tasks that impose strong structural assumptions. This work presents an unsupervised method for assessing similarity in bipartite data. The method is based on regular equivalence in graphs and uses spectral properties of a bipartite adjacency matrix to estimate similarity in both dimensions. The method is reflexive in that similarity in one dimension informs similarity in the other. The method also uses local graph transitivities, a contribution governed by its only free parameter. Reflexive regular equivalence can be used to validate assumptions of co-similarity, which are required but often untested in co-clustering analyses. The method is robust to noise and asymmetric data, making it particularly suited for cluster analysis and recommendation in data of unknown structure. (An extended preprint of this paper is available at arxiv.org/abs/1702.04956.)
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Borgatti, S.P.: 2-mode concepts in social network analysis. In: Encyclopedia of Complexity and System Science, vol. 6 (2009)
Breiger, R.L.: The duality of persons and groups. Soc. Forces 53, 181–190 (1974)
Codocedo, V., Napoli, A.: Lattice-based biclustering using partition pattern structures. In: ECAI 2014, pp. 213–218 (2014)
Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: Proceedings of the SIGKDD 2001, pp. 269–274 (2001)
Doreian, P., Batagelj, V., Ferligoj, A.: Generalized blockmodeling of two-mode network data. Soc. Netw. 26(1), 29–53 (2004)
Gerow, A., Lou, B., Duede, E., Evans, J.: Proposing ties in a dense hypergraph of academics. Social Informatics. LNCS, vol. 9471, pp. 209–226. Springer, Cham (2015). doi:10.1007/978-3-319-27433-1_15
Gnatyshak, D., Ignatov, D.I., Semenov, A., Poelmans, J.: Gaining insight in social networks with biclustering and triclustering. In: Aseeva, N., Babkin, E., Kozyrev, O. (eds.) BIR 2012. LNBIP, vol. 128, pp. 162–171. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33281-4_13
Leicht, E.A., Holme, P., Newman, M.E.: Vertex similarity in networks. Phys. Rev. E 73(2), 026120 (2006)
Pensa, R.G., Ienco, D., Meo, R.: Hierarchical co-clustering: off-line and incremental approaches. Data Min. Knowl. Disc. 28(1), 31–64 (2014)
Shi, F., Foster, J.G., Evans, J.A.: Weaving the fabric of science: dynamic network models of science’s unfolding structure. Soc. Netw. 43, 73–85 (2015)
Tanay, A., Sharan, R., Shamir, R.: Biclustering algorithms: a survey. Handb. Comput. Mol. Biol. 9(1–20), 122–124 (2005)
Teng, L., Tan, K.: Finding combinatorial histone code by semi-supervised biclustering. BMC Genom. 13(1), 301 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Gerow, A., Zhou, M., Matwin, S., Shi, F. (2017). Reflexive Regular Equivalence for Bipartite Data. In: Mouhoub, M., Langlais, P. (eds) Advances in Artificial Intelligence. Canadian AI 2017. Lecture Notes in Computer Science(), vol 10233. Springer, Cham. https://doi.org/10.1007/978-3-319-57351-9_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-57351-9_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57350-2
Online ISBN: 978-3-319-57351-9
eBook Packages: Computer ScienceComputer Science (R0)