Abstract
We present a new approach to factor analysis of three-way binary data, i.e., data given by a three-dimensional binary matrix I that describes a relationship between objects, attributes, and conditions. The problem is to find a decomposition of I into three binary matrices, an object-factor matrix A, an attribute-factor matrix B, and a condition-factor matrix C, with the number of factors as small as possible. The setting resembles that of decomposition-based methods for analyzing three-way data; it differs in the composition operator and in the constraint that A, B, and C be binary. We show that triadic concepts of I, developed within formal concept analysis, provide optimal decompositions. We present an example demonstrating the usefulness of the decompositions. Since finding optimal decompositions is NP-hard, we propose a greedy algorithm for computing suboptimal decompositions and evaluate its performance.
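To make the decomposition concrete, the following minimal Python sketch composes three small binary factor matrices into a three-way binary array. The abstract does not spell out the composition operator, so the sketch assumes the Boolean product commonly used in Boolean tensor factorization: entry (i, j, t) of I is 1 exactly when at least one factor applies to object i, attribute j, and condition t simultaneously. The function name `boolean_compose` and the toy matrices are illustrative assumptions, not taken from the paper.

```python
def boolean_compose(A, B, C):
    """Compose binary factor matrices into a three-way binary array.

    Assumed operator (not stated in the abstract):
    I[i][j][t] = 1 iff some factor l has A[i][l] = B[j][l] = C[t][l] = 1.
    """
    n_factors = len(A[0])
    return [
        [
            [int(any(a[l] and b[l] and c[l] for l in range(n_factors)))
             for c in C]          # conditions
            for b in B            # attributes
        ]
        for a in A                # objects
    ]


# Made-up example: 3 objects, 2 attributes, 2 conditions, 2 factors.
A = [[1, 0], [1, 1], [0, 1]]  # object-factor matrix
B = [[1, 0], [0, 1]]          # attribute-factor matrix
C = [[1, 1], [0, 1]]          # condition-factor matrix

I = boolean_compose(A, B, C)
for i, object_slice in enumerate(I):
    print(f"object {i}: {object_slice}")
```

Under this assumption, each factor corresponds to a "box" of ones in I (a set of objects, a set of attributes, and a set of conditions), and I is the union of these boxes. Minimizing the number of factors then amounts to covering the ones of I with as few such boxes as possible.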
Cite this article
Belohlavek, R., Glodeanu, C. & Vychodil, V. Optimal Factorization of Three-Way Binary Data Using Triadic Concepts. Order 30, 437–454 (2013). https://doi.org/10.1007/s11083-012-9254-4
DOI: https://doi.org/10.1007/s11083-012-9254-4