Skip to main content

On the Detection of Significant Pairwise Interactions in Complex Systems

  • Conference paper
  • First Online:
Artificial Life and Evolutionary Computation (WIVACE 2023)

Abstract

Many systems in nature, society and technology are complex systems, i.e., they are composed of numerous parts that interact in a non-linear way giving rise to positive and negative feedback. The dynamic organization of these systems often allows the emergence of intermediate structures that once formed profoundly influence the system and therefore play a key role in understanding its behavior. In the recent past our group has devised an effective method for identifying groups of interacting variables within a system, based on their observation. The result is a set of entities, each of which connects two or more nodes of the system: this result can therefore be represented by a hypergraph, which can be of considerable use for understanding the system under consideration. In particular, we use an index that allows us to evaluate the level of integration of a group of variables. In order for a group to be identified as significant, the value of this index must exceed a threshold that corresponds (under appropriate hypotheses) to a level of statistical significance decided by the user. In this work we propose a more elaborate approach to determining the significance threshold, which is (i) in itself theoretically interesting and (ii) of considerable practical utility. We use the new approach to determine collections of pairwise relationships in meaningful cases, such as relationships in gene regulatory networks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    In the case of pairs of variables, this measure can be declined as mutual information.

  2. 2.

    It can be noted that the proposed approach is asymmetric with respect to the “roles” of A (copied variable) and B (variable that can copy A with probability Pc). Leaving the possible exploitation of this asymmetry to further works, in this paper for each analyzed pair we will test both roles, and we will use the threshold with the highest resulting value.

References

  1. Bar-Yam, Y., McKay, S.R., Christian, W.: Dynamics of complex systems (studies in nonlinearity). Comput. Phys. 12, 335–336 (1988)

    Article  Google Scholar 

  2. Lane, D.: Hierarchy, complexity, society. In: Hierarchy in Natural and Social Sciences, vol. 3, pp. 81–119, Springer, Dordrecht (2006). https://doi.org/10.1007/1-4020-4127-6_5

  3. Torres, L., Blevins, A.S., Bassett, D., Eliassi-Rad, T.: The why, how, and when of representations for complex systems. SIAM Rev. 63(3), 435–485 (2021)

    Google Scholar 

  4. Filisetti, A., Villani, M., Roli, A., Fiorucci, M., Serra, R.: Exploring the organisation of complex systems through the dynamical interactions among their relevant subsets. In: Proceedings of the ECAL 2015: the 13th European Conference on Artificial Life. ECAL 2015: the 13th European Conference on Artificial Life. York, UK, pp. 286–293 (2015)

    Google Scholar 

  5. Sani, L., D’Addese, G., Pecori, R., Mordonini, M., Villani, M., Cagnoni, S.: An integration-based approach to pattern clustering and classification. In: Ghidini, C., Magnini, B., Passerini, A., Traverso, P. (eds.) AI*IA 2018. LNCS (LNAI), vol. 11298, pp. 362–374. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-03840-3_27

    Chapter  Google Scholar 

  6. Villani, M., et al.: An iterative information-theoretic approach to the detection of structures in complex systems. Complexity 2018, 1–15 (2018)

    Google Scholar 

  7. D’Addese, G., Sani, L., La Rocca, L., Serra, R., Villani, M.: Asymptotic information-theoretic detection of dynamical organization in complex systems. Entropy 23(4), 398 (2021)

    Article  MathSciNet  Google Scholar 

  8. D’Addese, G., Casari, M., Serra, R., Villani, M.: A fast and effective method to identify relevant sets of variables in complex systems. Mathematics 9(9), 1022 (2021)

    Article  Google Scholar 

  9. Mooij, J.M., Peters, J., Janzing, D., Zscheischler, J., Schölkopf, B.: Distinguishing cause from effect using observational data: methods and benchmarks. J. Mach. Learn. Res. 17(1), 1103–1204 (2016)

    MathSciNet  Google Scholar 

  10. Igelström, E., Craig, P., Lewsey, J., Lynch, J., Pearce, A., Katikireddi, S.V.: Causal inference and effect estimation using observational data. J. Epidemiol. Community Health 76(11), 960–966 (2022)

    Article  Google Scholar 

  11. Roli, A., Villani, M., Caprari, R., Serra, R.: Identifying critical states through the relevance index. Entropy 19(2), 73 (2017)

    Article  Google Scholar 

  12. Wheeler, D.J., Chambers, D.S.: Understanding Statistical Process Control. SPC Press, Knoxville (1992)

    Google Scholar 

  13. Pukelsheim, F.: The three sigma rule. Am. Stat. 48(2), 88–91 (1994)

    Article  MathSciNet  Google Scholar 

  14. Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley, New York (1991)

    Google Scholar 

  15. Lachmann, A., Giorgi, F.M., Lopez, G., Califano, A.: ARACNe-AP: gene network reverse engineering through adaptive partitioning inference of mutual information. Bioinformatics 32, 2233–2235 (2016)

    Article  Google Scholar 

  16. Margolin, A.A., et al.: ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinform 7, 1–15 (2006)

    Article  Google Scholar 

  17. Eberwine, J., Sul, J.Y., Bartfai, T., Kim, J.: The promise of single-cell sequencing. Nat. Methods 11(1), 25–27 (2014)

    Article  Google Scholar 

  18. Gawad, C., Koh, W., Quake, S.R.: Single-cell genome sequencing: current state of the science. Nat. Rev. Genet. 17(3), 175–188 (2016)

    Article  Google Scholar 

  19. Pratapa, A., Jalihal, A.P., Law, J.N., Bharadwaj A., Murali. T.M.: Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data. Nat. Methods 17(2), 147–154 (2020)

    Google Scholar 

  20. Griffiths, A.J., Miller, J.H., Suzuki, D.T., Lewontin, W.C., Gelbart, W.M.: An Introduction to Genetic Analysis, 7th edn. W. H. Freeman, New York (2000)

    Google Scholar 

  21. Mendes, P., Sha, W., Ye, K.: Artificial gene networks for objective comparison of analysis algorithms. Bioinformatics 19(Suppl. S2), ii122–ii129 (2003)

    Google Scholar 

  22. Erdos, P., Renyi, A.: On random graphs. Publicationes Mathematicae 6, 290–297 (1959)

    Article  MathSciNet  Google Scholar 

  23. Barabasi, A.L., Albert, R.: Emergence of scaling in random networks. Science 286(5439), 509–512 (1999)

    Article  MathSciNet  Google Scholar 

  24. Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393(6684), 440–442 (1998)

    Article  Google Scholar 

  25. Newman, M.E.J.: The structure and function of complex networks. SIAM Rev. 45(2), 167–256 (2003)

    Article  MathSciNet  Google Scholar 

  26. Powers, D.M.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. Int. J. Mach. Learn. Technol. 2, 37–63 (2011)

    Google Scholar 

Download references

Funding

This research was funded by Università degli Studi di Modena e Reggio Emilia (FAR2023 project of the Department of Physics, Informatics and Mathematics). Financial support was provided also by the MUR-PRIN grant 2022 SMNNKY, CUP B53D23009470006.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marco Villani .

Editor information

Editors and Affiliations

Appendix A

Appendix A

The stochastic model presented in the paper can also be used to calculate the threshold in the case of three-level variables. To this aim, let us consider the ternary variables A and B and suppose that A takes value “0” with probability Pa0, “1” with probability Pa1 and “2” with probability Pa2, then B either copies A with probability Pc or takes value “0” with probability Pb0, “1” with probability Pb1 and “2” with probability Pb2.

In this case it is possible to directly calculate the entropy of the pair (A, B) as:

$$H\left(AB\right)=-\left({P}_{00}ln\left({P}_{00}\right)+{P}_{01}ln\left({P}_{01}\right)+{P}_{02}ln\left({P}_{02}\right)+{P}_{10}ln\left({P}_{10}\right)+{P}_{11}ln\left({P}_{11}\right)+{P}_{12}ln\left({P}_{12}\right)+{P}_{20}ln\left({P}_{20}\right)+{P}_{21}ln\left({P}_{21}\right)+{P}_{22}ln\left({P}_{22}\right)\right)$$

where:

$$ \left\{ {\begin{array}{*{20}l} {P_{00} = P_{a0} P_{c} + P_{a0} \left( {1 - P_{c} } \right)P_{b0} } \hfill \\ {\;\;\;\;\;P_{01} = P_{a0} \left( {1 - P_{c} } \right)P_{b1} } \hfill \\ {\;\;\;\;\;P_{02} = P_{a0} \left( {1 - P_{c} } \right)P_{b2} } \hfill \\ {\;\;\;\;\;P_{10} = P_{a1} \left( {1 - P_{c} } \right)P_{b0} } \hfill \\ {P_{11} = P_{a1} P_{c} + P_{a1} \left( {1 - P_{c} } \right)P_{b1} } \hfill \\ {\;\;\;\;\;P_{12} = P_{a1} \left( {1 - P_{c} } \right)P_{b2} } \hfill \\ {\;\;\;\;\;P_{20} = P_{a2} \left( {1 - P_{c} } \right)P_{b0} } \hfill \\ {\;\;\;\;\;P_{21} = P_{a2} \left( {1 - P_{c} } \right)P_{b1} } \hfill \\ {P_{22} = P_{a2} P_{c} + P_{a2} \left( {1 - P_{c} } \right)P_{b2} } \hfill \\ \end{array} } \right. $$

As before, we can calculate H(A) and H(B) from the marginal probabilities. It is then possible to directly calculate the integration I(SAB(Pa0, Pa1, Pa2, Pb0, Pb1, Pb2, Pc)) of the pair (A, B) and therefore the value of the zI index in case of n observations.

This leads us to the new threshold:

$${\theta }_{zI}\left({P}_{a0},{P}_{a1},{P}_{a2},{P}_{b0},{P}_{b1},{P}_{b2}{,P}_{c}\right)=max\left(3.0,\frac{2nI\left({S}_{AB}\left({P}_{a0},{P}_{a1},{P}_{a2},{P}_{b0},{P}_{b1},{P}_{b2}{,P}_{c}\right)\right)-{d}_{k}}{\sqrt{2{d}_{k}}}\right)$$

where n is the number of observations, and the probabilities Paj and Pbj(j = 0,1,2) can be estimated from data (thus making the threshold specific for the case under examination). Notice again that the threshold cannot be less than the normal value of 3.0, to avoid confusing noise with signal.

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fini, G., D’Addese, G., La Rocca, L., Villani, M. (2024). On the Detection of Significant Pairwise Interactions in Complex Systems. In: Villani, M., Cagnoni, S., Serra, R. (eds) Artificial Life and Evolutionary Computation. WIVACE 2023. Communications in Computer and Information Science, vol 1977. Springer, Cham. https://doi.org/10.1007/978-3-031-57430-6_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-57430-6_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-57429-0

  • Online ISBN: 978-3-031-57430-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics