Skip to main content
Log in

Sophistication vs Logical Depth

  • Published:
Theory of Computing Systems Aims and scope Submit manuscript

Abstract

Sophistication and logical depth are two measures that express how complicated the structure in a string is. Sophistication is defined as the minimal complexity of a computable function that defines a two-part description for the string that is shortest within some precision; the second can be defined as the minimal computation time of a program that is shortest within some precision. We show that the Busy Beaver function of the sophistication of a string exceeds its logical depth with logarithmically bigger precision, and that logical depth exceeds the Busy Beaver function of sophistication with logarithmically bigger precision. We also show that sophistication is unstable in its precision: constant variations can change its value by a linear term in the length of the string.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

Similar content being viewed by others

Notes

  1. In fact the proof only requires that U and V are optimal, i.e. for all machines W there exist c W such that C U (x) ≤ C W (x)+c W and similarly for V.

  2. For any ε> 0, we can replace the term \(\tfrac {3}{4}|x|\) by (1−ε)|x| if the significance of the second sophistication term is replaced by \(c + O(\log (c/\varepsilon ))\).

  3. On the other hand, any set must have non-typical elements unless the set contains a lot of mutual information with the Halting problem [17].

  4. The Kolmogorov complexity of a set is the length of a shortest program that prints all its elements and halts.

  5. Partition the set in subsets of size at most 2jk, this increases the complexity of the x-containing set by at most k.

  6. Theorem IV.4 in [14] states that every decreasing function f is \((C(f)+O(\log |x|))\)-close to the function

    $$\lambda_{x}(k) = \min \left\{ C(S) + \log |S|: S \ni x \wedge C(x) \le k \right\} $$

    of some string of length f(0). (We use plain complexity in the definition of λ x , because all results hold up to \(O(\log |x|)\) terms). For fixed x, the function λ x is the inverse of \(\text {soph}_{x}^{\text {Set}}\).

  7. This probabilistic sufficiency criterion was defined in [20] in terms of prefix-free complexity, because 2K(x|P) defines a probability distribution and hence, it is natural to compare it with P(x). Prefix complexity and plain complexity differ by at most \(O(\log |x|)\) [12], and this precision is sufficient for our discussion.

  8. It is unclear whether H(x) = K m(x) + O(1).

  9. The definition of total entropy used in [27,28] is K(P) + H(P). Notice that plain and prefix complexity are close (\(|K(P) - C(P)| \le O(\log C(P)\)). See also footnote 7.

  10. In fact, in [27] the precision for which these inequalities should hold is not discussed. Also, the authors suggest that the computation time of a program for P is bounded by some computable function. In [29] the first requirement c = δ|x| is chosen for some δ > 0 and in the second requirement a different parameter is chosen. Furthermore, P should be computable as a real function and no restrictions on the computation time are considered. Also, K(P) was replaced by K(P, H(P)).

  11. Indeed, if P is c-good then it is (2c)-sufficient. For the other direction, note that at most 2H(P) + c+1 elements satisfy \(\log (1/P(x)) \le \lceil H(P) \rceil + c\), and these elements can be computed given P and ⌈H(x)⌉≤C(x) + c ≤ |x| + c + O(1). Hence a c-good model defines a \((c+O(\log |x|))\)-sufficient set.

  12. The formulation of Theorem 3.2.2 in [30] uses I(x;H) = K(x)−K H(x) with K H(x) the Kolmogorov complexity on a machine that has an oracle for the Halting problem. To obtain Theorem 7 from this, use the folklore result: \(\text {depth}_{0}(x) \ge I(x;H) + O(\log I(x;H))\).

References

  1. Solomonoff, R.: A formal theory of inductive inference. Part I. Inf. Control. 7(1), 1–22 (1964)

    Article  MathSciNet  MATH  Google Scholar 

  2. Kolmogorov, A.: Three approaches to the quantitative definition of information. Problemy Peredachi Informatsii 1(1), 3–11 (1965)

    MathSciNet  MATH  Google Scholar 

  3. Chaitin, G.: On the length of programs for computing finite binary sequences. J. ACM 13(4), 547–569 (1966). ACM Press

    Article  MathSciNet  MATH  Google Scholar 

  4. Bennett, C.: Logical Depth and Physical Complexity, pp 227–257. Oxford University Press, Inc., New York (1988)

    MATH  Google Scholar 

  5. Antunes, L., Fortnow, L., van Melkebeek, D., Vinodchandran, N.V.: Computational depth: concept and applications. Theor. Comput. Sci. 354(3), 391–404 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  6. Koppel, M.: Complexity, depth, and sophistication. Complex Systems 1, 1087–1091 (1987)

    MathSciNet  MATH  Google Scholar 

  7. Juedes, D., Lathrop, J., Lutz, J.: Computational depth and reducibility. Theor. Comput. Sci. 132, 37–70 (1994)

    Article  MathSciNet  MATH  Google Scholar 

  8. Kolmogorov, A.N.: Talk in Information Theory Symposium. In: Tallinn, Estonia (1973)

  9. Kolmogorov, A.N.: Complexity of algorithms and objective definition of randomness. Uspekhi Mat. Nauk 29(4), 155 (1974)

    Google Scholar 

  10. Koppel, M.: Structure. In: Herken, R. (ed.) The Universal Turing Machine: a Half-Century Survey, 2nd edition, pp 403–419. Springer (1995)

  11. Koppel, M., Atlan, H.: An almost machine-independent theory of program-length complexity, sophistication, and induction. Inf. Sci. 56(1-3), 23–33 (1991). Elsevier Science Publishers Ltd.

    Article  MathSciNet  MATH  Google Scholar 

  12. Li, Ming, Vitányi, P.: An Introduction to Kolmogorov Complexity and its Applications. Springer (2008)

  13. Antunes, L., Fortnow, L.: Sophistication revisited. Theory of Computing Systems 45(1), 150–161 (2009). Springer

    Article  MathSciNet  MATH  Google Scholar 

  14. Vereshchagin, N., Vitanyi, P.: Kolmogorov’s structure functions and model selection. IEEE Trans. Inf. Theory 50, 3265–3290 (2004). Computer Society

    Article  MathSciNet  MATH  Google Scholar 

  15. Li, Ming, Vitányi, P.: An Introduction to Kolmogorov Complexity and Its Applications. Springer (1997)

  16. Cover, T., Gacs, P., Gray, R.M.: Kolmogorov’s contributions to information theory and algorithmic complexity. Ann. Probab. 17(3), 840–865 (1989)

    Article  MathSciNet  MATH  Google Scholar 

  17. Epstein, S., Levin, L.: On sets of high complexity strings. CoRR, arXiv:abs/1107.1458 (2011)

  18. Shen, A.: The concept of (alpha, beta)-stochasticity in the kolmogorov sense and its properties. Soviet Mathematics Doklady 28(1), 295–299 (1983)

    MATH  Google Scholar 

  19. Vyugin, V.: On the defect of randomness of a finite object with respect to measures with given complexity bounds. Theory Prob. Appl. 32(3), 508–512 (1987)

    Article  Google Scholar 

  20. Gács, P., Tromp, J., Vitányi, P.: Algorithmic statistics. IEEE Trans. Inform. Theory 47(6), 2443–2463 (2001)

    Article  MathSciNet  MATH  Google Scholar 

  21. Cover, T.: The Impact of Processing Techniques on Communications. chapter Kolmogorov Complexity, Data Compression and Inference., pages 23–33. J. Skwyrzynski. Martinus Nijhoff Publishers (1985)

  22. Cover, T., Joy, T.: Elements of Information Theory. Wiley (1991)

  23. Vitányi, P.: Meaningful information. IEEE Trans. Inf. Theory 52(10), 4617–4626 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  24. Shen, A.: Algorithmic statistics: main results. In preparation (2013)

  25. Vereshchagin, N.: Algorithmic minimal sufficient statistic revisited. Mathematical Theory and Computational Practice, 478–487 (2009)

  26. Verschagin, N.: On Algorithmic Strong Sufficient Statistics. In: Proceedings of Computability in Europe (2013)

  27. Gell-Mann, M., Lloyd, S.: Information measures, effective complexity, and total information. Complexity 2(1), 44–52 (1998)

    Article  MathSciNet  MATH  Google Scholar 

  28. Gell-Mann, M., Lloyd, S.: Effective complexity. Nonextensive entropy, 387–398 (2004)

  29. Nihat, A., Muller, M., Szkola, A.: Effective complexity and its relation to logical depth. IEEE Trans. Inf. Theory 56(9), 4593–4607 (2010)

    Article  MathSciNet  Google Scholar 

  30. Bauwens, B.: Computability in statistical hypotheses testing, and characterizations of independence and directed influences in time series using Kolmogorov complexity. PhD thesis, Ugent (2010)

  31. Vereshchagin, N., Shen, A.: Algorithmic statistics revisited (2015). arXiv preprint arXiv:1504.04950

Download references

Acknowledgments

The authors are grateful to the anonymous reviewers and to to Alexander Shen and Nikolay Vereshchagin for useful comments and discussions. This work was partially supported by the national science foundation: Fundação para a Ciência e Tecnologia, through the scholarships SFRH/BPD/76231/2011, SFRH/BPD/75129/2010 and SFRH/BD/33234/2007, and grants of Instituto de Telecomunicações. The first author is partially funded by the ERDF through the COMPETE 2020 Programme within project POCI-01-0145-FEDER-006961, and by National Funds through the FCT as part of project UID/EEA/50014/2013. The second author was also supported by NAFIT ANR-08-EMER-008-01 project.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luís Antunes.

Appendices

Appendix A: Machine Invariance of Logical Depth

Lemma 7

For all universal Turing machines U and V, there exist a constant \(c^{\prime }\) such that for all c and x: depth c,U (x)≥|x| [no Busy Beaver here!] implies

$$\text{depth}_{c+c^{\prime},V}^{bb}(x) \le \text{depth}_{c,U}^{bb}(x) + c^{\prime}. $$

Note that for some universal machines there exist a string w such that U(w x) = x for all x and the computation requires at most O(1) steps. For such machines U we have depth|w x|(x) ≤ O(1) and hence \(\text {depth}_{|wx|}^{bb}(x) \le O(1)\). Other universal machines always scan the input, and on such machines we have \(\text {depth}_{|wx|}^{bb}(x) \ge bb(|x|) - O(1)\) for all x. Hence, the assumption in the lemma is necessary.

Proof

Let w V be the prefix such that V(w V p) simulates U(p) for all p. Our result would follow easily if we assume that for any halting programs p, q on U such that time(p) ≤ time(q) we have time(w V p) ≤ time(w V q) on V; i.e. simulating U on V preserves the order of computation time. Indeed, any pair (p, q) usable in the definition of depth on U defines a pair (w V p, w V q) that can be used in the definition of depth on V. The program w V p is minimal on V within c+|w V |+|w U | error (where w U is the string that allows to simulate U on V). Hence, the pair (w V p, w V q) witnesses an increase of sophistication by at most |w V | for an increase of the significance of at most c+|w V |+|w U |.

In the case where the assumption is not true, we need to find \(c^{\prime }\) and a program of length at most \(|q|+c^{\prime }\) on V that computes longer than time(w V p) (where \(c^{\prime }\) does not depend on p, q, c). Consider the following algorithm on input q: determine all programs p that have running time at most time(q) on U, determine for all these p’s the maximal running time T of a program w V p on V (assume for now that for finite time(q) there are finitely many such p), and finally print a string of length T. For (p, q) usable in the definition of depth c, U (x), the algorithm with input q produces an output longer than time(w V p), and by universality there is a program of length \(|q|+c^{\prime }\) on V that prints this string and hence computes longer than T.

Above, we have assumed that only finitely many programs on U have a halting time at most time(q) for halting q. This assumption is not true in general, but by the additional assumption of the lemma: depth c, U (x) ≥ |x|, it suffices to consider only a finite subset of candidates: we only need the pairs (p, q) on U such that |p|≤|x| + O(1) and |x|≤time(q), which implies |p|≤time(q) + O(1). The proof finishes by modifying the above algorithm such that it only considers programs p for which |p|≤time(q) + O(1). □

Recall that Bennett’s definition of logical depth is the minimal computation time of a program on a prefix-free machine W (of some type) that is c-incompressible. We show that when scaled by the inverse Busy Beaver function, both notions of depth are \(O(\log |x|)\)-close. On a prefix-free machine W, both (unscaled) depths are closely related: Bennett’s logical depth of x at significance c is at most depth c + O(1), W (x), because any c-shortest program p for x is c + O(1)-incompressible on W. On the other hand, by Lemma 5.3 of [7] (attributed to Bennett [4]), depth c + O(1)(x) is bounded by a computable function of Bennett’s logical depth of x with significance c. Hence, after rescaling with the inverse Busy Beaver function, both notions are O(1)-close. Exchanging prefix-free machine by a plain machine, both depth notions are \(O(\log |x|)\)-close; indeed this follows by the same argument as Lemma 7 for W = V and replacing |w V | by \(O(\log |x|)\)-terms in the proof (since \(|K_{W}(x)-C_{U}(x)| \le O(\log |x|)\)).

Appendix B: Alternative Proof of Theorem 1

An alternative proof for the second inequality in Theorem 1 is given: there exists e such that for all c and x with |x|≥e we have

$$\text{soph}_{c + e\log |x|}(x) \le \text{depth}_{c}^{bb}(x) + e\log |x|\,. $$

A prefix stable machine V is a plain machine such that for all strings p and extensions q of p: if p∈DomV then q∈DomV and V(p) = V(q). For (infinite) sequences ω let V(ω) be V(p) if a prefix p of ω exists such that V(p) is defined, and undefined otherwise. For any string or sequence ω, let 0.ω be the real \({\sum }_{i} \omega _{i} 2^{-i}\). A prefix stable machine is left computable [17] if for p such that V(p) is defined and for all q such that 0.q ≤ 0.p, also V(q) is defined. There are universal prefix stable machines that are left computable (just rearrange the programs on a universal machine). Let \(\Omega = \sup \{0.p: V(p) \textnormal {is defined} \}\).

In order to prove the result aforementioned, it is sufficient to show that \(\text {soph}_{c + 2\log |x|,U}(x) \le \text {depth}_{c,W}^{bb}(x) + 2\log |x|\) for large x, where W is a universal left computable machine. Indeed, there exists a universal plain machine U such that

$$\text{depth}_{c + 2\log |x|,W}^{bb}(x) \le \text{depth}_{c,U}^{bb}(x) + O(\log |x|). $$

(translating plain programs to self-delimiting ones can happen by affecting program sizes by at most \(O(\log |p|)\) and computation time by a computable function of |p| and the halting time).

Let p be a program satisfying the conditions in the definition of \(\text {depth}_{c}^{bb}(x)\). We show that the initial segment where p and Ω are equal defines a computable function that satisfies the conditions in the definition of sophistication. More precisely, let i be the length of the common initial segment, then \(F(d) = V(\Omega _{1}{\dots } \Omega _{i}0d)\) satisfies the conditions. Note that, \(p_{1}{\dots } p_{i} = \Omega _{1}{\dots } \Omega _{i-1}0\) and Ω i = 1 by construction of i. Thus \(F(p_{i+2}{\dots } p_{|p|}) = x\). For any d we have \(0.\Omega _{1}{\dots } \Omega _{i}0d < \Omega \) and by left computability this is in DomV, thus F is computable. It remains to show that \(C(F) \le \text {depth}_{c,W}(x)+O(\log \text {depth}_{c,W}(x))\). We show that \(C(\Omega _{1}{\dots } \Omega _{i-1}) \le \text {depth}_{c,W}(x) + O(\log i)\). In fact, given i and a t that exceeds the computation time of p, we can search for the maximal value 0.w for a program w that halts in t computation steps. We know that 0.p ≤ 0.w ≤ Ω, hence we can compute the first i−1 bits of Ω which completes the proof.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Antunes, L., Bauwens, B., Souto, A. et al. Sophistication vs Logical Depth. Theory Comput Syst 60, 280–298 (2017). https://doi.org/10.1007/s00224-016-9672-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00224-016-9672-6

Keywords

Navigation