Sophistication vs Logical Depth

Antunes, Luís; Bauwens, Bruno; Souto, André; Teixeira, Andreia

doi:10.1007/s00224-016-9672-6

Sophistication vs Logical Depth

Published: 19 March 2016

Volume 60, pages 280–298, (2017)
Cite this article

Theory of Computing Systems Aims and scope Submit manuscript

Luís Antunes¹,
Bruno Bauwens²,
André Souto^3,4 &
…
Andreia Teixeira^5,6

340 Accesses
4 Citations
3 Altmetric
Explore all metrics

Abstract

Sophistication and logical depth are two measures that express how complicated the structure in a string is. Sophistication is defined as the minimal complexity of a computable function that defines a two-part description for the string that is shortest within some precision; the second can be defined as the minimal computation time of a program that is shortest within some precision. We show that the Busy Beaver function of the sophistication of a string exceeds its logical depth with logarithmically bigger precision, and that logical depth exceeds the Busy Beaver function of sophistication with logarithmically bigger precision. We also show that sophistication is unstable in its precision: constant variations can change its value by a linear term in the length of the string.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Siamese Neural Networks: An Overview

Future Progress in Artificial Intelligence: A Survey of Expert Opinion

What an Algorithm Is

Article 11 January 2015

Robin K. Hill

Notes

In fact the proof only requires that U and V are optimal, i.e. for all machines W there exist c _W such that C _U(x) ≤ C _W(x)+c _W and similarly for V.
For any ε> 0, we can replace the term $\tfrac {3}{4}|x|$ by (1−ε)|x| if the significance of the second sophistication term is replaced by $c + O(\log (c/\varepsilon ))$.
On the other hand, any set must have non-typical elements unless the set contains a lot of mutual information with the Halting problem [17].
The Kolmogorov complexity of a set is the length of a shortest program that prints all its elements and halts.
Partition the set in subsets of size at most 2^j−k, this increases the complexity of the x-containing set by at most k.
Theorem IV.4 in [14] states that every decreasing function f is $(C(f)+O(\log |x|))$-close to the function
$$\lambda_{x}(k) = \min \left\{ C(S) + \log |S|: S \ni x \wedge C(x) \le k \right\} $$
of some string of length f(0). (We use plain complexity in the definition of λ_x, because all results hold up to $O(\log |x|)$ terms). For fixed x, the function λ_x is the inverse of $\text {soph}_{x}^{\text {Set}}$.
This probabilistic sufficiency criterion was defined in [20] in terms of prefix-free complexity, because 2^−K(x|P) defines a probability distribution and hence, it is natural to compare it with P(x). Prefix complexity and plain complexity differ by at most $O(\log |x|)$ [12], and this precision is sufficient for our discussion.
It is unclear whether H(x) = K m(x) + O(1).
The definition of total entropy used in [27,28] is K(P) + H(P). Notice that plain and prefix complexity are close ($|K(P) - C(P)| \le O(\log C(P)$). See also footnote 7.
In fact, in [27] the precision for which these inequalities should hold is not discussed. Also, the authors suggest that the computation time of a program for P is bounded by some computable function. In [29] the first requirement c = δ|x| is chosen for some δ > 0 and in the second requirement a different parameter is chosen. Furthermore, P should be computable as a real function and no restrictions on the computation time are considered. Also, K(P) was replaced by K(P, H(P)).
Indeed, if P is c-good then it is (2c)-sufficient. For the other direction, note that at most 2^{H(P) + c+1} elements satisfy $\log (1/P(x)) \le \lceil H(P) \rceil + c$, and these elements can be computed given P and ⌈H(x)⌉≤C(x) + c ≤ |x| + c + O(1). Hence a c-good model defines a $(c+O(\log |x|))$-sufficient set.
The formulation of Theorem 3.2.2 in [30] uses I(x;H) = K(x)−K ^H(x) with K ^H(x) the Kolmogorov complexity on a machine that has an oracle for the Halting problem. To obtain Theorem 7 from this, use the folklore result: $\text {depth}_{0}(x) \ge I(x;H) + O(\log I(x;H))$.

References

Solomonoff, R.: A formal theory of inductive inference. Part I. Inf. Control. 7(1), 1–22 (1964)
Article MathSciNet MATH Google Scholar
Kolmogorov, A.: Three approaches to the quantitative definition of information. Problemy Peredachi Informatsii 1(1), 3–11 (1965)
MathSciNet MATH Google Scholar
Chaitin, G.: On the length of programs for computing finite binary sequences. J. ACM 13(4), 547–569 (1966). ACM Press
Article MathSciNet MATH Google Scholar
Bennett, C.: Logical Depth and Physical Complexity, pp 227–257. Oxford University Press, Inc., New York (1988)
MATH Google Scholar
Antunes, L., Fortnow, L., van Melkebeek, D., Vinodchandran, N.V.: Computational depth: concept and applications. Theor. Comput. Sci. 354(3), 391–404 (2006)
Article MathSciNet MATH Google Scholar
Koppel, M.: Complexity, depth, and sophistication. Complex Systems 1, 1087–1091 (1987)
MathSciNet MATH Google Scholar
Juedes, D., Lathrop, J., Lutz, J.: Computational depth and reducibility. Theor. Comput. Sci. 132, 37–70 (1994)
Article MathSciNet MATH Google Scholar
Kolmogorov, A.N.: Talk in Information Theory Symposium. In: Tallinn, Estonia (1973)
Kolmogorov, A.N.: Complexity of algorithms and objective definition of randomness. Uspekhi Mat. Nauk 29(4), 155 (1974)
Google Scholar
Koppel, M.: Structure. In: Herken, R. (ed.) The Universal Turing Machine: a Half-Century Survey, 2nd edition, pp 403–419. Springer (1995)
Koppel, M., Atlan, H.: An almost machine-independent theory of program-length complexity, sophistication, and induction. Inf. Sci. 56(1-3), 23–33 (1991). Elsevier Science Publishers Ltd.
Article MathSciNet MATH Google Scholar
Li, Ming, Vitányi, P.: An Introduction to Kolmogorov Complexity and its Applications. Springer (2008)
Antunes, L., Fortnow, L.: Sophistication revisited. Theory of Computing Systems 45(1), 150–161 (2009). Springer
Article MathSciNet MATH Google Scholar
Vereshchagin, N., Vitanyi, P.: Kolmogorov’s structure functions and model selection. IEEE Trans. Inf. Theory 50, 3265–3290 (2004). Computer Society
Article MathSciNet MATH Google Scholar
Li, Ming, Vitányi, P.: An Introduction to Kolmogorov Complexity and Its Applications. Springer (1997)
Cover, T., Gacs, P., Gray, R.M.: Kolmogorov’s contributions to information theory and algorithmic complexity. Ann. Probab. 17(3), 840–865 (1989)
Article MathSciNet MATH Google Scholar
Epstein, S., Levin, L.: On sets of high complexity strings. CoRR, arXiv:abs/1107.1458 (2011)
Shen, A.: The concept of (alpha, beta)-stochasticity in the kolmogorov sense and its properties. Soviet Mathematics Doklady 28(1), 295–299 (1983)
MATH Google Scholar
Vyugin, V.: On the defect of randomness of a finite object with respect to measures with given complexity bounds. Theory Prob. Appl. 32(3), 508–512 (1987)
Article Google Scholar
Gács, P., Tromp, J., Vitányi, P.: Algorithmic statistics. IEEE Trans. Inform. Theory 47(6), 2443–2463 (2001)
Article MathSciNet MATH Google Scholar
Cover, T.: The Impact of Processing Techniques on Communications. chapter Kolmogorov Complexity, Data Compression and Inference., pages 23–33. J. Skwyrzynski. Martinus Nijhoff Publishers (1985)
Cover, T., Joy, T.: Elements of Information Theory. Wiley (1991)
Vitányi, P.: Meaningful information. IEEE Trans. Inf. Theory 52(10), 4617–4626 (2006)
Article MathSciNet MATH Google Scholar
Shen, A.: Algorithmic statistics: main results. In preparation (2013)
Vereshchagin, N.: Algorithmic minimal sufficient statistic revisited. Mathematical Theory and Computational Practice, 478–487 (2009)
Verschagin, N.: On Algorithmic Strong Sufficient Statistics. In: Proceedings of Computability in Europe (2013)
Gell-Mann, M., Lloyd, S.: Information measures, effective complexity, and total information. Complexity 2(1), 44–52 (1998)
Article MathSciNet MATH Google Scholar
Gell-Mann, M., Lloyd, S.: Effective complexity. Nonextensive entropy, 387–398 (2004)
Nihat, A., Muller, M., Szkola, A.: Effective complexity and its relation to logical depth. IEEE Trans. Inf. Theory 56(9), 4593–4607 (2010)
Article MathSciNet Google Scholar
Bauwens, B.: Computability in statistical hypotheses testing, and characterizations of independence and directed influences in time series using Kolmogorov complexity. PhD thesis, Ugent (2010)
Vereshchagin, N., Shen, A.: Algorithmic statistics revisited (2015). arXiv preprint arXiv:1504.04950

Download references

Acknowledgments

The authors are grateful to the anonymous reviewers and to to Alexander Shen and Nikolay Vereshchagin for useful comments and discussions. This work was partially supported by the national science foundation: Fundação para a Ciência e Tecnologia, through the scholarships SFRH/BPD/76231/2011, SFRH/BPD/75129/2010 and SFRH/BD/33234/2007, and grants of Instituto de Telecomunicações. The first author is partially funded by the ERDF through the COMPETE 2020 Programme within project POCI-01-0145-FEDER-006961, and by National Funds through the FCT as part of project UID/EEA/50014/2013. The second author was also supported by NAFIT ANR-08-EMER-008-01 project.

Author information

Authors and Affiliations

Faculty of Sciences, University of Porto, CRACS & INESC-Porto LA, R.Campo Alegre, 1021/1055, 4169-007, Porto, Portugal
Luís Antunes
Faculty of Computer Science, Research University Higher School of Economics, Kochnovskiy Proezd 3, 125319, Moscow, Russia
Bruno Bauwens
Departamento de Informática, Faculdade de Ciências da Universidade de Lisboa, Lisboa, Portugal
André Souto
SQIG-Instituto de Telecomunicações, Lisboa, Portugal
André Souto
Health Information and Decision Sciences Department, Universidade do Porto, Porto, Portugal
Andreia Teixeira
CINTESIS, Center for Research in Health Technologies and Information Systems, Porto, Portugal
Andreia Teixeira

Authors

Luís Antunes
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Bauwens
View author publications
You can also search for this author in PubMed Google Scholar
André Souto
View author publications
You can also search for this author in PubMed Google Scholar
Andreia Teixeira
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Luís Antunes.

Appendices

Appendix A: Machine Invariance of Logical Depth

Lemma 7

For all universal Turing machines U and V, there exist a constant $c^{\prime }$ such that for all c and x: depth _c,U (x)≥|x| [no Busy Beaver here!] implies

$$\text{depth}_{c+c^{\prime},V}^{bb}(x) \le \text{depth}_{c,U}^{bb}(x) + c^{\prime}. $$

Note that for some universal machines there exist a string w such that U(w x) = x for all x and the computation requires at most O(1) steps. For such machines U we have depth_{|w
x|}(x) ≤ O(1) and hence $\text {depth}_{|wx|}^{bb}(x) \le O(1)$. Other universal machines always scan the input, and on such machines we have $\text {depth}_{|wx|}^{bb}(x) \ge bb(|x|) - O(1)$ for all x. Hence, the assumption in the lemma is necessary.

Proof

Let w _V be the prefix such that V(w _V p) simulates U(p) for all p. Our result would follow easily if we assume that for any halting programs p, q on U such that time(p) ≤ time(q) we have time(w _V p) ≤ time(w _V q) on V; i.e. simulating U on V preserves the order of computation time. Indeed, any pair (p, q) usable in the definition of depth on U defines a pair (w _V p, w _V q) that can be used in the definition of depth on V. The program w _V p is minimal on V within c+|w _V|+|w _U| error (where w _U is the string that allows to simulate U on V). Hence, the pair (w _V p, w _V q) witnesses an increase of sophistication by at most |w _V| for an increase of the significance of at most c+|w _V|+|w _U|.

In the case where the assumption is not true, we need to find $c^{\prime }$ and a program of length at most $|q|+c^{\prime }$ on V that computes longer than time(w _V p) (where $c^{\prime }$ does not depend on p, q, c). Consider the following algorithm on input q: determine all programs p that have running time at most time(q) on U, determine for all these p’s the maximal running time T of a program w _V p on V (assume for now that for finite time(q) there are finitely many such p), and finally print a string of length T. For (p, q) usable in the definition of depth_{c, U}(x), the algorithm with input q produces an output longer than time(w _V p), and by universality there is a program of length $|q|+c^{\prime }$ on V that prints this string and hence computes longer than T.

Above, we have assumed that only finitely many programs on U have a halting time at most time(q) for halting q. This assumption is not true in general, but by the additional assumption of the lemma: depth_{c, U}(x) ≥ |x|, it suffices to consider only a finite subset of candidates: we only need the pairs (p, q) on U such that |p|≤|x| + O(1) and |x|≤time(q), which implies |p|≤time(q) + O(1). The proof finishes by modifying the above algorithm such that it only considers programs p for which |p|≤time(q) + O(1). □

Recall that Bennett’s definition of logical depth is the minimal computation time of a program on a prefix-free machine W (of some type) that is c-incompressible. We show that when scaled by the inverse Busy Beaver function, both notions of depth are $O(\log |x|)$-close. On a prefix-free machine W, both (unscaled) depths are closely related: Bennett’s logical depth of x at significance c is at most depth_{c + O(1), W}(x), because any c-shortest program p for x is c + O(1)-incompressible on W. On the other hand, by Lemma 5.3 of [7] (attributed to Bennett [4]), depth_{c + O(1)}(x) is bounded by a computable function of Bennett’s logical depth of x with significance c. Hence, after rescaling with the inverse Busy Beaver function, both notions are O(1)-close. Exchanging prefix-free machine by a plain machine, both depth notions are $O(\log |x|)$-close; indeed this follows by the same argument as Lemma 7 for W = V and replacing |w _V| by $O(\log |x|)$-terms in the proof (since $|K_{W}(x)-C_{U}(x)| \le O(\log |x|)$).

Appendix B: Alternative Proof of Theorem 1

An alternative proof for the second inequality in Theorem 1 is given: there exists e such that for all c and x with |x|≥e we have

$$\text{soph}_{c + e\log |x|}(x) \le \text{depth}_{c}^{bb}(x) + e\log |x|\,. $$

A prefix stable machine V is a plain machine such that for all strings p and extensions q of p: if p∈DomV then q∈DomV and V(p) = V(q). For (infinite) sequences ω let V(ω) be V(p) if a prefix p of ω exists such that V(p) is defined, and undefined otherwise. For any string or sequence ω, let 0.ω be the real ${\sum }_{i} \omega _{i} 2^{-i}$. A prefix stable machine is left computable [17] if for p such that V(p) is defined and for all q such that 0.q ≤ 0.p, also V(q) is defined. There are universal prefix stable machines that are left computable (just rearrange the programs on a universal machine). Let $\Omega = \sup \{0.p: V(p) \textnormal {is defined} \}$.

In order to prove the result aforementioned, it is sufficient to show that $\text {soph}_{c + 2\log |x|,U}(x) \le \text {depth}_{c,W}^{bb}(x) + 2\log |x|$ for large x, where W is a universal left computable machine. Indeed, there exists a universal plain machine U such that

$$\text{depth}_{c + 2\log |x|,W}^{bb}(x) \le \text{depth}_{c,U}^{bb}(x) + O(\log |x|). $$

(translating plain programs to self-delimiting ones can happen by affecting program sizes by at most $O(\log |p|)$ and computation time by a computable function of |p| and the halting time).

Let p be a program satisfying the conditions in the definition of $\text {depth}_{c}^{bb}(x)$. We show that the initial segment where p and Ω are equal defines a computable function that satisfies the conditions in the definition of sophistication. More precisely, let i be the length of the common initial segment, then $F(d) = V(\Omega _{1}{\dots } \Omega _{i}0d)$ satisfies the conditions. Note that, $p_{1}{\dots } p_{i} = \Omega _{1}{\dots } \Omega _{i-1}0$ and Ω_i = 1 by construction of i. Thus $F(p_{i+2}{\dots } p_{|p|}) = x$. For any d we have $0.\Omega _{1}{\dots } \Omega _{i}0d < \Omega $ and by left computability this is in DomV, thus F is computable. It remains to show that $C(F) \le \text {depth}_{c,W}(x)+O(\log \text {depth}_{c,W}(x))$. We show that $C(\Omega _{1}{\dots } \Omega _{i-1}) \le \text {depth}_{c,W}(x) + O(\log i)$. In fact, given i and a t that exceeds the computation time of p, we can search for the maximal value 0.w for a program w that halts in t computation steps. We know that 0.p ≤ 0.w ≤ Ω, hence we can compute the first i−1 bits of Ω which completes the proof.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Antunes, L., Bauwens, B., Souto, A. et al. Sophistication vs Logical Depth. Theory Comput Syst 60, 280–298 (2017). https://doi.org/10.1007/s00224-016-9672-6

Download citation

Published: 19 March 2016
Issue Date: February 2017
DOI: https://doi.org/10.1007/s00224-016-9672-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Sophistication vs Logical Depth

Abstract

Access this article

Similar content being viewed by others

Siamese Neural Networks: An Overview

Future Progress in Artificial Intelligence: A Survey of Expert Opinion

What an Algorithm Is

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix A: Machine Invariance of Logical Depth

Lemma 7

Proof

Appendix B: Alternative Proof of Theorem 1

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Sophistication vs Logical Depth

Abstract

Access this article

Similar content being viewed by others

Siamese Neural Networks: An Overview

Future Progress in Artificial Intelligence: A Survey of Expert Opinion

What an Algorithm Is

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix A: Machine Invariance of Logical Depth

Lemma 7

Proof

Appendix B: Alternative Proof of Theorem 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation