Abstract
Siegel’s Lemma is concerned with finding a “small” nontrivial integer solution of a large system of homogeneous linear equations with integer coefficients, where the number of variables substantially exceeds the number of equations (for example, n equations and N variables with N ≥ 2n), and “small” means small in the maximum norm. Siegel’s Lemma is a clever application of the Pigeonhole Principle, and it is a pure existence argument. The basically combinatorial Siegel’s Lemma is a key tool in transcendental number theory and diophantine approximation. David Masser (a leading expert in transcendental number theory) asked the question whether or not the Siegel’s Lemma is best possible. Here we prove that the so-called “Third Version of Siegel’s Lemma” is best possible apart from an absolute constant factor. In other words, we show that no other argument can beat the Pigeonhole Principle proof of Siegel’s Lemma (apart from an absolute constant factor). To prove this, we combine a concentration inequality (i.e., Fourier analysis) with combinatorics.
References
N. Alon, D.N. Kozlov, Coins with arbitrary weights. J. Algorithms 25, 162–176 (1997)
N. Alon, V.H. Vu, Anti-Hadamard matrices, coin weighing, threshold gates and indecomposable hypergraphs. J. Comb. Theory Ser. A 79, 133–160 (1997)
A. Baker, Transcendental Number Theory (Cambridge University Press, Cambridge, 1975)
E. Bombieri, W. Gubler, Heights in Diophantine Geometry. New Mathematical Monographs, vol. 4 (Cambridge University Press, Cambridge, 2006)
E. Bombieri, J.D. Vaaler, On Siegel’s lemma. Invent. Math. 73(1), 11–32 (1983)
P. Erdős, On a lemma of Littlewood and Offord. Bull. Am. Math. Soc. 51, 898–902 (1945)
G. Halász, Estimates for the concentration function of combinatorial number theory and probability. Period. Math. Hung. 8, 197–211 (1977)
A.M. Macbeath, On measure of sum-sets, II. The sum-theorem for the torus. Proc. Camb. Philos. Soc. 49, 40–43 (1953)
W.M. Schmidt, Diophantine Approximations and Diophantine Equations. Lecture Notes in Mathematics, vol. 1467 (Springer, Berlin, 1991)
J. Spencer, Six standard deviations suffice. Trans. Am. Math. Soc. 289, 679–706 (1985)
J.D. Vaaler, The best constant in Siegel’s lemma. Monatshaft. Math. 140(1), 71–89 (2003)
J.D. Vaaler, A.J. van der Poorten, Bounds for solutions of systems of linear equations. Bull. Aust. Math. Soc. 25, 125–132 (1982)
Acknowledgements
I am grateful to David Masser (Basel) who called my attention to the problem.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix: Proof of the Third Version of Siegel’s Lemma
Appendix: Proof of the Third Version of Siegel’s Lemma
We combine the usual pigeonhole principle argument with probability theory; in particular, we borrow some ideas from the paper of Spencer [10]. We use the following variant of the large deviation theorem in probability theory. Bernstein’s inequality Let Z 1, Z 2, …, Z N be real-valued independent random variables with zero expectation E Z j = 0 and | Z j | ≤ M, 1 ≤ j ≤ N. Then, for all positive τ > 0,
Consider now the i-th row of the homogeneous linear system
For a positive integer B ≥ 1 and an integer j in 1 ≤ j ≤ N, let X j denote the random variable with \(\Pr [X_{j} = b] ={ 1 \over 2B+1}\), where b runs over the integers − B, −B + 1, −B + 2, …, B. Moreover, assume that X 1, X 2, …, X N are independent. We apply Bernstein’s inequality with Z j = d i, j X j , \(\tau =\lambda \sqrt{N}AB\) and M = AB:
Let
Then by (109), for every 1 ≤ h ≤ logn,
For every integer h in 1 ≤ h ≤ logn, define the random variable
By using (111), we obtain the following upper bound for the expected value of Y h :
Since the random variable Y h has non-negative values, we can use the simple Markov inequality stating that for any random variable Y with non-negative values and finite expectation
Applying Markov inequality in (113) we have
Using the telescoping sum
we obtain that
so, with probability greater than 1∕2 we have
(114) means that, with probability greater than 1∕2, the number of row-sums ∑ j = 1 N d i, j X j that have absolute value \(\geq 6h\sqrt{N}AB\), is less than
It follows that, with probability greater than 1∕2, the number of row-sums ∑ j = 1 N d i, j X j that have absolute value between \(6h\sqrt{N}AB\) and \(6(h + 1)\sqrt{N}AB\), is less than
and there is no row-sum with absolute value \(\geq 6\log n\sqrt{N}AB\). In the last step we used the fact that
(lower integral part).
Write (see (115))
and
The total number of row-sum vectors (with n coordinates) satisfying (115) can be estimated from above by using the parameters k h in (116)–(117) as follows:
On the other hand, “with probability greater than 1∕2” means more than
possible vectors v ∈ {−B, −B + 1, −B + 2, …, B}N.
So, if (119) is greater or equal to (118), than the Pigeonhole Principle applies, and there exist two different vectors
such that they generate the same row-vector, i.e., Dv 1 = Dv 2. Then x = v 1 −v 2 satisfies the homogeneous linear system Dx = 0 with
The rest is routine estimation. Clearly
By using s! ≥ (s∕e)s and (116), we have
Combining (118), (119) and (123), it suffices to guarantee the inequality
Inequality (124) clearly holds with
and using (120) in (125), we conclude
completing the proof of the Third Version of Siegel’s Lemma. □
Rights and permissions
Copyright information
© 2017 Springer International publishing AG
About this chapter
Cite this chapter
Beck, J. (2017). Siegel’s Lemma Is Sharp. In: Loebl, M., Nešetřil, J., Thomas, R. (eds) A Journey Through Discrete Mathematics. Springer, Cham. https://doi.org/10.1007/978-3-319-44479-6_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-44479-6_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44478-9
Online ISBN: 978-3-319-44479-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)