Abstract
In Pollack and Stevens (Ann Sci Éc Norm Supér 44(1):1–42, 2011), efficient algorithms are given to compute with overconvergent modular symbols. These algorithms then allow for the fast computation of padic Lfunctions and have further been applied to compute rational points on elliptic curves (e.g. Darmon and Pollack in Israel J Math 153:319–354, 2006, Trifkovic in Duke Math J 135(3):415–453, 2006). In this paper, we generalize these algorithms to the case of families of overconvergent modular symbols. As a consequence, we can compute padic families of Heckeeigenvalues, twovariable padic Lfunctions, Linvariants, as well as the shape and structure of ordinary Hida–Hecke algebras.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Background
In a seminal work from the mid80s, Hida [21, 22] introduced a theory of padic families of ordinary Heckeeigenforms. This work was generalized by Coleman [12] in the mid90s to include nonordinary forms which ultimately led to the Coleman–Mazur [11] construction of the eigencurve—a rigid analytic space which parametrizes all finite slope Heckeeigenforms. Over the following decade, the theory of padic variation of automorphic forms blossomed with multiple constructions of eigenvarieties over a wide class of reductive groups [1, 2, 4, 15, 33]. Moreover, the consequences to number theory of the existence of padic families of automorphic forms have been profound with the proofs of the Mazur–Tate–Teitelbaum conjecture, the Main Conjecture (for class groups or modular forms!), and the Fontaine–Mazur conjecture (just to name a few) all heavily reliant upon the theory of padic variation.
With that said, our current state of understanding of the shape and structure of these eigenvarieties is still quite limited. Taking the simplest example of ordinary forms on \({{\mathrm{GL}}}_2 / {\mathbb Q}\) (e.g. the setting of Hida’s original work), we do not have a good understanding of a single example of a Hecke–Hida algebra which is not simply a union of open discs.
This paper will attempt to rectify this situation at least in the case of classical Hida theory by introducing methods for computing with families of overconvergent modular symbols (which form the basis of Stevens’ construction of the eigencurve). These methods generalize the constructions of [28] where overconvergent modular symbols of a fixed weight were studied (some applications of that article appear in [14] and [31]). As a consequence, we can compute qexpansions of Hida families, twovariable padic Lfunctions, Linvariants of modular forms and their symmetric squares, and, moreover, we can get our hands on the geometry of Hida families in several nontrivial situations.
As an example of some of the invariants we can compute, take p \(=\) 11 and consider the 11adic Hida family passing through Ramanujan’s discriminant form \(\Delta \). In this case, the Hida family passing through \(\Delta \) is parametrized by a single open disc. Thus, for any prime \(\ell \), the Heckeeigenvalue of \(T_\ell \) acting on the Hida family through Ramanujan’s \(\Delta \) is a power series in a weight variable k ^{Footnote 1}. We compute for example
Note that we have that
while
as this Hida family specializes in weight 2 to the modular form associated with the elliptic curve \(X_0(11)\). The above approximation to \(a_2(k)\) does indeed give these values modulo \(11^5\). For other positive integer values of k, the above formula gives 11adic approximations to the coefficient of \(q^2\) in the unique normalized 11ordinary form of weight k and level 1. A higher precision approximation to \(a_2(k)\) (and \(a_\ell (k)\) for \(\ell \le 11\)) is given in Example 5.2.
For another example, consider \(p=3\) and fix a tame level \(N=11\). There are exactly two 3ordinary forms of weight 2 and level 33, and moreover, these forms are congruent modulo 3. In particular, the Hida family attached to these forms cannot simply be the union of two open discs (because of the congruence between the two forms). The possibilities for the geometry of this Hida family include two discs glued together at some collection of points or a double cover of weight space ramified at several points. We note this example has already appeared in several places including [10, 19, 28]. Using the methods of this paper, we were able to determine that this Hida family is a doublecover of weight space ramified at a single (nonclassical) weight, and moreover, this weight is congruent to \(30060 \text { }({\text {mod}}\text { }3^{11})\).
To discuss the methods of the paper, we introduce some notation. Let \({\mathbf A}\) denote the space of convergent power series on the closed unit disc, and let \({\mathbf {D}}\) denote the space of distributions equal to the continuous \({\mathbb Q}_p\)dual of \({\mathbf A}\). Let \(\Sigma _0(p)\) denote the semigroup of matrices \(\bigl ( {\begin{matrix} a &{} b \\ c &{} d \end{matrix}} \bigr )\in \text {M}_2({\mathbb Z}_p)\) with \(a \in {\mathbb Z}_p^\times \), \(c \in p{\mathbb Z}_p\), and with nonzero determinant. For each weight k, one can endow \({\mathbf {D}}\) with a weight k action by \(\Sigma _0(p)\), and we write \({\mathbf {D}}_k\) for this space of distributions. The space
is the collection of overconvergent modular symbols of level \(\Gamma _0 = \Gamma _0(Np)\). The systems of Heckeeigenvalues occurring in this space are essentially the same as the systems which occur in the space of finite slope overconvergent modular forms of weight \(k+2\) and level \(\Gamma _0\) (see [29, Theorem 7.1]).
As with overconvergent modular forms, these spaces can be padically interpolated over weight space. To this end, set D equal to a closed disc in weight space of radius 1/p about any tame character, and set R equal to the collection of convergent power series on D. Then \({\mathbf {D}}\hat{\otimes }R\) sits inside of the space of Rvalued distributions. In particular, by evaluating at a weight k in D, we obtain a specialization map \({\mathbf {D}}\hat{\otimes }R \rightarrow {\mathbf {D}}_k\), and we refer to elements of \({\mathbf {D}}\hat{\otimes }R\) as families of distributions on D.
Moreover, one can equip \({\mathbf {D}}\hat{\otimes }R\) with a \(\Sigma _0(p)\)action which is simultaneously compatible with all of the specialization maps on D.^{Footnote 2} We thus interpret
as the space of families of overconvergent modular symbols on D of level \(\Gamma _0\). This space admits a Heckeaction, and we define the ordinary subspace \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}\hat{\otimes }R)^{{{\mathrm{ord}}}}\) as the intersection of the images of all powers of \(U_p\). All of the information of padic Hida families of tame level N is contained within this ordinary subspace (as Hida families extend to all of weight space).
In this paper, we introduce methods for explicitly computing approximations to elements of \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}\hat{\otimes }R)\). In particular, we are able to compute approximations to the characteristic polynomial of any Hecke operator acting on \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}\hat{\otimes }R)^{{{\mathrm{ord}}}}\). From these computations, one can then compute qexpansions of Hida families of eigenforms. From this, one can compute Linvariants via the formulae of [13, 20, 23]. Moreover, computing twovariable padic Lfunctions is immediate once one has a family of overconvergent eigensymbols in hand as in [18]. Lastly, these computations also allow us to gain some fine control over the geometry of these Hida families in a wide variety of examples. We include several of the examples we computed below. In future work, we will implement nontrivial nebentypus (thus allowing for odd weights), coefficients in extensions of \({\mathbb Z}_p\), and computations for the prime \(p=2\). The primary bottleneck in speed is computing with padic polynomials in Sage. After speeding this up, our future work will also aim to compute examples more systematically.
The algorithms developed in this paper have been implemented in Sage [32] and continue to be developed on the SageMathCloud. Once sufficiently polished, the code will be submitted for inclusion into Sage.
1.1 Outline
In the following section, we introduce the relevant distribution spaces leading to the definition of the space of families of distributions, \({\mathbf {D}}\hat{\otimes }R\). In the third section, we introduce methods of working in the space \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}\hat{\otimes }R)\) including producing explicit elements in this space, forming a basis of \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}\hat{\otimes }R)^{{{\mathrm{ord}}}}\), and computing characteristic power series of Hecke operators on this ordinary subspace. In the fourth section, we explain how to carry out these computations in practice by giving a systematic method of approximating families of overconvergent modular symbols. Lastly, in the fifth section, we close with several examples which we computed via these methods.
2 Distribution modules in families
In this section, we introduce the relevant distribution spaces which will ultimately be the coefficients of our spaces of modular symbols.
2.1 Distribution spaces
Let \({\mathbf A}\) denote the Tate algebra in a single variable z. That is, \({\mathbf A}= {\mathbb Q}_p\langle z\rangle \), the collection of power series with coefficients in \({\mathbb Q}_p\) which converge on the unit disc of \({\mathbb C}_p\):
Note that \({\mathbf A}\) is a Banach space under the norm
where \(f(z) = \sum _n a_n z^n\). We then define our space of distributions \({\mathbf {D}}\) by
Note that \({\mathbf {D}}\) is a Banach space under the operator norm
An element \(\mu \in {\mathbf {D}}\) is uniquely determined by its values on all monomials \(z^j\) since the latter have dense span in \({\mathbf {D}}\). We will refer to the sequence \(\{\mu (z^j)\}_{j=0}^\infty \) as the moments of \(\mu \). We have that
Indeed, for each \(f(z) = \sum _j c_j z^j \in {\mathbf A}\), we need that \(\sum _j c_j \mu (z^j)\) converges. But since \(c_j \rightarrow 0\), this only forces \(\mu (z^j)\) to be bounded (and any bounded sequence defines a distribution).
We will write \({\mathbf {D}}^0\) (resp. \({\mathbf A}^{\!0}\)) for the unit ball of \({\mathbf {D}}\) (resp. \({\mathbf A}\)). Note that \(\mu \in {\mathbf {D}}^0\) if and only if \(\mu (z^j) \in {\mathbb Z}_p\) for all \(j \ge 0\).
The space \({\mathbf {D}}\) is our basic distribution space which we will ultimately study in families. But we will need to make use of some slightly fancier distribution spaces which we introduce now.
For \(r \ge 1\), let \({\mathbf A}[r]\) denote the collection of power series over \({\mathbb Q}_p\) which converge on the disc in \({\mathbb C}_p\) of radius r around 0, i.e.
Then \({\mathbf A}[r]\) is a Banach space under the sup norm, and we define \({\mathbf {D}}[r] = {{\mathrm{Hom}}}_{{{\mathrm{cont}}}}({\mathbf A}[r],{\mathbb Q}_p)\) as the dual Banach space.
Thus, \({\mathbf {D}}[1]\) is nothing other than \({\mathbf {D}}\) introduced above. However, if \(r>1\), then \({\mathbf {D}}[r]\) is a larger space than \({\mathbf {D}}\) and contains distributions whose moments are not bounded. Indeed, for \(f = \sum _j c_j z^j \in {\mathbf A}[r]\) to converge on the disc of radius r, the sequence \(\{c _j \}\) must converge rapidly to 0, thus allowing the sequence \(\{ \mu (z^j) \}\) to have some nontrivial growth. Explicitly,
Finally, we set , i.e. the collection of power series which converge on some disc of radius strictly greater than 1. This space is endowed with the inductive limit topology. We define \({\mathcal D}^\dag = {{\mathrm{Hom}}}_{{{\mathrm{cont}}}}({\mathcal A}^\dag ,{\mathbb Q}_p)\). Equivalently, ; or more simply, \({\mathcal D}^\dag \) is the intersection over all \(r>1\) of \({\mathbf {D}}[r]\). Thus,
2.2 Weight space
For the remainder of the paper, let p denote an odd prime. Let \({\mathcal W}= {{\mathrm{Hom}}}({\mathbb Z}_p^\times ,{\mathbb C}_p^\times )\) denote the collection of continuous characters from \({\mathbb Z}_p^\times \) to \({\mathbb C}_p^\times \). We will refer to this as weight space. There is an injective map from \({\mathbb Z}\rightarrow {\mathcal W}\) sending k to the “raising to the kth power” character.
Since \({\mathbb Z}_p^\times \cong ({\mathbb Z}/p{\mathbb Z})^\times \times (1+p{\mathbb Z}_p)\), a character in \({\mathcal W}\) is uniquely determined by its restriction to \(({\mathbb Z}/p{\mathbb Z})^\times \) and by its value on a topological generator \(\gamma \) of \(1+p{\mathbb Z}_p\). Moreover, if \(\kappa \in {\mathcal W}\), then \(\kappa (\gamma )  1 < 1\).
Let D(0, 1) be the open unit disc of \({\mathbb C}_p\) about 0. The map
is a bijection. In particular, \({\mathcal W}\) can be identified with \(p1\) copies of the open unit disc. Let \(\omega : ({\mathbb Z}/p{\mathbb Z})^\times \rightarrow {\mathbb Z}_p^\times \) denote the Teichmüller character and, for \(0\le m\le p2\), let \({\mathcal W}_m\) denote the subspace of \({\mathcal W}\) consisting of characters whose restriction to \(({\mathbb Z}/p{\mathbb Z})^\times \) equals \(\omega ^m\).
2.3 The weight \(\kappa \) action
Let \(\Sigma _0(p)\subseteq \text {M}_2({\mathbb Z}_p)\) denote the semigroup of matrices \(\bigl ( {\begin{matrix} a &{} b \\ c &{} d \end{matrix}} \bigr )\) of nonzero determinant with \(a \in {\mathbb Z}_p^\times \) and \(c \in p{\mathbb Z}_p\). For each \(\kappa \in {\mathcal W}\), we wish to define a “weight \(\kappa \) action” of \(\Sigma _0(p)\) on the above spaces of power series and distributions. This will allow us to eventually define Hecke actions on spaces of overconvergent modular symbols.
First, for k an integer, we can define the weight k action of \(\Sigma _0(p)\) on the spaces defined in Sect. 2.1 as follows. For f in \({\mathbf A}[r]\) with \(r<p\) and \(\gamma \in \Sigma _0(p)\), we define
which endows \({\mathbf A}[r]\) with a left \(\Sigma _0(p)\)action. Dually, for \(\mu \in {\mathbf {D}}[r]\), we define
which endows \({\mathbf {D}}[r]\) with a right \(\Sigma _0(p)\)action. Furthermore, this endows \({\mathcal D}^\dag \) (resp. \({\mathcal A}^\dag \)) with a right (resp. left) \(\Sigma _0(p)\)action.
Now we consider the case of padic weights. Let \(W_m\) denote the subspace of characters \(\kappa \) in \({\mathcal W}_m\) that satisfy \(\kappa (\gamma )  1 \le 1/p\) for some (and hence every) topological generator \(\gamma \) of \(1+p{\mathbb Z}_p\). Note that the classical weights—the “raising to the kth power” characters, for \(k\in {\mathbb Z}\)—are all in \(W_m\), for some m. We can (and will) identify \(W_m\) with the closed disc of radius 1 / p around 0. For \(\kappa \in W_m\), we will define weight \(\kappa \) actions on our spaces of distributions. The key to doing this is to make sense of \(\kappa (a+cz)\) as a power series in z (see Definition 2.4 and Lemma 2.5 below).
We begin with some lemmas.
Lemma 2.1
If \({{\mathrm{ord}}}_p(x) > \frac{1}{p1}\), then \(\log (1+x)  = x\).
Proof
The condition that \({{\mathrm{ord}}}_p(x) > \frac{1}{p1}\) forces the first term to dominate in the power series expansion of \(\log (1+x)\). \(\square \)
Lemma 2.2
For \(n \ge 1\),
Proof
We have
\(\square \)
Now for \(\kappa \in W_0\), define
which the following lemma shows is a power series expansion for the character \(\kappa \).
Lemma 2.3
Fix \(\kappa \in W_0\).

(1)
\(F_\kappa (x)\) converges for x such that \({{\mathrm{ord}}}_p(x) > \frac{1}{p1}\), i.e. \(F_\kappa (z) \in {\mathbf A}[r]\) for any \(r < p^{1/(p1)}\).

(2)
For x with \({{\mathrm{ord}}}_p(x) > \frac{1}{p1}\), we have \(F_\kappa (x) \le 1\).

(3)
For \(x \in 1+p{\mathbb Z}_p\),
$$\begin{aligned} F_\kappa (x1) = \kappa (x). \end{aligned}$$
Proof
For the first part, since \(\kappa \in W_0\), we have \(\kappa (\gamma )1 \le 1/p\). Furthermore, if \({{\mathrm{ord}}}_p(x) > \frac{1}{p1}\), then by Lemma 2.1, we have \({{\mathrm{ord}}}_p( \log (1+x) ) = {{\mathrm{ord}}}_p(x)\). If
then \({{\mathrm{ord}}}_p(L) = {{\mathrm{ord}}}_p(x)  1\).
Let’s further assume that \({{\mathrm{ord}}}_p(x) < 1\), so that \({{\mathrm{ord}}}_p(L) < 0\). Then,
and thus
Since this term goes to infinity as \(n \rightarrow \infty \), we have that \(F_\kappa (x)\) converges. Furthermore, since this is true for any x with \(1> {{\mathrm{ord}}}_p(x) > \frac{1}{p1}\), we must have that \(F_\kappa (x)\) converges for all x with \({{\mathrm{ord}}}_p(x) > \frac{1}{p1}\).
For the second part, note that every term in the power series which defines \(F_\kappa (x)\) has valuation at least 0 for x with \(1> {{\mathrm{ord}}}_p(x) > \frac{1}{p1}\). Thus, \(F_\kappa (x) \le 1\) for all x with \({{\mathrm{ord}}}_p(x)> \frac{1}{p1}\) (by the Maximum Modulus Principle applied to any closed disc of radius between 1 / p and \(p^{1/(p1)}\), see [7, Proposition 3 of Section 5.1.4]).
For the third part, write \(x = \gamma ^a\). Then, we have
\(\square \)
Definition 2.4
Fix \(\kappa \in W_m\) and write \(\kappa = \omega ^m \cdot \kappa _0\) with \(\kappa _0 \in W_0\). Let \(a \in {\mathbb Z}_p^\times \) and \(c \in p {\mathbb Z}_p\). Define
Lemma 2.5
For \(\kappa \in W_m\), \(a \in {\mathbb Z}_p^\times \), and \(c \in p{\mathbb Z}_p\), we have

(1)
\(F_{\kappa ,a,c}(x)\) converges for x such that \({{\mathrm{ord}}}_p(x) > \frac{1}{p1}  1\), i.e. \(F_{\kappa ,a,c}(z)\) is in \({\mathbf A}[p^h]\) for any \(h< c_p := 1  \frac{1}{p1}=\frac{p2}{p1}\),

(2)
\(F_{\kappa ,a,c}(x) \le 1\) for x with \({{\mathrm{ord}}}_p(x) > \frac{1}{p1}  1\),

(3)
for \(x \in {\mathbb Z}_p\),
$$\begin{aligned} F_{\kappa ,a,c}(x) = \kappa (a+cx). \end{aligned}$$
Proof
The first and second parts follow from the previous lemma since \({{\mathrm{ord}}}_p \left( \frac{a+cx}{\omega (a)}1 \right) > \frac{1}{p1}\). The third part also follows from the previous lemma. Indeed, as
we have
Thus,
\(\square \)
We can now define the weight \(\kappa \) action for \(\kappa \in W_m\) just as before. Indeed, for f in \({\mathbf A}[r]\) (with \(1\le r < r_p := p^{c_p}\)) and \(\gamma \in \Sigma _0(p)\), we define
which by Lemma 2.5 is again in \({\mathbf A}[r]\). Thus, we have endowed \({\mathbf A}[r]\) with a left \(\Sigma _0(p)\)action. Further, for \(\mu \in {\mathbf {D}}[r]\), we define
which endows \({\mathbf {D}}[r]\) with a right \(\Sigma _0(p)\)action. Again, this automatically endows \({\mathcal D}^\dag \) (resp. \({\mathcal A}^\dag \)) with a right (resp. left) \(\Sigma _0(p)\)action.
2.4 Power series in families over weight space
Let \(R:= {\mathbf A}(W_m)\) denote the space of convergent power series on the closed disc \(W_m\), say in a variable W. We then have
If we set \(w : = \frac{W}{p}\), then \(R\) is simply the Tate algebra \({\mathbb Q}_p\langle \langle w \rangle \rangle \) in the variable w. Set \(R^0\) equal to the unit ball of \(R\) under the sup norm which is simply the integral Tate algebra \({\mathbb Z}_p\langle \langle w \rangle \rangle \).
Consider the space \({\mathbf A}\hat{\otimes }R\). We can think of elements of this space as families of elements of \({\mathbf A}\) over \(W_m\). Indeed for each \(\kappa \in W_m\) with values in \({\mathbb Q}_p\), we have a map
given by evaluating elements of \(R\) at \(\kappa \). Thus, for a fixed element F of \({\mathbf A}\hat{\otimes }R\), we get a family of elements \(\kappa (F) \in {\mathbf A}\) for each \(\kappa \in W_m\) having values in \({\mathbb Q}_p\).
More explicitly, we have
and thus elements of \({\mathbf A}\hat{\otimes }R\) are formal power series in z and w which converge for all \(z \le 1\), \(w \le 1\). Evaluating at \(\kappa \) simply means evaluating w at \((\kappa (\gamma )1)/p\). Thus, as we padically vary \(\kappa \) over \(W_m\), we get a padic family of elements of \({\mathbf A}\).
We now seek to give \({\mathbf A}\hat{\otimes }R\) the structure of a \(\Sigma _0(p)\)module in such a way that the above map (“evaluation at \(\kappa \)”) is equivariant with respect to this action on the source and the weight \(\kappa \) action on the target. We do this by constructing a twovariable power series that interpolates \(F_{\kappa ,a,c}(z)\) as \(\kappa \) varies.
For \(a \in {\mathbb Z}_p^\times \) and \(c \in p{\mathbb Z}_p\), define
where \(\log _\gamma (z) := \log z /\log \gamma \).
Lemma 2.6
For \(a \in {\mathbb Z}_p^\times \) and \(c \in p{\mathbb Z}_p\), we have

(1)
\(K_{a,c,m}(z,w)\) converges for z and w such that \(z < p^{c_p}\) and \(w \le 1\). That is, \(K_{a,c,m}(z,w) \in {\mathbf A}[p^h] \hat{\otimes }R\) for \(h < c_p\),

(2)
\(K_{a,c,m}(z,w) \le 1\) for all such z, w,

(3)
for \(\kappa \in W_m\), we have
$$\begin{aligned} \kappa (K_{a,c,m}(z,w)) = F_{\kappa ,a,c}(z). \end{aligned}$$
Proof
The third part follows immediately from the definitions as
But then the first and second parts follow from this equality and from Lemma 2.5. \(\square \)
With this lemma in hand, we can thus define a \(\Sigma _0(p)\)action on \({\mathbf A}[r] \hat{\otimes }R\) for \(r < p^{c_p}\). For \(f \in {\mathbf A}[r]\), set
and extend this action \(R\)linearly to all of \({\mathbf A}[r] \hat{\otimes }R\).
Lemma 2.7
For \(\kappa \in W_m\) and \(r<p^{c_p}\), we have
is \(\Sigma _0(p)\)equivariant where the source is endowed with the action in (2.3) and the target is endowed with the weight \(\kappa \) action.
Proof
This lemma follows immediately from the definition of both actions. \(\square \)
We mention here two basic properties of the automorphy factor \(K_{a,c,m}(z,w)\), both of which follow directly from the definition and which will be useful later.
Lemma 2.8
We have

(1)
\(K_{1,0,m}(z,w) = 1\),

(2)
\(K_{a,c,m}(z,w)\big _{w = 0} = \omega (a)^m\).
2.5 Distributions in families over weight space
In this section, we discuss families of distributions and their \(\Sigma _0(p)\)actions. To consider families of distributions, a natural place to begin is the space
that is, the space of \(R\)valued distributions. Evaluating such distributions at varying \(\kappa \in W_m\) then gives rise to a family of singlevariable distributions. Moreover, these distributions are again quite concrete. They are uniquely determined by their sequence of moments, and, in this case, each moment is a power series in w.
However, the space \({\mathbf {D}}[r](R)\) turns out to be much larger than what we need to work with, and instead, we consider the space \({\mathbf {D}}[r] \hat{\otimes }R\). Note that there is a natural injection:
but this map need not be surjective. For example, the distribution \(\widetilde{\mu }\in {\mathbf {D}}(R)\) defined by
is not in \({\mathbf {D}}\hat{\otimes }R\). To see this, note that every distribution in \({\mathbf {D}}\hat{\otimes }R\) is a limit of finite sums of elements of the form \(\mu \otimes f\) with \(\mu \in {\mathbf {D}}\) and \(f \in R\). As such, for each n, there are only finitely many coefficients of f which are not in \(p^n {\mathbb Z}_p\). In particular, for any fixed element of \({\mathbf {D}}\hat{\otimes }R\), in all of its moments only finitely many coefficients are not in \(p^n {\mathbb Z}_p\). Note that the distribution \(\widetilde{\mu }\) above clearly does not have this property.
We again have a specialization map
given by evaluation at \(\kappa \in W_m\). We now seek to give an action of \(\Sigma _0(p)\) on \({\mathbf {D}}[r] \hat{\otimes }R\) which makes the above map equivariant when \({\mathbf {D}}[r]\) is given the weight \(\kappa \) action.
To do this, first note that \({\mathbf {D}}[r]\) is an \({\mathbf A}[r]\)module via
where \(f,g \in {\mathbf A}[r]\) and \(\mu \in {\mathbf {D}}[r]\). Thus, \({\mathbf {D}}[r] \hat{\otimes }R\) is naturally an \({\mathbf A}[r] \hat{\otimes }R\)module. Note also that \({\mathbf {D}}[r](R)\) is naturally an \({\mathbf A}[r] \hat{\otimes }R\)module as
Furthermore, we can easily define a weight 0 action of \(\Sigma _0(p)\) on \({\mathbf {D}}[r](R)\) via
for \(\widetilde{\mu }\in {\mathbf {D}}[r](R)\).
Lemma 2.9
Both \({\mathbf {D}}[r] \hat{\otimes }R\) and \({\mathbf {D}}[r](R)\) are \(\Sigma _0(p)\)modules via the formula
for
Proof
This formula clearly gives an action on \({\mathbf {D}}[r](R)\). To complete the proof, we must check that \({\mathbf {D}}[r] \hat{\otimes }R\subseteq {\mathbf {D}}[r](R)\) is preserved by this action. This detail is verified in [6, page 30, Remark 3.1]; we note that in [6], the notation \({\mathbf {D}}[r](R)\) refers to \({\mathbf {D}}[r] \hat{\otimes } R\). \(\square \)
We will also have the need to consider the larger distribution space
(when we solve the “difference equation”). This space again is naturally a \(\Sigma _0(p)\)module and we note that as before
The following lemma will allow us to use the Hecke operator \(U_p\) to pass from \({\mathcal D}^\dag (R)\)valued modular symbols to \({\mathbf {D}}\hat{\otimes }R\)valued ones.
Lemma 2.10
If \(\widetilde{\mu }\in {\mathcal D}^\dag (R)\), then \(\widetilde{\mu }\big \!\left( {\begin{matrix} 1 &{} a \\ 0 &{} p \end{matrix}} \right) \in {\mathbf {D}}\hat{\otimes }R\).
Proof
Since \(\widetilde{\mu }\in {\mathcal D}^\dag (R)\), we have that the sequence \(\{ \widetilde{\mu }(z^j) \}\) is \(O(r^j)\) for every \(r>1\). Furthermore, we have that
since \(K_{1,0,m}(z,w) = 1\) by Lemma 2.8. Because \(p^n \widetilde{\mu }(z^n) \rightarrow 0\), it is clear that the moments of \(\widetilde{\mu }\big \!\left( {\begin{matrix} 1 &{} a \\ 0 &{} p \end{matrix}} \right) \) are bounded and thus this distribution is in \({\mathbf {D}}(R)\). Furthermore, for any M, for n large enough \(p^n \widetilde{\mu }(z^n) \in p^M R^0\). Thus, modulo \(p^M R^0\), the moments of the distribution \(\widetilde{\mu }\big \!\left( {\begin{matrix} 1 &{} a \\ 0 &{} p \end{matrix}} \right) \) only depend on finitely many moments of \(\widetilde{\mu }\). In particular, \(\widetilde{\mu }\big \!\left( {\begin{matrix} 1 &{} a \\ 0 &{} p \end{matrix}} \right) \) can be written as a limit of elements of \({\mathbf {D}}\otimes R\), and hence \(\widetilde{\mu }\big \!\left( {\begin{matrix} 1 &{} a \\ 0 &{} p \end{matrix}} \right) \in {\mathbf {D}}\hat{\otimes }R\). \(\square \)
2.6 Analyzing the automorphy factor
By Lemma 2.6, \(K_{a,c,m}(z,w)\) is in \({\mathbf A}\hat{\otimes }R\). In this section, we will further analyze the coefficients of this automorphy factor in order to gain better control of the \(\Sigma _0(p)\)action on families of distributions. We begin by introducing some rings that will be useful for this purpose.
Consider an abstract Tate algebra, \({\mathbb Q}_p\langle \langle x \rangle \rangle \) and define
where \(c_p = 1  \frac{1}{p1} = \frac{p2}{p1}\), as before. Note that by definition \(S_x \subseteq {\mathbb Z}_p\langle \langle x \rangle \rangle \).
Lemma 2.11
\(S_x\) is a subring of \({\mathbb Z}_p\langle \langle x \rangle \rangle \).
Proof
We only need to check that \(S_x\) is closed under multiplication. To this end, let \(f = \sum a_n x^n\) and \(g = \sum b_n x^n\). Then the nth coefficient of fg equals \(\sum _{i+j=n} a_i b_j\), and we have
Thus \(fg \in S_x\) as desired. \(\square \)
Lemma 2.12
If \(f \in S_x\) with \(f(0) \in {\mathbb Z}_p^\times \), then \(f^{1} \in S_x\).
Proof
Let \(f(x) = \sum _i a_i x^i\) and \(g(x) = \sum _j b_j x^j\) with \(f \cdot g = 1\). We check inductively that \({{\mathrm{ord}}}_p(b_n) \ge n \cdot c_p\). For \(n=0\), this is immediate as \(b_0 = a_0^{1}\). For \(n>0\), we have
By induction, for \(i>0\) we have \({{\mathrm{ord}}}_p (b_{ni}) \ge (ni) \cdot c_p\), and thus
as desired. \(\square \)
Lemma 2.13
If \(r \in {\mathbb Q}\) and \(f(x) = \sum _n a_n x^n \in {\mathbb Q}_p\llbracket x\rrbracket \) are such that

(1)
f(x) converges for all x in the open disc of radius \(p^r\) centered around 0, and

(2)
\(f(x) \le 1\) for all such x,
then \({{\mathrm{ord}}}_p(a_n) \ge nr\).
Proof
Write \(g(x) = f(x/p^r)\) which is then a power series which converges on the open unit disc of \({\mathbb C}_p\). This power series is bounded in size by 1 and thus is in \({\mathcal O}_{{\mathbb C}_p}\llbracket x\rrbracket \) (since the Gauss norm equals the sup norm). Thus, \(a_n/p^{rn} \in {\mathcal O}_{{\mathbb C}_p}\) as desired. \(\square \)
Remark 2.14
From Lemma 2.13, we have that \(S_x\) is simply the collection of \({\mathbb Q}_p\)power series which converge on the disc of radius \(p^{c_p}\) and all of whose values have size less than or equal to 1. This gives another way to see that \(S_x\) is a ring.
Theorem 2.15
For \(K_{a,c,m}(z,w)\) as defined in Eq. (2.2), we have
Proof
Write
To prove, \(K_{a,c,m}(z,w) \in S_z \hat{\otimes }R^0\), we must show that \(T_j(z) \in S_z\) for each \(j \ge 0\). Thus, we must show that the coefficient of \(z^i\) in \(T_j(z)\) has padic valuation at least \(ic_p\) for all \(i,j \ge 0\). But this is equivalent to showing that \(p^{i c_p}\) divides \(R_i(w)\) in \(R^0\) for all \(i \ge 0\).
Next fix some \(w_0\) with \(w_0 \le 1\). Then \(K_{a,c,m}(z,w_0)\) is a power series which converges on the open disc of radius \(p^{c_p}\) and all of its values have size bounded by 1 on this disc (by Lemma 2.6). Thus, by Lemma 2.13, \(R_i(w_0)\) has valuation at least \(i c_p\). But since this is true for every \(w_0\) in the closed unit disc, we have that every coefficient of \(R_i(w)\) has valuation at least \(ic_p\) (since the Gauss norm is the same as the sup norm). \(\square \)
Theorem 2.16
We have
Proof
Mimicking the proof of Theorem 2.15, it suffices to show that \(K_{a,c,m}(z,w)\) converges for \(z \le 1\) and \(w < p^{c_p}\) to something of size less than or equal to 1. To this end, recall that
where \(L = \log _\gamma (\frac{a+cz}{\omega (a)})\). Since p  c, we have \(L \in {\mathbb Z}_p\llbracket z\rrbracket \). Thus, for \(z<1\), we have
If \({{\mathrm{ord}}}_p(w) >  c_p\) this expression is always positive and goes to infinity as \(n \rightarrow \infty \) as desired. \(\square \)
The following lemma will be useful later.
Lemma 2.17
Let
Then \(\frac{g(w)}{w}\) is in \(c \cdot S_w^\times \).
Proof
We first show that g(w) is in \(c \cdot S_w\) (and then automatically g(w) / w is in \(c \cdot S_w\)). The coefficient of \(w^n\) in \(K_{a,c,m}(z,w)\) is
where \(L = \log _\gamma (\frac{a+cz}{\omega (a)} )\). We must show that the coefficient of z in this expression has valuation at least \({{\mathrm{ord}}}_p(c) + n c_p\). But this is easy as \({{\mathrm{ord}}}_p( p^n / n!) \ge c_p\) and the coefficient of z in L is always divisible by c.
To finish the proof, it suffices by Lemma 2.12 to check that the coefficient of w in g(w) is in \(c {\mathbb Z}_p^\times \). This coefficient is the same as the coefficient of wz in \(K_{a,c,m}(z,w)\) which is
which is indeed in \(c {\mathbb Z}_p^\times \). \(\square \)
3 Families of overconvergent modular symbols
3.1 Modular symbols
We review here the theory of modular symbols as formulated in [3, 18, 28]. To this end, let \(\Delta _0 := {{\mathrm{Div}}}^0({\mathbb P}^1({\mathbb Q}))\) denote the set of degree zero divisors on \({\mathbb P}^1({\mathbb Q})\) which we endow with a left action of \({{\mathrm{GL}}}_2({\mathbb Q})\) via linear fractional transformations. Let \(\Gamma \) denote a congruence subgroup of \({{\mathrm{SL}}}_2({\mathbb Z})\) and let V denote a right \(\Gamma \)module. We define \({{\mathrm{Symb}}}_{\Gamma }(V)\), the space of Vvalued modular symbols of level \(\Gamma \), to be the collection of additive homomorphisms \(\varphi : \Delta _0 \rightarrow V\) such that \(\varphi (\gamma D) = \varphi (D) \big  \gamma ^{1}\) for all \(\gamma \in \Gamma \) and \(D \in \Delta _0\).
The modules V we will consider in this paper include \({{\mathrm{Sym}}}^k({\mathbb Q}_p^2)\), \({\mathbf {D}}_k\) and \({\mathbf {D}}\hat{\otimes }R\). The first space has an action of \(\Gamma \) while the second two have an action of \(\Gamma _0 := \Gamma \cap \Gamma _0(p)\). Moreover, in each of these cases, one can extend the action of \(\Gamma _0(p)\) to the algebra
and thus define a Heckeaction on the corresponding spaces of Vvalued modular symbols.
The space \({{\mathrm{Symb}}}_{\Gamma }({{\mathrm{Sym}}}^k({\mathbb Q}_p^2))\) is the space of classical modular symbols; the systems of Heckeeigenvalues occurring in this space match those occurring in \(M_{k+2}(\Gamma )\) (see [5, Proposition 2.5]). The space \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}_k)\) is the space of overconvergent modular symbols; the systems of finite slope Heckeeigenvalues occurring in this space essentially match those occurring in \(M^\dag _{k+2}(\Gamma )\), the space of overconvergent modular forms (see [29, Theorem 7.1]).
Lastly, the space \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}\hat{\otimes }R)\) is the space of families of overconvergent modular symbols. Indeed, for each \(\kappa \in W_m\), the map \(\kappa :{\mathbf {D}}\hat{\otimes }R\rightarrow {\mathbf {D}}_\kappa \) induces a Heckeequivariant map
Thus, for \(\widetilde{\Phi }\in {{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}\hat{\otimes }R)\) and \(\kappa \) in \(W_m\), we have that \({\text {sp}}_\kappa (\widetilde{\Phi })\) is a weight \(\kappa \) overconvergent modular symbol, and, moreover, as \(\kappa \) varies, \({\text {sp}}_\kappa (\widetilde{\Phi })\) varies in a padic family.
3.2 Constructing families of overconvergent modular symbols
In this section, we describe a method of producing “random” families of overconvergent modular symbols. Here we follow the methods described in [28, Section 2] to explicitly write down modular symbols.
Proposition 3.1
Assume \(\Gamma _0\) is torsionfree. Then there exist divisors \(D_1, \ldots , D_t\) in \(\Delta _0\) and matrices \(\gamma _1, \ldots , \gamma _t\) in \({{\mathrm{SL}}}_2({\mathbb Z})\) such that for any right \(\Gamma \)module V and any \(\phi \in {{\mathrm{Symb}}}_{\Gamma _0}(V)\), we have
where \(\Delta := \left( {\begin{matrix} 1 &{} 1 \\ 0 &{} 1 \end{matrix}} \right)  1\). Conversely, for any \(v_1, \ldots , v_t\) in V satisfying
there is a unique modular symbol \(\phi \in {{\mathrm{Symb}}}_{\Gamma _0}(V)\) such that
for each j.
Proof
See [28, Corollary 2.7]. \(\square \)
Remark 3.2
In [28] explicit algorithms are given to determine the \(D_i\) and the \(\gamma _i\). This is the socalled process of “solving the Manin relations”. In the end, the \(\gamma _i\) together with the identity matrix form a subset of a full set of right coset representatives for \(\Gamma _0\) in \({{\mathrm{SL}}}_2({\mathbb Z})\), and the \(D_i\) are \({\mathbb Z}[\Gamma ]\)generators of \(\Delta _0\).
Proposition 3.1 gives us a strategy for explicitly writing down families of overconvergent modular symbols. Just randomly pick elements \(v_1, \ldots , v_t\) in \({\mathbf {D}}\hat{\otimes }R\), and then try to solve Eq. (3.1).
We will refer to equations of the form \(w \big  \Delta = v\) as difference equations. These equations were studied in detail in [28, Section 4.2] for the the module \({\mathcal D}^\dag \). The following lemma generalizes the situation to \({\mathcal D}^\dag (R)\).
Lemma 3.3
Let \(\Delta := \left( {\begin{matrix} 1 &{} 1 \\ 0 &{} 1 \end{matrix}} \right)  1\) denote the difference operator. We have

(1)
the map \(\Delta : {\mathcal D}^\dag (R)\rightarrow {\mathcal D}^\dag (R)\) is injective;

(2)
if \(\widetilde{\mu }\in {{\mathrm{im}}}(\Delta )\), then \(\widetilde{\mu }({\mathbf 1}) = 0\);

(3)
for \(\widetilde{\mu }\in {\mathcal D}^\dag (R)\) with \(\widetilde{\mu }({\mathbf 1}) = 0\), there exists a unique \(\widetilde{\nu }\in {\mathcal D}^\dag (R)\) such that \(\widetilde{\nu }\big  \Delta = \widetilde{\mu }\).
Proof
The first part follows verbatim as in [28, Lemma 4.5]. The second part is clear as
since \(K_{1,0,m}(z,w) = 1\). For the last part, just proceed as in [28, Theorem 4.5]. Note that the newly constructed measure \(\mu \) still takes values in \(R\). \(\square \)
Remark 3.4

(1)
The explicit formulas for the solution of the difference equation given in [28, Lemma 4.3] apply equally well in the case of families.

(2)
We note in the above lemma that if \(\widetilde{\nu }\) were in the smaller space \({\mathbf {D}}\hat{\otimes }R\), there is no reason for \(\widetilde{\mu }\) to again be in the \({\mathbf {D}}\hat{\otimes }R\) as denominators naturally appear in the solution of the difference equation. These denominators are the primary reason for considering the space \({\mathcal D}^\dag (R)\) in this paper.
Thus, to solve Eq. (3.1) the only condition which we need to verify is that the right hand side has total measure zero. However, for randomly chosen \(v_i \in {\mathcal D}^\dag (R)\), there is no reason for \(\sum _{j=1}^t v_j\big  (\gamma _j1)\) to have total measure zero. Indeed, we compute
and see that the result is just some power series in \(R\). We do note that by Lemma 2.8, we have \(K_{a,c,0}(z,0) = 1\), and thus this power series specializes to 0 in weight 0 if \(m \equiv 0 \text { }({\text {mod}}\text { }p1)\), that is, this power series is divisible by w if \(m \equiv 0 \text { }({\text {mod}}\text { }p1)\).
In our quest to write down a family of overconvergent modular symbols, we have chosen the \(v_j\) arbitrarily, and thus we still have a great deal of flexibility. The following lemma explains precisely how to choose one of the \(v_j\) more carefully to force the total measure of the right hand side of Eq. (3.1) to vanish. In the following lemma, \(\mu _j\) denotes the distribution whose jth moment is 1 and all of whose other moments vanish.
Lemma 3.5
Let \(v'_1, \ldots , v'_t\) be any elements of and set
If \(m \equiv 0 \text { }({\text {mod}}\text { }p1)\), fix any i between 1 and t, and set
If \(m \not \equiv 0 \text { }({\text {mod}}\text { }p1)\), fix some i between 1 and t such that \(a_i^m \not \equiv 1 \text { }({\text {mod}}\text { }p)\), and set
Then, in either case, for all j, and \(\sum _{j=1}^t v_j\big  (\gamma _j1)\) has total measure zero.
Proof
We begin with the case \(m \equiv 0 \text { }({\text {mod}}\text { }p1)\). We first justify that \(v_i\) is in \({\mathcal D}^\dag (R)\). That is, if \(h = \frac{\partial }{\partial z} K_{a_i,c_i,m}(z,w) \big _{z=0}\), we need to check that \(g/h \in R\). Note that h is simply the coefficient of z in \(K_{a_i,c_i,m}(z,w)\) and is thus in \(S_w\) by Theorem 2.16. However, h is not invertible in \(S_w\). Indeed, by Lemma 2.8 part (2), we have \(h(0) = 0\). Fortunately, g also vanishes at \(w = 0\) by the discussion immediately preceding this lemma (this is where we are using the fact that \(m \equiv 0 \text { }({\text {mod}}\text { }p1)\)). Further, by Lemma 2.17, h / w is in \(c_i \cdot S^\times \). By Remark 3.2, we have \(c_i \ne 0\), and thus h / w is invertible in \(R\). Therefore, \(g/h = (g/w) / (h/w)\) is in \(R\).
For the second part, we compute
as \(\mu _1(K_{a_i,c_i,m}(z,w)) = h\).
Now onto the case of \(m \not \equiv 0 \text { }({\text {mod}}\text { }p1)\). We again justify that \(v_i\) is in \({\mathcal D}^\dag (R)\). That is, if \(h = K_{a_i,c_i,m}(0,w)\), we need to check that \(g/(h1) \in R\). By Theorem 2.16, \(h1\) is in \(S_w\). Further, by Lemma 2.8 part (2), the constant term of \(h1\) is \(\omega (a_i)^m1\) which is a unit by assumption. Thus, by Lemma 2.12, \(h1\) is invertible and thus \(g/(h1) \in R\) as desired.
For the second part, we again compute
as \(\mu _0(K_{a_i,c_i,m}(z,w)) = h\). \(\square \)
Remark 3.6
In the case \(m \not \equiv 0 \text { }({\text {mod}}\text { }p1)\), if it happens that \(a_i^m \equiv 1 \text { }({\text {mod}}\text { }p)\) for every i, then a simple computation shows that \(\sum _{j=1}^t v_j \big  (\gamma _j  1)\) has 0th moment which vanishes in weight 0. We could thus proceed as in the case of \(m \equiv 0 \text { }({\text {mod}}\text { }p1)\). We leave the details to the reader, but we note that in our computations we have never encountered this case. Possibly this case never occurs or only occurs in very small level.
Corollary 3.7
Keeping the notation of Lemma 3.5, there exists \(\widetilde{\Phi }\in {{\mathrm{Symb}}}_{\Gamma _0}({\mathcal D}^\dag (R))\) such that
for each j.
Proof
By Lemmas 3.5 and 3.3, there exists \(v_\infty \in {\mathcal D}^\dag (R)\) such that
Thus, by Proposition 3.1, there exists \(\widetilde{\Phi }\in {{\mathrm{Symb}}}_{\Gamma _0}({\mathcal D}^\dag (R))\) such that \(\widetilde{\Phi }(\{0\}\{\infty \}) = v_\infty \), and
for each j as desired. \(\square \)
Remark 3.8
The assumption that \(\Gamma _0\) be torsionfree is not at all essential. In [28, Section 2.5], there is a discussion on how to deal with torsion elements in constructing modular symbols. Further, the arguments of Lemma 3.5 carry through to this case with just minor changes.
3.3 Ordinary families of overconvergent modular symbols
As Hida families are the primary object of interest in this paper, we now describe how to pass to the ordinary subspace of our spaces of modular symbols. To this end, recall that if X is a compact \({\mathbb Z}_p\)module equipped with a compact operator \(U_p\), we define the ordinary subspace \(X^{{{\mathrm{ord}}}} := \bigcap _n U_p^n X\). Then \(X^{{{\mathrm{ord}}}}\) is the largest subspace of X on which \(U_p\) acts invertibly. If moreover X is profinite, then there is a canonical decomposition \(X = X^{{{\mathrm{ord}}}} \oplus X^{{{\mathrm{nil}}}}\) where \(X^{{{\mathrm{nil}}}}\) is the subspace of X on which \(U_p\) acts topologically nilpotently (see [18, Proposition 2.3]). Moreover, projection onto \(X^{{{\mathrm{ord}}}}\) is given by the operator \(e:=\lim \nolimits _{n\rightarrow \infty } U_p^{n!}\).
Unfortunately, \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0\hat{\otimes }R^0)\) is not a profinite space since the Tate algebra \(R^0\) is not profinite, and thus it is not a priori clear that \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0\hat{\otimes }R^0)\) admits its ordinary subspace as a direct summand. However, \(R^0= {\mathbb Z}_p\langle \langle w \rangle \rangle \) is contained in \(\Lambda :={\mathbb Z}_p[[w]]\) which is a profinite ring. Moreover, viewing \(\Lambda \) as the ring of bounded functions on the open disc of radius 1 / p contained in \(R^0\), we see that this ring is preserved by the action of \(\Sigma _0(p)\), and thus we get a Heckeequivariant inclusion
Further, we obtain a direct sum decomposition into ordinary and nonordinary parts:
In what follows, the space \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )^{{{\mathrm{ord}}}}\) will be our primary object of interest. To ease notation we will denote this space simply by \(X^{{{\mathrm{ord}}}}\).
We note that in the ordinary case no Hecke information should be lost by working on this open disc of radius 1 / p. Indeed, as Hida families extend to all of weight space, one expects that the Heckeeigenvalues of ordinary families of modular symbols should do the same. This fact is stated in the following theorem and proven in Appendix 6.
Theorem 3.9
We have

(1)
\(X^{{{\mathrm{ord}}}}\) is a free \(\Lambda \)module and
$$\begin{aligned} {{\mathrm{rank}}}_\Lambda (X^{{{\mathrm{ord}}}}) = {{\mathrm{rank}}}_{{\mathbb Z}_p}({{\mathrm{Symb}}}_{\Gamma _0}({{\mathrm{Sym}}}^k({\mathbb Z}_p^2))^{{{\mathrm{ord}}}}) \end{aligned}$$for any \(k \equiv m \text { }({\text {mod}}\text { }p1)\),

(2)
for T any Hecke operator,
$$\begin{aligned} {{\mathrm{char}}}(T  X^{{{\mathrm{ord}}}}) \end{aligned}$$has coefficients in \({\mathbb Z}_p[[W]] = {\mathbb Z}_p[[pw]]\); that is, the coefficients of this characteristic polynomial extend to the open unit disc.
3.4 Vector of total measures
In this section, we make the following simple but extremely helpful observation: an element of \(X^{{{\mathrm{ord}}}}\) is completely determined by the total measures of all of its values. Moreover, since a modular symbol is determined by its values on finitely many divisors, one only needs finitely many of these total measures to determine the symbol. We can thus express any element of \(X^{{{\mathrm{ord}}}}\) as a vector with coordinates in \(\Lambda \), and thus reduce many computations with ordinary families of overconvergent modular symbols to computations in a free module over \(\Lambda \).
More precisely, choose \(D_1, \ldots , D_t \in \Delta _0\) which generate \(\Delta _0\) as a \({\mathbb Z}_p[\Gamma _0]\)module (see Remark 3.2). We then define the vector of total measures map
defined by sending \(\Phi \) to the vector \(\left( \Phi (D_i)({\mathbf 1}) \right) _{i=1}^t\).
We note that this construction works equally well for a fixed weight, thus expressing an overconvergent modular symbol as an element of \({\mathbb Z}_p^t\); that is, setting \(X_k^{{{\mathrm{ord}}}}:= {{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0_k)^{{{\mathrm{ord}}}}\), we then have a map
defined exactly as above.
Proposition 3.10
We have

(1)
the map \(\alpha \) is injective;

(2)
the map \(\alpha _k\) is injective;

(3)
the induced map
$$\begin{aligned} \overline{\alpha } : X^{{{\mathrm{ord}}}}\otimes \Lambda / {\mathfrak m}{\longrightarrow } (\Lambda /{\mathfrak m})^t \cong {\mathbb F}_p^t \end{aligned}$$is injective. Here \({\mathfrak m}\) is the maximal ideal of \(\Lambda ={\mathbb Z}_p[[w]]\).
Proof
We first note that part (3) implies part (1). Indeed, if K is the kernel of \(\alpha \), then part (3) implies that \(K \otimes \Lambda / {\mathfrak m}= 0\) and thus \(K=0\). Similarly, part (3) implies part (2). Indeed, by Lemma 6.4,
and reducing the map \(\alpha \) modulo \({\mathfrak p}_k\) yields the map \(\alpha _k\). Moreover, \({\mathfrak m}= {\mathfrak p}_k + p\Lambda \). Thus, if K is now the kernel of \(\alpha _k\), by part (3), \(K \otimes \Lambda /{\mathfrak m}= 0\), and thus \(K=0\).
So it suffices to prove part (3). In fact, from the observations above, it suffices to see that
is injective. That is, it suffices to see that if \(\Phi _k \in X_k^{{{\mathrm{ord}}}}\) with \(\Phi _k(D_i)({\mathbf 1})\) divisible by p for each i, then \(\Phi _k\) is divisible by p. Seeking a contradiction, assume that \(\Phi _k = 1\). Since the \(D_i\) generate \(\Delta _0\), we see that \(\Phi _k(D)({\mathbf 1})\) is divisible by p for every \(D \in \Delta _0\). But then
which is divisible by p since \(\Phi _k (\left( {\begin{matrix} 1 &{} a \\ 0 &{} p \end{matrix}} \right) D)({\mathbf 1})\) is divisible by p by assumption. Thus, \( \Phi _k  U_p  < 1\). But since \(\Phi _k\) is in the ordinary subspace, we have \(\Phi _k = \lim _n \Phi _k  U_p^{n!}\) which implies \( \Phi _k  U_p  =  \Phi _k  =1 \). This contradiction establishes part (3) and completes the proof. \(\square \)
Here’s one example of the usefulness of these vector of total measure maps.
Corollary 3.11
We have that \(\left\{ \Phi _1^{{{\mathrm{ord}}}}, \dots , \Phi _j^{{{\mathrm{ord}}}} \right\} \) can be completed to a \(\Lambda \)basis of \(X^{{{\mathrm{ord}}}}\) if and only if \(\left\{ \overline{\alpha }(\Phi _1^{{{\mathrm{ord}}}}), \dots , \overline{\alpha }(\Phi _j^{{{\mathrm{ord}}}}) \right\} \) is a linearly independent set in \({\mathbb F}_p^t\).
Proof
By Proposition 3.10, \(\overline{\alpha }(\Phi _1^{{{\mathrm{ord}}}}), \dots , \overline{\alpha }(\Phi _j^{{{\mathrm{ord}}}})\) are linearly independent in \({\mathbb F}_p^t\) if and only if the images of \(\Phi _1^{{{\mathrm{ord}}}}, \dots , \Phi _j^{{{\mathrm{ord}}}}\) in \(X^{{{\mathrm{ord}}}}/{\mathfrak m}X^{{{\mathrm{ord}}}}\) are linearly independent. By a compact version of Nakayama’s lemma, this is true if and only if \(\Phi _1^{{{\mathrm{ord}}}}, \dots , \Phi _j^{{{\mathrm{ord}}}}\) is the start of a \(\Lambda \)basis of X. \(\square \)
3.5 Bases of \(X^{{{\mathrm{ord}}}}\) and characteristic polynomials of Hecke operators
We present here a method of computing the characteristic polynomials of Hecke operators acting on the ordinary subspace of \(X^{{{\mathrm{ord}}}}\).
We begin by describing a naive idea of how one can form a \(\Lambda \)basis of \(X^{{{\mathrm{ord}}}}\). We first note that we can assume that we know the \(\Lambda \)rank of \(X^{{{\mathrm{ord}}}}\) as Theorem 3.9 expresses this rank in terms of the \({\mathbb Z}_p\)rank of some classical space of modular symbols which by standard methods is readily computed.^{Footnote 3} Let us assume then that we have in hand elements \(\Phi _1^{{{\mathrm{ord}}}}, \ldots , \Phi _j^{{{\mathrm{ord}}}}\) in \(X^{{{\mathrm{ord}}}}\) which are the start of a \(\Lambda \)basis of \(X^{{{\mathrm{ord}}}}\). We now describe how to extend this set to a full \(\Lambda \)basis by working one element at a time.
To this end, produce some “random” element \(\Phi \) of \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )\)—for instance, using the methods described in Sect. 3.2. Then, as described in Sect. 3.3, by iterating \(U_p\) we can form \(\Phi ^{{{\mathrm{ord}}}}\), the projection of \(\Phi \) onto \(X^{{{\mathrm{ord}}}}\). If \(\Phi _1^{{{\mathrm{ord}}}}, \ldots , \Phi _j^{{{\mathrm{ord}}}}\) together with \(\Phi ^{{{\mathrm{ord}}}}\) still form the beginning of some \(\Lambda \)basis of \(X^{{{\mathrm{ord}}}}\) (which we can test via Corollary 3.11), we have succeeded in extending our partial basis. Otherwise, we produce another “random” symbol \(\Phi \) and continue repeating this process. As long as our method of producing such symbols is sufficiently random, we will eventually find a symbol \(\Phi ^{{{\mathrm{ord}}}}\) which extends our partial basis.
Now, with a \(\Lambda \)basis of \(X^{{{\mathrm{ord}}}}\) in hand, we next want to compute the characteristic polynomials of Hecke operators acting on \(X^{{{\mathrm{ord}}}}\). To do this, we simply need to write down the associated matrix of any Hecke operators with respect to this basis. To this end, for T a Hecke operator, we must be able to write
with \(a_{ij}(w) \in \Lambda \). To find the power series \(a_{ij}(w)\) which solve these equations, one can use the vector of total measures described in Sect. 3.4. Indeed, it suffices to solve
which is now a system of linear equations over \(\Lambda \).
We note that even though the matrix associated to T will be defined over \(\Lambda \), by Theorem 3.9, the characteristic polynomials of these matrices will lie in \({\mathbb Z}_p[[pw]]\), and thus extend to all of weight space.
Lastly we mention that the above method works equally well for the plus and minus subspace \((X^{{{\mathrm{ord}}}})^\pm \) by simply passing to the ±parts of the random symbols produced.
3.6 Restricting to collections of congruent forms
The methods of the previous section describe how to form a basis of \(X^{{{\mathrm{ord}}}}\) and how to compute the Hecke action on this basis. In the special case when the dimension of \((X^{{{\mathrm{ord}}}})^\pm \) is 1, our single basis element is then an eigensymbol, and thus immediately contains the information of families of Hecke eigenvalues. However, it is extremely rare for \((X^{{{\mathrm{ord}}}})^\pm \) to be onedimensional; this only happens for small primes and small tame level. To partially circumvent this problem, we now describe a decomposition of \(X^{{{\mathrm{ord}}}}\) into Hecke stable subspaces, comprising of congruent families, and it is not at all uncommon for pieces of this decomposition to be 1dimensional.
Let \({\mathbb T}\) denote the Hecke algebra over \(\Lambda \) acting on \(X^{{{\mathrm{ord}}}}\). The ring \({\mathbb T}\) is a semilocal ring with \({\mathbb T}\simeq \oplus _{\mathfrak m}{\mathbb T}_{{\mathfrak m}}\) where \({\mathfrak m}\) varies over the maximal ideals of \({\mathbb T}\). This isomorphism induces a Heckeequivariant isomorphism \(X^{{{\mathrm{ord}}}}\simeq \oplus _{{\mathfrak m}} X^{{{\mathrm{ord}}}}_{{\mathfrak m}}\). We now describe how to compute the characteristic polynomials of Hecke operators acting on \(X^{{{\mathrm{ord}}}}_{{\mathfrak m}}\) for each individual maximal ideal \({\mathfrak m}\).
Fix a prime \(\ell \) and let T denote either \(T_\ell \) or \(U_\ell \) depending on whether or not \(\ell \) divides Np. Set \(\overline{f}_{{\mathfrak m},\ell }\) equal to the characteristic polynomial of T acting on \(X^{{{\mathrm{ord}}}}/{\mathfrak m}X^{{{\mathrm{ord}}}}\) which is a polynomial defined over \({\mathbb F}_p\).^{Footnote 4} For a fixed \({\mathfrak m}\), one can find a prime \(\ell \) so that any lift \({f}_{{\mathfrak m},\ell }(T)\) of \(\overline{f}_{{\mathfrak m},\ell }(T)\) to characteristic 0 acts topologically nilpotently on \(X^{{{\mathrm{ord}}}}_{{\mathfrak m}}\) and invertibly on \(X^{{{\mathrm{ord}}}}_{{\mathfrak m}'}\) for all \({\mathfrak m}' \ne {\mathfrak m}\).
Now, to form a basis of \(X^{{{\mathrm{ord}}}}_{\mathfrak m}\), we can simply follow the method of Sect. 3.5 as long as we can produce sufficiently random symbols in \(X^{{{\mathrm{ord}}}}_{\mathfrak m}\). To do this, we form a random symbol \(\Phi \in X^{{{\mathrm{ord}}}}\) and then iterate the Hecke operator \(\prod _{{\mathfrak m}' \ne {\mathfrak m}} f_{{\mathfrak m}',\ell }(T)\) which results in projecting \(\Phi \) to the subspace \(X^{{{\mathrm{ord}}}}_{\mathfrak m}\), as desired.
Again, we mention that this method also works to produce a basis of \((X^{{{\mathrm{ord}}}}_{{\mathfrak m}})^\pm \).
4 Explicit computations with families of OMSs
In Sect. 3, we described methods of computing with ordinary families of overconvergent modular symbols. However, this discussion was all carried out on a theoretical level as a single \(\Phi \in {{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )\) is determined by an infinite amount of information. In order to compute with these families in practice, one must have a systematic method of approximating each \(\Phi \) with a finite amount of data. Moreover, such approximations must be respected by the Hecke operators. In what follows, we describe our method of approximating families of overconvergent modular symbols. Further, we verify that the methods we described in the previous section still carry through with our approximated families.
4.1 Finite approximation modules in families
We begin by reviewing the methods of [28] where a systematic method of approximating elements of \({\mathbf {D}}_k\) was given which was compatible with the \(\Sigma _0(p)\)action. These approximations allowed for explicit computations to be carried out in the space \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}_k)\).
In forming an approximation of a distribution \(\mu \) in \({\mathbf {D}}^0_k\), we note that the naive method of considering the first M moments of \(\mu \) each modulo \(p^M\) is not stable under the matrix action on \({\mathbf {D}}_k\). Instead, in [28], a \(\Sigma _0(p)\)stable filtration on \({\mathbf {D}}^0_k\) was introduced:
and thus one can approximate \(\mu \in {\mathbf {D}}^0_k\) by looking at its image in the finite set \({\mathcal F}_k(M) := {\mathbf {D}}^0_k / {{\mathrm{Fil}}}^M({\mathbf {D}}^0_k)\). Explicitly, one is then approximating a distribution \(\mu \in {\mathbf {D}}^0_k\) by considering its jth moment modulo \(p^{Mj}\) for \(0 \le j \le M\). For this reason, we refer to the \(\Sigma _0(p)\)stable space \({\mathcal F}_k(M)\) as a finite approximation module. The space \({{\mathrm{Symb}}}_{\Gamma _0}({\mathcal F}_k(M))\) is thus a natural space to work in to approximate overconvergent modular symbols.
We seek to generalize this construction to the case of families; that is, we seek a \(\Sigma _0(p)\)stable filtration on \({\mathbf {D}}^0 \hat{\otimes }\Lambda \). One could hope to define a nice filtration on \({\mathbf {D}}^0 \hat{\otimes }\Lambda \) by simply extending the above filtration on \({\mathbf {D}}^0\) by \(\Lambda \)linearity. However, this filtration is not preserved by the \(\Sigma _0(p)\)action defined in Sect. 2.5. Indeed, the \(\Sigma _0(p)\)action on \({\mathbf {D}}^0 \hat{\otimes }\Lambda \) is defined by combining the \({\mathbf A}\)action on \({\mathbf {D}}\) with a weight 0 action. However, \({{\mathrm{Fil}}}^M({\mathbf {D}}^0)\) is not preserved under the \({\mathbf A}\)action. For instance, multiplication by the element z maps \({{\mathrm{Fil}}}^M({\mathbf {D}}^0)\) into \({{\mathrm{Fil}}}^{M1}({\mathbf {D}}^0)\).
We do note however that the subring \({\mathbb Z}_p\llbracket pz\rrbracket \subseteq {\mathbf A}\) does preserve \({{\mathrm{Fil}}}^M({\mathbf {D}}^0)\)—this is immediate from the above definition of \({{\mathrm{Fil}}}^M({\mathbf {D}}^0)\) as multiplication by z simply shifts the moments of a distribution down by one. Moreover, the \(\Sigma _0(p)\)action on \({\mathbf {D}}^0 \hat{\otimes }\Lambda \) does not act through arbitrary elements of \({\mathbf A}\); rather, we are only acting by power series of the form
and thus the only power series in z we need to act by are of the form
But even for \(n=1\), these power series need not be in \({\mathbb Z}_p\llbracket pz\rrbracket \). For instance,
note the troubling term is \(p^{p1} z^p\).
We now turn to Theorem 2.15 to see how far these power series are from being in \({\mathbb Z}_p[[pw]]\). Indeed, this theorem tells us that such power series are in \(S_z\), and thus their jth coefficients have valuation at least \(c_p j\).
We are thus led to modify our filtration at any fixed weight as follows. Set
In \({{\mathrm{Fil}}}^M({\mathbf {D}}^0)\), the sequence of lower bounds on the valuations of the moments was
In \(\widetilde{{{\mathrm{Fil}}}}^M({\mathbf {D}}^0)\), the corresponding sequence begins as
This pattern of \(p1\) terms, with the first two terms stable and the rest decreasing by 1, then continues to repeat.
Lemma 4.1
The weight k action of \(\Sigma _0(p)\) on \({\mathbf {D}}\) preserves \(\widetilde{{{\mathrm{Fil}}}}^M({\mathbf {D}}^0)\).
Proof
We argue as in [28, Prop 7.1] with some small changes. Let \(\mu \in \widetilde{{{\mathrm{Fil}}}}^M {\mathbf {D}}_{k}^0\). For \(j \ge 0\), we must show that \({{\mathrm{ord}}}_p (\mu \big  \gamma )(z^j) \ge M  j \cdot c_p\). We compute
for some \(c_s \in {\mathbb Z}_p\). Since \(\mu \in \widetilde{{{\mathrm{Fil}}}}^M({\mathbf {D}}_{k}^0)\), we have that \({{\mathrm{ord}}}_p \mu (z^s) \ge M  s \cdot c_p\). For \(s \le j\), we then have \({{\mathrm{ord}}}_p \mu (z^s) \ge M  j \cdot c_p\). For \(s \ge j\), an easy computation with the explicit formula above yields that \(c_s\) is divisible by \(p^{sj}\). Thus,
as desired. \(\square \)
Lemma 4.2
The action of \(S_z \subseteq {\mathbf A}\) on \({\mathbf {D}}\) preserves \(\widetilde{{{\mathrm{Fil}}}}^M({\mathbf {D}}^0)\).
Proof
It suffices to check that for \(\mu \in {\mathbf {D}}^0\) and for monomials of the form \(a_n z^n\) such that \({{\mathrm{ord}}}_p(a_n) \ge n \cdot c_p\), we have \(a_n z^n \cdot \mu \in {\mathbf {D}}^0\). To this end, we compute
as desired. \(\square \)
We now simply define a filtration on \({\mathbf {D}}^0 \hat{\otimes }\Lambda \) by:
That is, \(\mu \in \widetilde{{{\mathrm{Fil}}}}^M( {\mathbf {D}}^0 \hat{\otimes }\Lambda )\) if its jth moment is an element of \(\Lambda \) whose padic valuation is at least \(M  j \cdot c_p\), i.e. if when written as a power series in w, all of its coefficients have valuation at least \(M  j \cdot c_p\).
Lemma 4.3
We have that \(\widetilde{{{\mathrm{Fil}}}}^M( {\mathbf {D}}^0 \hat{\otimes }\Lambda )\) is preserved by the \(\Sigma _0(p)\)action.
Proof
The \(\Sigma _0(p)\)action is defined as:
The lemma thus follows from Theorem 2.15, Lemmas 4.1 and 4.2. \(\square \)
We set \(\widetilde{{\mathcal F}}(M) := ({\mathbf {D}}^0 \hat{\otimes }\Lambda )/ \widetilde{{{\mathrm{Fil}}}}^M( {\mathbf {D}}^0 \hat{\otimes }\Lambda )\). Unfortunately, note that \(\widetilde{{\mathcal F}}(M)\) is not finite. Indeed, this module still keeps track of the coefficients of infinitely many powers of w. To fix this, fix \(L>0\), and we define
That is \(\mu \in \widetilde{{{\mathrm{Fil}}}}^{M,L}( {\mathbf {D}}^0 \hat{\otimes }\Lambda )\) if the first L coefficients of its jth moment (thought of as an element of \(\Lambda \)) have valuation at least \(M  j \cdot c_p\). Set
and then
which is finite. We will refer to \(\widetilde{{{\mathrm{Fil}}}}^M({\mathbf {D}}^0 \hat{\otimes }\Lambda )\) as the Mth approximation module and to \(\widetilde{{{\mathrm{Fil}}}}^{M,L}({\mathbf {D}}^0 \hat{\otimes }\Lambda )\) as the (M, L)th finite approximation module.
Remark 4.4
All of the previous discussion goes through equally well if we replace \({\mathbf {D}}^0 \hat{\otimes }\Lambda \) with \({\mathbf {D}}^0\hat{\otimes }R^0\) allowing us to define \(\widetilde{{{\mathrm{Fil}}}}^M({\mathbf {D}}^0\hat{\otimes }R^0)\) and \(\widetilde{{{\mathrm{Fil}}}}^{M,L}({\mathbf {D}}^0\hat{\otimes }R^0)\). We further note that since \(\Lambda / w^L \Lambda \cong R^0/ w^L R^0\), we have
When working with these finite approximation modules, one cannot distinguish \(R^0\) (the Tate algebra) from \(\Lambda \) (the Iwasawa algebra).
4.2 Handling denominators
The one downside to the above formulation of finite approximation modules is that it only allows us to approximate families of distributions whose moments are integral power series. However, in solving the difference equation, the resulting distributions don’t have integral moments (they aren’t even bounded!). To fix this problem, we proceed as in [28, page 29]. Set
Lemma 4.5
We have

(1)
\(\widetilde{K}_{0}(R)\) is a \(\Sigma _0(p)\)module;

(2)
\(p^{M} \widetilde{{\mathcal K}}_{0}(R) \cap {\mathbf {D}}^0\hat{\otimes }R^0= \widetilde{{{\mathrm{Fil}}}}^M({\mathbf {D}}^0\hat{\otimes }R^0).\)
Proof
Part one follows exactly as in Lemma 4.3. Part two follows immediately from the definitions. \(\square \)
We thus have the following alternative description of our approximation modules in families:
Note that these maps are \(\Sigma _0(p)\)isomorphisms. Thus, as long as we are working with distributions in \({\mathbf {D}}^0\hat{\otimes }R^0+ p^M \widetilde{{\mathcal K}}_0(R)\), it makes sense to project to the Mth approximation module.
4.3 Solving the difference equation in \(\widetilde{{\mathcal F}}(M)\)
We now use the description of \(\widetilde{{\mathcal F}}(M)\) given in Sect. 4.2 to explain how one solves the difference equation in these approximation modules. We first review the case of a fixed weight, and then discuss the case of families.
4.3.1 The case of a fixed weight
The following is a slight improvement on [28, Lemma 7.5]. We refer to loc. cit. for undefined notation. In what follows, set
Lemma 4.6
Let \(\mu \in {\mathcal D}^\dag \) and \(\nu \in {\mathbf {D}}^0\) with \(\mu  \Delta = \nu \). Then for any \(M \ge 0\), we have
where \(m = \left\lfloor \frac{\log (M+1)}{\log p} \right\rfloor \).
Proof
By the explicit solution of the difference equation given in [28, Theorem 4.5], it suffices to see for all \(j \ge 1\) that
We must thus check that for \(r \le M\), we have
and for \(r > M\), we have
We start with the case \(r \le M\). To see (4.2), it suffices to see that
for \(r \ge j1\). We thus have that \(M \ge j1\) and so \(\frac{p^m}{j} \in {\mathbb Z}_p\). If \(\frac{p^m}{j} \in p{\mathbb Z}_p\), then we are done by the Clausen–von Staudt theorem as each Bernoulli number is in \(\frac{1}{p} {\mathbb Z}_p\). Thus, we just need to consider the case where \(j = a p^m\) with \(1 \le a \le p1\) and deduce that \(\genfrac(){0.0pt}{}{r}{j1} b_{rj+1} \in {\mathbb Z}_p\).
If \(r=j1\) we are done as \(b_0 =1\). Then, for \(r>j1\), we have that \(b_{rj+1}\) has a p in its denominator if and only if \(p1\) divides \(rj+1\). In this case, we have
and we must deduce that \(\genfrac(){0.0pt}{}{r}{ap^m1}\) is divisible by p. By Lucas’ theorem, it suffices to see that one of the base p digits of \(ap^m1\) is greater than one of the base p digits of r. The base p expansion of \(ap^m 1\) is \((a1~p1~p1 \ldots \, p1)_p\). Since \(r \le M < p^{m+1}\), the only possibility that r has every base p digit larger than those of \(ap^m1\) is if the base p representation of r is \((c~p1~~p1~~\ldots ~~p1)_p\) with \(p1 \ge c > a1\). In this case, \(r = (c+1)p^m1\). But then \(r \equiv c \text { }({\text {mod}}\text { }p1)\) which is impossible as \(r \equiv a1 \text { }({\text {mod}}\text { }p1)\).
Now in the second case where \(r > M\), set \(s = rM\). To see (4.3), if suffices to see that
for \(r \ge j1\). Note that
and thus \(\frac{p^{m+s}}{j}\) is divisible by p. Again, by the Clausen–von Staudt theorem, we are done.
\(\square \)
Corollary 4.7
Let \(\overline{\nu } \in {\mathcal F}_k(M)\) have total measure 0. Then there exists \(\overline{\mu } \in {\mathcal F}_k(M)\) such that
where \(m = \left\lfloor \frac{\log (M+1)}{\log p} \right\rfloor \).
Proof
Lift \(\overline{\nu }\) to some element \({\nu }\) in \({\mathbf {D}}^0\) with total measure 0. Solving the difference [28, Theorem 4.5], then yields \({\mu } \in {\mathcal D}^\dag \) with \({\mu } \big  \Delta = {\nu }\). Then by Lemma 4.6, we have \(\mu \in {\mathbf {D}}^0+ p^M {\mathcal K}_0\). Projecting \(\mu \) to \(({\mathbf {D}}^0+ p^M {\mathcal K}_0)/ p^M {\mathcal K}_0 \cong {\mathbf {D}}^0/ {{\mathrm{Fil}}}^M({\mathbf {D}}^0)\) then yields a solution to the difference equation in the finite approximation module. \(\square \)
We note that the above corollary tells us the existence of solution to the difference equation in \({\mathcal F}(M)\). We now describe how to explicitly write down such a solution. Moreover, by analyzing this explicit solution, we will see that a smaller power of p is needed to control denominators.
To start, we note that the solution to the difference equation in \({\mathcal F}(M)\) is not unique.
Lemma 4.8
If \(\mu \in {{\mathrm{Fil}}}^{M1}({\mathbf {D}}^0)\), then \(\mu \big  \Delta \in {{\mathrm{Fil}}}^M({\mathbf {D}}^0)\).
Proof
Take \(\mu \in {{\mathrm{Fil}}}^{M1}({\mathbf {D}}^0)\), and we compute
Since \(\mu (z^i) \in p^{M1i}{\mathbb Z}_p\) for i between 0 and \(j1\), we see that \((\mu \big  \Delta )(z^j) \in p^{M1(j1)} {\mathbb Z}_p= p^{Mj}{\mathbb Z}_p\). Thus \(\mu \big  \Delta \in {{\mathrm{Fil}}}^{M}({\mathbf {D}}^0)\). \(\square \)
Proposition 4.9
Take \(\overline{\nu } \in {\mathcal F}_k(M)\) with total measure zero, and set \(m = \left\lfloor \frac{\log (M)}{\log p} \right\rfloor \). Define \(\overline{\mu }_0 \in {\mathcal F}_k(M1)\) by
for \(0 \le r \le M2\). If \(\overline{\mu } \in {\mathcal F}_k(M)\) is any element which projects to \(\overline{\mu }_{0}\) in \({\mathcal F}_k(M1)\), then \(\overline{\mu } \big  \Delta = p^m \overline{\nu }\) in \({\mathcal F}_k(M)\).
Proof
We first note that the formula defining \(\overline{\mu }_{0}(z^r)\) makes sense. To see this, note that
by the proof of Lemma 4.6. Further, \(\overline{\nu }(z^j)\) is welldefined modulo \(p^{Mj}\), and thus \(\overline{\mu }_0(z^r)\) is welldefined modulo \(p^{M1r}\). Hence, \(\overline{\mu }_0\) is a welldefined element of \({\mathcal F}(M1)\).
Next, let \(\nu \) denote any lift of \(\overline{\nu }\) to \({\mathbf {D}}^0\) with total measure 0, and let \(\mu \in {\mathcal D}^\dag \) be the unique distribution satisfying \(\mu \big  \Delta = p^m \nu \) (by [28, Theorem 4.5]). Then the image of \(\mu \) in \({\mathcal F}(M1)\) equals \(\overline{\mu }_0\) since the explicit formulas in [28, Theorem 4.5] exactly match the formulas defining \(\overline{\mu }_0\) in this proposition. (Note that our choice of m allows us to form this projection.) Thus, for any \(\overline{\mu } \in {\mathcal F}(M)\) lifting \(\overline{\mu }_0\), we have that the image of \(\mu \) in \({\mathcal F}(M)\) equals \(\overline{\mu }\) up to some distribution taking values in \({{\mathrm{Fil}}}^{M1}({\mathbf {D}}^0)\). Our proposition then follows from Lemma 4.8. \(\square \)
4.3.2 The difference equation in \(\widetilde{{\mathcal F}}(M)\)
We now generalize the discussion of the previous section to the case of families.
Lemma 4.10
Let \(\mu \in {\mathcal D}^\dag (R)\) be such that \(\mu \big  \Delta = \nu \in {\mathbf {D}}^0 \hat{\otimes }\Lambda \). Then for any \(M \ge 0\), we have
where \(m = \left\lfloor \frac{\log (M/c_p+1)}{\log p} \right\rfloor \).
Proof
The proof proceeds nearly the same as in Lemma 4.6. Indeed, we still have an explicit solution to the difference equation, and so it suffices to see that
By definition, this means we need to check that for \(r \le M/c_p\), we have
and for \(r > M/c_p\), we have
Analyzing these two cases then follows exactly as in Lemma 4.6. \(\square \)
Thus, by scaling our distributions by a small power of p we will be able to solve the difference equation in these finite approximation modules.
Corollary 4.11
Let \(\overline{\nu } \in \widetilde{{\mathcal F}}(M)\) have total measure 0. Then there exists \(\overline{\mu } \in \widetilde{{\mathcal F}}(M)\) such that
where \(m = \left\lfloor \frac{\log (M/c_p+1)}{\log p} \right\rfloor \).
Proof
The proof follows verbatim as in Corollary 4.7. \(\square \)
Just as in the fixed weight case, the solution to the difference equation is not unique in \(\widetilde{{\mathcal F}}(M)\). Unfortunately, the analogue of Lemma 4.8 (which describes part of the kernel of \(\Delta \)) does not quite hold in families. We instead just state a slightly weaker version of Proposition 4.9 in families.
Proposition 4.12
Take \(\overline{\nu } \in \widetilde{{\mathcal F}}(M)\) with total measure zero, and set \(m = \left\lfloor \frac{\log (M/c_p)}{\log p} \right\rfloor \). Define \(\overline{\mu } \in \widetilde{{\mathcal F}}(M1)\) by
for \(0 \le r < M/c_p1\). Then \(\overline{\mu } \big  \Delta = p^m \overline{\nu }\) in \(\widetilde{{\mathcal F}}(M1)\).
Proof
To see that the above formulas yield a welldefined element of \(\widetilde{{\mathcal F}}(M1)\), note that
by the proof of Lemma 4.6. Further, \(\overline{\nu }(z^j)\) is welldefined modulo \(p^{\lceil Mjc_p\rceil }\). From the above formula, we then see that \(\overline{\mu }(z^r)\) is welldefined modulo \(p^{\lceil M(r+1)c_p\rceil } = p^{\lceil Mc_p r c_p\rceil }\). Hence, \(\overline{\mu }(z^r)\) is welldefined modulo \(p^{\lceil M1r c_p\rceil }\), and \(\overline{\mu }\) is a welldefined element of \({\mathcal F}(M1)\).
Next, let \(\nu \) denote any lift of \(\overline{\nu }\) to \({\mathbf {D}}^0\hat{\otimes }R^0\) with total measure 0, and let \(\mu \in {\mathcal D}^\dag (R)\) be the unique distribution satisfying \(\mu \big  \Delta = p^m \nu \) (by Lemma 3.3). Then the image of \(\mu \) in \(\widetilde{{\mathcal F}}(M1)\) equals \(\overline{\mu }\) since the explicit formulas defining \(\mu \) exactly match the formulas defining \(\overline{\mu }\) in this proposition. By Lemma 4.10, \(\mu \in {\mathbf {D}}^0\hat{\otimes }R^0+ p^{M1} \widetilde{{\mathcal K}}_0(R)\), and thus projecting to \(\widetilde{{\mathcal F}}(M1)\) gives the desired result. \(\square \)
4.4 The ordinary subspace of \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M))\)
Since \(\widetilde{{\mathcal F}}(M)\) is defined by taking the reduction modulo various powers of p, the space \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M))\) has the potential to have a complicated structure even as a \({\mathbb Z}_p\)module. However, if we restrict to the ordinary subspace, the following proposition proves that passing to the Mth approximation module is equivalent to reducing modulo \(p^M\).
Proposition 4.13
The natural map \({\mathbf {D}}^0 \hat{\otimes }\Lambda \rightarrow \widetilde{{\mathcal F}}(M)\) induces an isomorphism
and thus
In particular, \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M,L))^{{{\mathrm{ord}}}}\) is a free \({\mathbb Z}_p[w]/(p^M,w^L)\)module, and for T a Hecke operator
Proof
We first show that
is surjective. To this end, consider the exact sequence
Identifying \({{\mathrm{Symb}}}_{\Gamma _0}(V)\) with \(H^1_c(\Gamma _0,V)\) and invoking the long exact sequence for cohomology, it suffices to show that \(H^2_c(\Gamma _0,\widetilde{{{\mathrm{Fil}}}}^M({\mathbf {D}}^0) \hat{\otimes }\Lambda )^{{{\mathrm{ord}}}} = 0\). But
and
which vanishes as in the proof of Lemma 6.4.
To check injectivity of the map of this proposition, we must take \(\Phi \in X^{{{\mathrm{ord}}}}\) which takes values in \(\widetilde{{{\mathrm{Fil}}}}(M)\), and show that \(\Phi \) is divisible by \(p^M\). To this end, take the largest possible r such that \(p^r\) divides \(\Phi \), and assume that \(r<M\). Then \(\Psi := p^{r} \Phi \) has size 1 and takes values in
But since \(Mr>0\), this means that the total measure of each value of \(\Psi \) is divisible by p. Arguing as in Proposition 3.10, we then have that \(\Psi  U_p <1\). This is a contradiction since \(\Psi \) is in the ordinary subspace.
The remainder of the proposition all follows formally from the first claim and the fact that \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )\) is a free \(\Lambda \)module. \(\square \)
4.5 Vector of total measures
As before, fix \(D_1, \ldots , D_t \in \Delta _0\) which generate \(\Delta _0\) as a \({\mathbb Z}_p[\Gamma _0]\)module. We again define a vector of total measures map, but now for \(\widetilde{{\mathcal F}}(M,L)\)valued symbols. Set
defined by sending \(\Phi \) to the vector \(\left( \Phi (D_i)({\mathbf 1}) \right) _{i=1}^t\).
Proposition 4.14
The map \(\alpha _{M,L}\) is injective.
Proof
By Proposition 4.13, \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M,L))^{{{\mathrm{ord}}}}\) is simply the reduction of \(X^{{{\mathrm{ord}}}}\) modulo \((p^M,w^L)\Lambda \). Likewise, \(\alpha _{M,L}\) is simply the reduction of \(\alpha \) (from Sect. 3.4) modulo \((p^M,w^L) \Lambda \). Thus, if K is the kernel of \(\alpha _{M,L}\), by Proposition 3.10, we have \(K \otimes \Lambda / {\mathfrak m}= 0\). Thus, \(K = 0\) and \(\alpha _{M,L}\) is injective. \(\square \)
Corollary 4.15
We have \(\left\{ \Phi _1^{{{\mathrm{ord}}}}, \ldots , \Phi _j^{{{\mathrm{ord}}}} \right\} \) is the start of \(\Lambda / (p^M,w^L) \Lambda \)basis of \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M,L))^{{{\mathrm{ord}}}}\) if and only if \(\left\{ \alpha _{1,1}(\Phi _1^{{{\mathrm{ord}}}}), \ldots , \alpha _{1,1}(\Phi _j^{{{\mathrm{ord}}}}) \right\} \) is a linearly independent set in \((\Lambda / (p,w) \Lambda )^t \cong {\mathbb F}_p^t\).
Proof
The same argument in Corollary 3.11 applies (invoking Proposition 4.14 instead of Proposition 3.10). \(\square \)
4.6 Characteristic polynomials of Hecke operators
In Sect. 3.5, we sketched a method of computing the characteristic polynomials of Hecke operators acting on \(X^{{{\mathrm{ord}}}}\). In this section, we explain how to carry this method out in practice in the finite spaces \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M,L))^{{{\mathrm{ord}}}}\). Recall that by Proposition 4.13, for T a Hecke operator
Thus, we can (in theory) recover the true characteristic polynomials to any degree of accuracy by taking M and L large enough.
The method of Sect. 3.5 to form a basis \(X^{{{\mathrm{ord}}}}\) was to produce “random” elements of \(X^{{{\mathrm{ord}}}}\) until one was the start of a \(\Lambda \)basis of the space. Then produce random elements until one has two elements forming the start of a \(\Lambda \)basis. Continue this until we have a full basis (whose size we know as in footnote 3).
To carry this method out in \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M,L))^{{{\mathrm{ord}}}}\), we note that we can form elements in \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M,L))\) as described in Sect. 3.2. Note that this requires solving the difference equation in \(\widetilde{{\mathcal F}}(M,L)\) which is done explicitly in Proposition 4.12. To form elements of \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M,L))^{{{\mathrm{ord}}}}\) one then just needs to iterate the \(U_p\)operator.^{Footnote 5}
Further, to determine when elements of \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M,L))^{{{\mathrm{ord}}}}\) are the start of a \(\Lambda / (p^M,w^L) \Lambda \)basis, we can invoke Corollary 4.15 and examine the associated vectors of total measures modulo \((p,w)\Lambda \). If these vectors are linearly independent over \({\mathbb F}_p\), then the original elements are the start of a basis.
Lastly, if we have a basis \({\mathcal B}= \{\Phi _1^{{{\mathrm{ord}}}}, \ldots , \Phi _d^{{{\mathrm{ord}}}}\}\) of \({{\mathrm{Symb}}}_{\Gamma _0}(\widetilde{{\mathcal F}}(M,L))^{{{\mathrm{ord}}}}\) over \(\Lambda /(p^M,w^L)\) in hand, we describe now how to compute the matrix of a Hecke operator T with respect to \({\mathcal B}\) (which in particular gives the characteristic polynomial of T). To this end, we have
for some \(a_{ij} \in \Lambda / (p^M,w^L)\Lambda \), and our job is to find the \(a_{ij}\). Applying \(\alpha _{M,L}\), the vector of total measures map, we get
Since \(\alpha _{M,L}\) is injective (Proposition 4.14), any solutions to (4.7) will also be solutions to (4.6). Thus we have reduced our question to solving linear equations over \(\Lambda / (p^M,w^L) \Lambda \).
Since the maximal ideal of \(\Lambda / (p^M,w^L) \Lambda \) is not principal, solving linear equations over this ring is not as simple as over, say, \({\mathbb Z}/p^M{\mathbb Z}\). So we include here at least a few words about how one can do this. Assume we have a consistent system of linear equations over \(\Lambda / (p^M,w^L) \Lambda \):
that is, the \({\mathbf v}_i(w)\) and \({\mathbf u}(w)\) in \(\left( \Lambda / (p^M,w^L) \Lambda \right) ^t\) are given and we must find \(a_i(w)\) in \(\Lambda / (p^M,w^L) \Lambda \) solving this equation. Evaluating at \(w=0\) yields
which is a consistent system over \({\mathbb Z}/p^M{\mathbb Z}\). Standard methods then give us the values of \(a_i(0)\) for each i. Then differentiating and evaluating at \(w=0\) gives:
This is another system of linear equations over \({\mathbb Z}/p^M{\mathbb Z}\) (with the \(a'_i(0)\) as the unknowns) which we can again solve. Repeating this method gives the values of each derivative of \(a_i(w)\) at \(w=0\). From this information, we can recover \(a_i(w)\) for each i as desired.
5 Data and examples
In this section, we describe some sample computations that we have carried out with the algorithms implemented as part of this project. Specifically, we include computations of formal qexpansions of Hida families, the structure of Hida–Hecke algebras, Linvariants of modular forms and their symmetric squares, and twovariable padic Lfunctions of Hida families. For the sake of presentation, we have elided some of the data in this section; the full data is presented in Appendix 7.
We briefly indicate about how much time and space each example took to compute on a ‘member server’ on the SageMathCloud in Sage 5.11. These are meant to be ballpark estimates; for instance, we ran Example 5.2 several times taking between 6 minutes and 10 minutes with this difference attributed mostly to the varying load of the server.
5.1 Examples of qexpansions in families
We may view a pordinary family of eigenforms as a formal qexpansion
where the \(a_n(k)\) are Iwasawa functions of the padic weight variable k.
If \(\mathcal {F}\) is a normalized (i.e. \(a_1=1\)) eigenform, then the \(a_n(k)\) are determined by the \(a_\ell (k)\), for \(\ell \) prime, using the standard Hecke operator recurrence relations. We may thus obtain the qexpansion from knowing the \(a_\ell (k)\). Furthermore, \(a_\ell (k)\) is the Hecke eigenvalue of the Hecke operator \(T_\ell \). Thus, if \(\Phi \) is a family of Hecke eigensymbols, we may compute the corresponding \(a_\ell \) by comparing \(\Phi T_\ell \) with \(\Phi \).
More precisely, since \(a_\ell (k)\) is an Iwasawa function, there exist \(A_\ell (W)\in {\mathbb Z}_p\llbracket W \rrbracket \) such that \(a_\ell (k)=A_\ell ((1+p)^k1)\). Comparing \(\Phi T_\ell \) with \(\Phi \) directly yields \(A_\ell (W)\), and then a simple substitution yields \(a_\ell (k)\). In fact, our computations take place in the larger ring \({\mathbb Z}_p\llbracket w \rrbracket = {\mathbb Z}_p\llbracket W/p \rrbracket \). However, by Theorem 3.9, we know that the eigenvalues \(A_\ell (w)\) must land in the subring \({\mathbb Z}_p\llbracket W \rrbracket \).
Below, we provide the first few \(a_\ell \) for some examples of families passing through specific newforms. See Sects. 3.5 and 3.6 for the method used to isolate these examples. We remind the reader that specializing the variable k below to a specific nonnegative integer \(k_0\) gives modular forms of weight \(k_0+2\).
Example 5.1
Let \(p=5\) and \(N=1\) in weights congruent to 2. In this case \(X^{{{\mathrm{ord}}}}\) is 1dimensional and this dimension is entirely explained by the ordinary Eisenstein family. Using the methods of Sect. 3.5, we can produce a basis of this space and since this basis is 1dimensional, its unique element \(\Phi \) is an eigensymbol. By comparing \(\Phi  T_\ell \) with \(\Phi \) we can thus compute families of ordinary Eisenstein eigenvalues. For example, we compute that:
We note that in this case, we have an exact formula for \(a_\ell (k)\) as \(a_\ell (k) = 1 + \omega (\ell )\langle \ell \rangle ^{k+1}\) for \(\ell \ne 5\). Expanding as a power series, we get
which we note matches perfectly with the computations listed above. At \(\ell =p=5\), the Hecke eigenvalue is the constant 1 in the family and our computations witness this.
Example 5.2
Let \(p=11\) and let f be the unique cuspidal newform of weight 2 and level \(\Gamma _0(11)\) (i.e. the one corresponding to the elliptic (and modular) curve \(X_0(11)\)). In this case, \(X^{{{\mathrm{ord}}}}\) is 3dimensional. As in the previous example, one dimension is explained by the ordinary Eisenstein family. The other two dimensions are explained by the Hida family passing through f with this family contributing one dimension to each of the plus and minus subspaces. Since the Eisenstein family lands in the plus subspace, we focus on the minus subspace as \((X^{{{\mathrm{ord}}}})^\) is 1dimensional. Again, a basis of this space is automatically an eigensymbol and from this eigensymbol we computed \(a_\ell \) for \(\ell \le 11\). The full data is included in Appendix 7.1. A sample eigenvalue is
Note that plugging in \(k=122\) into the data, one can verify that the eigenvalues agree with those of (the ordinary 11stabilization of) the modular discriminant \(\Delta \) (up to precision \(11^{11}\)).
This example took about 7 min: computing the eigenfamily took about 5.5 min and computing the 5 eigenvalues took about 1.5 min. Computing the modular symbol space and the eigenfamily used about 55 MB while the eigenvalues used about 45 MB.
Example 5.3
Let \(p=5\) and let f be the unique newform of weight 2 and level \(\Gamma _0(15)\) (i.e. the one corresponding to the elliptic (and modular) curve \(X_0(15)\)). The minus subspace is again 1dimensional so a basis of it is an eigensymbol from which we computed \(a_\ell \) for \(\ell \le 11\). The full data is included in Appendix 7.1. A sample eigenvalue is
Plugging in \(k=62\), one can verify that the eigenvalues agree with those of (the ordinary 5stabilization of) of the unique newform of weight 6 and level \(\Gamma _0(3)\) (up to precision \(5^{10}\)). In weight 22 and level \(\Gamma _0(3)\), there are three Galois conjugacy classes of newforms; their Hecke eigenvalue fields are \({\mathbb Q}, {\mathbb Q}\), and \({\mathbb Q}(\sqrt{11\cdot 59})\), respectively. The first two are not 5ordinary. The third Galois conjugacy class is only ordinary under one of the two embeddings of \({\mathbb Q}(\sqrt{11\cdot 59})\) into \({\mathbb Q}_5\). We plugged in \(k=222\) to the computed \(a_\ell \) and they agreed with the Hecke eigenvalues of this weight 22 and level \(\Gamma _0(3)\) newform (up to precision \(5^{11}\)).
This example took about 11 min: 7 min for the eigenfamily and 4 min for the five eigenvalues. The eigenfamily computation used 45 MB and the eigenvalues used 120 MB.
Example 5.4
Let \(p=5\) and let f be the ordinary 5stabilization of the (unique) newform of weight 2 and level \(\Gamma _0(19)\) (i.e. the one corresponding to the elliptic (and modular) curve \(X_0(19)\)). In this case, \((X^{{{\mathrm{ord}}}})^\) is 8dimensional. However, if \({\mathfrak m}\) is the maximal ideal corresponding to f, we have that \((X^{{{\mathrm{ord}}}}_{\mathfrak m})^\) is 1dimensional. Indeed, \(a_5(f) \equiv 3 \pmod {5}\) and \(a_5(g) \equiv \pm 1 \pmod {5}\) for all of the remaining eigenforms (as they are all new at 5). In particular, the operator \(T_5^21\) acts invertibly on \((X^{{{\mathrm{ord}}}}_{\mathfrak m})^\) and topologically nilpotently on \((X^{{{\mathrm{ord}}}}_{{\mathfrak m}'})^\) for all \({\mathfrak m}' \ne {\mathfrak m}\). In particular, using the methods of Sect. 3.6, we can form a basis of \((X^{{{\mathrm{ord}}}}_{{\mathfrak m}})^\) and obtain an eigensymbol as this space is 1dimensional. The \(a_\ell \) for \(\ell \le 19\) are included in Appendix 7.1. Here is \(a_5\):
In weight 6 and level \(\Gamma _0(19)\), there are four Galois conjugacy classes of newforms; their Hecke eigenvalue fields are \({\mathbb Q}, {\mathbb Q}\), \({\mathbb Q}(\sqrt{3\cdot 59})\), and \(K_4\), respectively, where \(K_4\) is a totally real \(S_4\)quartic extension of discriminant \(101148696=2^3 \cdot 3^3 \cdot 11 \cdot 42571\). All of these are ordinary at 5. The fourth Galois conjugacy class has one embedding into \({\mathbb Q}_5\) (the remaining three are into \({\mathbb Q}_{5^3}\)) and the corresponding newform is the only one whose \(a_5\) is congruent to the \(a_5\) of the elliptic curve \(X_0(19)\). Plugging in \(k=62\) in the computed \(a_\ell \) agrees with the qexpansion of this newform (up to precision \(5^{7}\)).
This example took about 58 min: 35 min for the eigenfamily and 23 min for the eight eigenvalues. The eigenfamily computation used 77 MB and the eigenvalues used 719 MB.
Example 5.5
Let \(p=5\). There are two Galois conjugacy classes of weight 2 newforms of level \(\Gamma _0(95)\); their Hecke eigenvalue fields are \(K_3\) and \(K_4\), where \(K_3\) is the unique (real) cubic field of discriminant 148, and \(K_4\) is the unique totally real quartic field of discriminant 11344. These represent the remaining 7 dimensions in \((X^{{{\mathrm{ord}}}})^{}\) from the previous example. In this example, we deal with the conjugacy class with cubic Hecke eigenvalue field as its \(a_5\) is 1 (while the other conjugacy class has \(a_5=1\)) and hence the forms in this class will have an Linvariant (studied in Sect. 5.3 below). The field \(K_3\) has only one embedding into \({\mathbb Q}_5\) (the remaining embeddings landing in \({\mathbb Q}_{5^2}\)). We let f be the newform corresponding to the embedding in \({\mathbb Q}_5\) and remark it is congruent to an Eisenstein series. We may isolate it from its two other conjugates (as in Sect. 3.6) as its \(a_{61}\) is \(2\text { }({\text {mod}}\text { }5)\), whereas its conjugates have \(a_{61}\equiv 3\text { }({\text {mod}}\text { }5)\). The \(a_\ell \) for \(\ell \le 11\) are included in Appendix 7.1. Here is \(a_5\), which is used to compute the Linvariant below:
This example took about 4 h and 25 min: 4 h and 20 min for the eigenfamily and 5 min for the five eigenvalues. The eigenfamily computation used 1150 MB and the eigenvalues used 110 MB.
Example 5.6
For a bigger example, consider \(p=11\) with tame level \(N=31\). We take f to be 11stabilization of the unique weight two newform on \(\Gamma _0(31)\). This form has coefficients in \({\mathbb Q}\). The space \((X^{{{\mathrm{ord}}}})^\) is 29dimensional in this case, but its localization at the maximal ideal corresponding to f is 1dimensional. We isolate the Hida family through f by iterating the operators \(T_28,U_{11}1,U_{11}+1\). The \(a_\ell \) for \(\ell \le 11\) are included in Appendix 7.1. Here is the \(a_{11}\) in the family:
This example took about 6 h and 37 min: 6 h and 20 min for the eigenfamily and 17 min for the five eigenvalues. The eigenfamily computation used 370 MB and the eigenvalues used 290 MB.
5.2 The structure of Hida algebras
Here, we summarize some computations of the structure of the connected components of Hida algebras.
Example 5.7
Let \(p=3\) and \(N=11\). In this case, there are two 3ordinary cuspforms in any even weight. In weight 2, one of these forms comes from the ordinary 3stabilization of the cuspform associated to \(X_0(11)\). The other form is the unique newform of level 33, and, moreover, these two forms admit a congruence modulo 3. Note then \((X^{{{\mathrm{ord}}}})^\) is 2dimensional with \((X^{{{\mathrm{ord}}}}_{\mathfrak m})^ \simeq (X^{{{\mathrm{ord}}}})^\) making this example fundamentally different from the examples in Sect. 5.1 where we were always able to cut down to a 1dimensional space.
Nonetheless, we know that \({\mathbb T}_{\mathfrak m}\) is a \(\Lambda \)algebra of rank 2 and we seek to understand its structure. From a geometric perspective, possibilities for \({{\mathrm{Spec}}}({\mathbb T}_{\mathfrak m})\) include two copies of weight space glued to together at a finite collection of points (possibly only at the point of characteristic p) or a ramified cover of weight space ramified at finitely many points.
To further understand \({\mathbb T}_{\mathfrak m}\), consider a Hecke operator \(T_\ell \) (or \(U_\ell \)) acting on \((X^{{{\mathrm{ord}}}})^\) and let \(f_\ell \) denote its characteristic polynomial, which is a monic polynomial of degree 2 over \(\Lambda ={\mathbb Z}_p\llbracket W\rrbracket \). Let \(d_\ell (W) \in \Lambda \) denote the discriminant of this polynomial. By padic Weierstrass preparation, we can write
where \(P_\ell (W)\) is a distinguished polynomial of degree say \(\lambda _\ell \) and \(V_\ell (W)\) is a unit power series. We can use information from this decomposition to understand \({\mathbb T}_{\mathfrak m}\). For example, if \(\lambda _\ell \) is odd, then \(\Lambda [T_\ell ]\) is a ramified extension of \(\Lambda \) forcing \({\mathbb T}_{\mathfrak m}/ \Lambda \) to be ramified.
Using the methods outlined in Sect. 3.5, we computed approximations to the characteristic polynomials \(f_\ell \) described above. For example, for \(\ell = 2\), we computed this discriminant to be:
Note that this power series has \(\lambda \)invariant 1 and thus has a unique root \(\alpha _2\) which is defined over \({\mathbb Z}_p\). We explicitly found the following approximation to \(\alpha _2\):
We thus get that
where \(V_\ell (0) \equiv 1 \pmod {p}\). In particular, \(V_\ell (W)\) is a square in \(\Lambda \) and we see that \(d_2(W)\) and \((W\alpha _2)\) differ multiplicatively by a square. In particular, \(\Lambda [T_2] = \Lambda [\sqrt{(W\alpha _2)}]\)
Now for R a ring which is a finite and free \(\Lambda \)module, let \({{\mathrm{disc}}}(R) \subseteq \Lambda \) denote its discriminant ideal. Write \(M \in M_2(\Lambda )\) for the change of basis matrix corresponding to the embedding \(\Lambda [T_2] \subseteq {\mathbb T}_{\mathfrak m}\) both of which are free \(\Lambda \)modules of rank 2. We then have
Since \({{\mathrm{disc}}}(\Lambda [T_2]) = (W\alpha _2)\Lambda \) is a squarefree ideal, we must have that \(\det (M)\) is a unit and \(\Lambda [T_2] = {\mathbb T}_{\mathfrak m}\). In particular, \({\mathbb T}_{\mathfrak m}\simeq \Lambda [\sqrt{(W\alpha _2)}]\).
As a check, we computed \(d_\ell (W)\) for all primes \(\ell < 11\). In each case, \(d_\ell (W)\) had \(\lambda \)invariant equal to 1, and its unique root \(\alpha _\ell \) was congruent to \(\alpha _2\) modulo the precision of the computation.
Computing the basis of the twodimensional \((X^{{{\mathrm{ord}}}}_{\mathfrak m})^\) took 27.5 minutes and 107MB, while computing the Hecke polynomials for primes \(\le 11\) took 33.5 minutes and 462MB.
Example 5.8
Let \(p=37\) and \(N=1\). This example gains its fame from the fact that 37 is an irregular prime with \(B_{32}\) having positive valuation at 37. In particular, there is a cuspform f of weight 32 congruent to the Eisenstein series \(E_{32}\) modulo 37. For this reason we consider the 30th component of weight space (corresponding to the classical weight 32). On this component there are exactly three ordinary normalized eigenforms: f, \(E_{32}^{{{\mathrm{ord}}}}\) (the ordinary 37stabilization of \(E_{32}\)), and a third form not congruent to either f or \(E_{32}^{{{\mathrm{ord}}}}\).
Let \({\mathfrak m}\) denote the maximal ideal of \({\mathbb T}\) corresponding to f and \(E_{32}^{{{\mathrm{ord}}}}\). In this case, the Eisenstein symbols live in the plus part of \(X^{{{\mathrm{ord}}}}\) and thus \((X^{{{\mathrm{ord}}}}_{\mathfrak m})^+\) is rank 2 over \(\Lambda \) (with one dimension coming from the Eisenstein series and the other coming from the Hida family through f). Using the methods of Sect. 3.6, we can form a basis (of size 2) of this space. As in the previous example, we compute the discriminant of the characteristic polynomial of \(T_\ell \) for various \(\ell \). For example, for \(\ell = 2\), we get:
which we note has \(\lambda \)invariant 2. In fact, looking at the Newton polygon of this power series we see that it has two roots each of valuation 1; call these roots \(\alpha _2\) and \(\beta _2\). By inspection, we can only find a single root (mod \(p^6\)), namely:
This suggests that \(\alpha _2 = \beta _2\), that is, that \(d_2(W)\) has a double root at \(\alpha _2\).
We note that a computer computation alone could never prove the equality \(\alpha _2 = \beta _2\) as we are always working modulo a power of 37. Nonetheless, in this example, we can argue as follows. First note that if \(\alpha _2 \ne \beta _2\), then by the same arguments as in Example 5.7 (since \({{\mathrm{disc}}}(\Lambda [T_2])\) is squarefree), we have
with \(u\in {\mathbb Z}_{37}^\times \). In particular, \({\mathbb T}_{\mathfrak m}\) is a domain and its spectrum is thus a single irreducible component. However, looking at the associated Galois representations we will see that this is impossible. Indeed, at the Eisenstein points in the family, the associated Galois representation is reducible while at generic cuspidal points this representation is irreducible. If \({{\mathrm{Spec}}}({\mathbb T}_{\mathfrak m})\) were irreducible, then all Galois representations would have the same behavior (irreducible vs. reducible) except at a finite set of points. This contradiction forces \(\alpha _2 = \beta _2\).
Hence \(d_2(W) = (W\alpha _2)^2 \cdot V_\ell (W)\) with \(V_\ell (0) \equiv 25 \text { }({\text {mod}}\text { }p)\). In particular, \(V_\ell (W)\) is a square and thus \(\Lambda [T_2] \simeq \Lambda [Y] / (Y^2  (W\alpha _2)^2)\). Arguing again with discriminant ideals, we have
where M is the change of basis matrix coming from the inclusion \(\Lambda [T_2] \subseteq {\mathbb T}_{\mathfrak m}\). Since \({{\mathrm{disc}}}(\Lambda [T_2]) = (W\alpha _2)^2 \Lambda \), we have \(\det (M)^2 = 1\) or \((W\alpha _2)^2\). In the later case, we would have that the discriminant ideal of \({\mathbb T}_{\mathfrak m}\) over \(\Lambda \) is a unit, implying that \({\mathbb T}_{\mathfrak m}\) is an étale \(\Lambda \)algebra, and in particular, that the map \(\Lambda \rightarrow {\mathbb T}_{\mathfrak m}\) is unramified. This implies that \((p,W){\mathbb T}_{\mathfrak m}={\mathfrak m}{\mathbb T}_m\). Since \({\mathfrak m}\) corresponds to a qexpansion in \(\mathbb {F}_p\), we have that \({\mathbb T}_{\mathfrak m}/(p,W){\mathbb T}_{\mathfrak m}\cong {\mathbb T}_{\mathfrak m}/{\mathfrak m}{\mathbb T}_m\cong \mathbb {F}_p\) which is a onedimensional vector space over \(\Lambda /(p,W)\cong \mathbb {F}_p\). By Nakayama’s Lemma, \({\mathbb T}_m\) must be rank one over \(\Lambda \), a contradiction. Thus, \({\mathbb T}_{\mathfrak m}\cong \Lambda [T_2]\cong \Lambda [Y] / (Y^2  (W\alpha _2)^2)\).
As a check, we computed \(d_\ell (W)\) for \(\ell \le 11\) and in each case \(\lambda (d_\ell )=2\) and \(\alpha _2\) was a root of \(d_\ell (W)\) modulo our precision.
Geometrically, the spectrum of this ring is two copies of weight space glued together at the weight \(\alpha _2\). This picture is completely consistent with what is known already in this example. Indeed, the 37adic \(\zeta \)function has \(\lambda \)invariant 1. Thus, the Eisenstein family and the cuspidal family meet at a unique weight \(k_z\) – namely the unique root of \(\zeta _{37}(1k)\). In [25, Section 6.2.1], this weight is computed to tremendous precision (1000 padic digits) with the first few digits being
To compare with our computations of \(\alpha _2\), we note that the weight \(k_z\) in the Tvariable corresponds to \((1+p)^{1k_z2}1\) and we do indeed have that
Computing the basis of the twodimensional \((X^{{{\mathrm{ord}}}}_{\mathfrak m})^+\) took 1 hour and 56 min and 410 MB, while computing the Hecke polynomials for primes \(\le 11\) took 9.5 min and 169 MB.
5.3 Linvariants
Linvariants arise when a padic Lfunction vanishes at a point of interpolation due to the vanishing of the Eulertype interpolation factor. The earliest known example of this phenomenon of socalled “trivial zeroes” is due to Ferrero–Greenberg [16]: if \(\psi \) is an even Dirichlet character, then
so \(\left( 1\psi (p)p^{n1}\right) \) vanishes at \(n=1\) whenever \(\psi (p)=1\). In [26], Mazur–Tate–Teitelbaum discovered the same type of vanishing occurs for the padic Lfunction of an elliptic curve, E, with split, multiplicative reduction at p. The interpolation property gives
where \(\Omega _E\) is the Néron period of E. When E is split multiplicative at p, \(a_p(E)=1\), so that the padic Lfunction vanishes for trivial reasons at \(s=1\) (more generally, weight 2 newforms of level exactly divisible by p and whose \(a_p=1\) share the same behaviour). They introduced a new quantity, the padic Linvariant, \({\mathcal L}_p(E)\), given in terms of the padic Tate parameter of E and conjectured that
This was proved by Greenberg and Stevens in [18] (the more general case of newforms included). A main ingredient of their proof was a formula they gave for the Linvariant of a weight 2 newform which is germane to our work:
where \(a_p(k)\) is the pth Fourier coefficient in the formal qexpansion of Eq. (5.1) for the Hida family through f. When f corresponds to an elliptic curve, Sage can already compute these Linvariants using Tate parameters, and we show below that our method provides the same answer. On the other hand, we can also compute cases that don’t correspond to elliptic curves, i.e. newforms whose Hecke eigenvalues don’t lie in \({\mathbb Q}\). Furthermore, our code provides new computations for the Linvariant of the symmetric squares of an eigenform (really, the tracezero adjoint of an eigenform, which is a twist of the symmetric square). In [17], Greenberg proposed a general theory of Linvariants for ordinary motives providing an arithmetic candidate \(\mathcal {L}^{\text {Gr}}_p(M)\) for the padic Linvariant of an ordinary motive M. When M is the tracezero adjoint, \({{\mathrm{ad}}}^0\!f\), of a newform, Hida and the third author [20, 23] gave a formula for Greenberg’s Linvariant which Dasgupta [13] has recently shown is the actual Linvariant: if f is any pordinary newform of weight \(k_0+2\) of such that \(p^2\) does not divide its conductor, then the padic Lfunction of \({{\mathrm{ad}}}^0\!f\) has a trivial zero at \(s=1\) and its Linvariant is given by
We remark that unlike the case of modular forms themselves which (in the pordinary case) only have trivial zeroes in weight 2 and conductor exactly divisible by p, the tracezero adjoint always has a trivial zero. We may therefore consider the Linvariant as varying in the Hida family and our computations allow us to compute the Iwasawa function giving the adjoint Linvariant in a family. Also, note that the nonvanishing of the Linvariant, an important part of Greenberg’s conjecture, is only known in the cases of Dirichlet characters and split, multiplicative elliptic curves (where the result is one from transcendence theory: the theorem of StÉtienne [9], which says that the Tate parameter is transcendental). In particular, the Linvariants of newforms that do not correspond to elliptic curves are not known to be nonzero. Furthermore, up to now, all that has been known for the adjoint of a newform is that either the Linvariants are all zero in a Hida family, or all but finitely many Linvariants are nonzero (since an Iwasawa function has finitely many zeroes). Our computations provide nonvanishing results for specific forms and forms in families. In particular, they show that the adjoint Linvariants of all forms in the Hida families of Examples 5.2–5.4 are nonzero and there can be at most one form in Example 5.5 with vanishing adjoint Linvariant.
We collect some values of Linvariants of modular forms as well as their tracezero adjoints. Again, full data is available in Appendix 7.2.
Example 5.9
The Hida families in Examples 5.2 and 5.3 pass through elliptic curves with split, multiplicative reduction at p (when \(k=0\)) and the data of \(a_p(k)\) provided above allows us to compute the padic Linvariants of these curves. We obtain:
We verified that these agree with Sage’s already available computation of these Linvariants (which is much quicker) to the given precision.
Example 5.10
The Hida family in Example 5.5 passes through a newform with \(p=5\) exactly dividing the level and \(a_p=1\). We compute its Linvariant, concluding that it is nonzero, to be:
Example 5.11
In fact, for each of the Hida families in Examples 5.2–5.5 (which we now label \({\mathcal F}_{11}, {\mathcal F}_{15}\), \({\mathcal F}_{19}\), and \({\mathcal F}_{95}\), respectively), we can use Eq. (5.2) to provide a formula for the symmetric square Linvariant as a function of the weight. The full results can be found in Appendix 7.2.
Plugging in \(k=122\) yields the value of the 11adic symmetric square Linvariant of \(\Delta \):
For the family \({\mathcal F}_{15}\) of Example 5.3, we obtain:
For the family \({\mathcal F}_{19}\) of Example 5.4, we obtain:
For the family \({\mathcal F}_{95}\) of Example 5.5, we obtain:
As Iwasawa power series (in the variable W), these Linvariants have \(\mu \)invariant at least 1 (since a \(\log _p(1+p)\) appears upon taking the derivative with respect to k). We verify that the first three Linvariant functions indeed have \(\mu =1\) and \(\lambda =0\), thus implying they never vanish. For \({\mathcal L}_{5}({{\mathrm{ad}}}^0\!{\mathcal F}_{95})\), the \(\mu \) and \(\lambda \)invariants are both 1 and a computation shows that the Linvariant vanishes at a weight congruent to \(4\cdot 5 + 5^4\text { }({\text {mod}}\text { }5^5)\); of course, this weight is not expected to be classical.
5.4 Twovariable padic Lfunctions
A pordinary eigenform f of classical weight \(k\ge 2\) has a padic Lfunction \(L_p(s,f)\) attached to it following the work of Manin, Amice–Vélu, and Višik. Varying the form padically in a Hida family one can expect to ‘glue’ the onevariable functions together to obtain a twovariable padic Lfunction \(L_p(s,\kappa )\) where \(\kappa \) is a weight variable around a neighborhood of k. That this is the case is due to Ohta (unpublished), Mazur–Kitagawa [24], and Greenberg–Stevens [18]; it was a fundamental ingredient in the latter’s proof of the Mazur–Tate–Teitelbaum conjecture. Greenberg has conjectured that the generic order of vanishing of \(L_p(s,\kappa )\) along the line \(s=\kappa /2\) is at most one (and congruent to the sign of the functional equation of \(L_p(s,f)\) modulo 2) (see [27, p. 439] for this statement and for some important consequences of its proof). Additionally, Greenberg and Stevens end the introduction to [18] by asking about the linear factors of the leading term in the expansion of \(L_p(s,\kappa )\) about \(s=1\) and \(\kappa =2\) when the sign is \(1\).
In this section, we include a few sample computations of twovariable padic Lfunctions through overconvergent modular symbols. To motivate these computations, we quickly review the single variable case. In [30, Theorem 8.3], Glenn Stevens gives a construction of the padic Lfunction of an eigenform f solely in terms of its corresponding overconvergent modular eigensymbol \(\Phi _f\), the unique overconvergent eigensymbol with the same system of eigenvalues as f. To form the padic Lfunction of f, one simply takes \(L_p(f):=\Phi _f(\{\infty \}\{0\})\) which is a distribution on \({\mathbb Z}_p\) and restricts this distribution to \({\mathbb Z}_p^\times \). Now if \(\Phi \in {{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )\) is a family of overconvergent eigensymbols, we analogously define the twovariable padic Lfunction \(L_p(\Phi )\) to be the restriction of \(\Phi (\{\infty \}\{0\})\) to \({\mathbb Z}_p^\times \). The result is a family of distributions on \({\mathbb Z}_p^\times \) whose specialization to any weight is the padic Lfunction of the corresponding eigenform in that weight.
In [28, Section 9], the penultimate author and Stevens explain how in practice one can compute single variable padic Lfunctions from overconvergent modular eigensymbols. The same method, which we describe now, applies in our case and allows us to compute twovariable padic Lfunctions attached to eigenfamilies of overconvergent modular symbols.
For \(\Phi \in {{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )\) an eigenfamily, set \(L_p(\Phi ) := \Phi (\{\infty \}\{0\})\big _{{\mathbb Z}_p^\times }\). Let T denote the cyclotomic variable of the padic Lfunction so that \(L_p(\Phi ) = \sum a_n T^n\), where the \(a_n\) are functions on weight space. Thus, the transformation from the T variable to the s variable is obtained by setting \(T=\gamma ^s1\). Then, as in [28, Section 9.2], we have
where the coefficients \(c_j^{(n)}\) are defined by
To approximate the coefficient \(a_n\), we must truncate the above infinite sum describing it. Since \(v_p\left( \int _{a+p{\mathbb Z}_p}(z\omega (a))^jd\mu _\Phi \right) \ge p^{jv_p(\lambda )}\), we can determine the error in approximating the twovariable padic Lfunction in this way from the following lemma.
Lemma 5.12
For \(n\ge 1\),

(1)
If \(j\le n\), then
$$\begin{aligned} v_p(c^{(n)}_j)\ge \left\lfloor \frac{n}{p1}\right\rfloor j, \end{aligned}$$ 
(2)
if \(j>n\), then
$$\begin{aligned} v_p(c^{(n)}_j)\ge \left\lfloor \frac{n}{p1}\right\rfloor n\left\lfloor \frac{j}{p}\right\rfloor . \end{aligned}$$
Proof
By grouping like powers of y in the product
we obtain the formula
Using that \(v_p(\log _p(\gamma ))=1\), we obtain
We thus proceed to find an upper bound for
Consider \(j\le n\). For a given partition \(a_0+\cdots +a_{n1}=j\), we obtain a 1 in the sum for each nonzero \(a_k\). If we combine m nonzero terms \(a_{k_1},\ldots ,a_{k_m}\) into one term, we gain \(v_p(a_{k_1}+\cdots +a_{k_m})\), but lose \(m1+v_p(a_{k_1})+\cdots +v_p(a_{k_m})\). This is never a net gain, so the optimal partition is that with j ones and \(nj\) zeroes, yielding an upper bound of j.
For \(j>n\), note that the sum of the valuations occurring above is bounded by the same sum where there is no restriction on the length of the partition. Of all partitions of j, the one maximizing the sum of the \(v_p(a_k)\) is \(j=p+p+\cdots +p+r\) with \(0\le r<p\). Indeed, it is clearly optimal for the parts of the partition to be powers of p and, for \(r>1\), the part \(p^r\) contributes r while the sum \(p^{r1}+\cdots +p^{r1}\) (p times) contributes \(p(r1)>r\). \(\square \)
With this lemma in hand, we computed several examples of twovariable padic Lfunctions. Our code produces a power series F(T, w) with T the cyclotomic variable and w the same weight variable as above. In these examples we have made the following normalizations. As throughout the whole paper, the weight variable k is normalized to correspond to modular forms of weight \(k+2\) and is obtained by substituting \(w=((1+p)^k1)/p\). The cyclotomic variable s is shifted by 1 so that \(s=0\) corresponds to the central point of the Lfunction of a weight two modular form; it is obtained by substituting \(T=(1+p)^s1\). Furthermore, the padic Lfunction we compute is only welldefined up to a unit power series in w and we normalize it so that the first nonzero coefficient in T is a power of p times a power of w.
Example 5.13
We consider the same Hida family as in Example 5.2: the unique 11adic Hida family of tame level 1 and branch \(m=0\). In order to get a nonzero padic Lfunction, we must use a symbol in the plus subspace, which no longer has dimension 1. Indeed, the presence of an Eisenstein family raises the dimension of \((X^{{{\mathrm{ord}}}})^+\) to 2. Still, localizing at the maximal ideal \({\mathfrak m}\) corresponding to \(X_0(11)\), we have that \((X^{{{\mathrm{ord}}}}_{\mathfrak m})^+\) is onedimensional since \(X_0(11)\) is not Eisenstein at 2. We obtain the following twovariable padic Lfunction (see Appendix 7.3 for the full expansion):
where \(\alpha =p + 5p^2 + 9p^3 + 9p^4 + 9p^5 + 5p^6 + 8p^7+O(p^8)\). Note that despite the nonvanishing of the central Lvalue of \(X_0(11)\), the padic Lfunction vanishes there due to the presence of an exceptional zero.
In order to numerically verify Greenberg’s conjecture, we consider the power series F(T, w) that yields \(L_{11}({\mathcal F}_{11},s,k)\) as described above. The line \(s=k/2\) corresponds to \((1+T)^2=1+pw\). Factoring
and specializing to \(w=((1+T)^2 1)/p\) yields
As this is a unit power series, this confirms Greenberg’s conjecture for \({\mathcal F}_{11}\), showing that the order of vanishing along \(s=k/2\) is exactly one throughout the Hida family.
Given the family of overconvergent modular symbols \({\mathcal F}_{11}\), computing the padic Lfunction took 30 s and 3 MB.
Example 5.14
Now, consider the elliptic curve 37a, the curve of rank 1 of least conductor, and let \(p=5\). This curve has \(a_5=2\). The space \((X^{{{\mathrm{ord}}}})^+\) has dimension 16 with one dimension coming from the (ordinary 5stabilization of the) Eisenstein series of level 37 and the remaining dimensions being new of level \(37\cdot 5\); hence the remaining dimensions have \(a_5\equiv \pm 1\text { }({\text {mod}}\text { }5)\). Thus, localizing at the maximal ideal \({\mathfrak m}\) corresponding to 37a yields a onedimensional space. We computed the following padic Lfunction (full data in Appendix 7.3):
where \(\alpha =p + 2p^2 + 4p^3 + 2p^4 + p^6 + O(p^7)\).
Again, factoring
and specializing to \(w=((1+T)^2 1)/p\) yields
Again, this confirms Greenberg’s conjecture that the order of vanishing along \(s=k/2\) is exactly one throughout the Hida family.
Given the family of overconvergent modular symbols, computing the padic Lfunction took 174 s and 66 MB.
Example 5.15
The elliptic curve 91b1 is a curve of rank 1 that has split multiplicative reduction at \(p=7\). As such, its onevariable padic Lfunction vanishes to order 2 at the central point, despite the classical Lfunction only vanishing to order 1 there. Greenberg’s conjecture states that the order of vanishing of the twovariable padic Lfunction along the line \(s=k/2\) should however be 0, generically. Our calculations verify this. First off, \((X^{{{\mathrm{ord}}}})^+\) is 9dimensional, with 2 dimensions coming from Eisenstein series, 2 from isogeny classes of elliptic curves (91a and 91b), 2 from a Galois conjugacy class of newforms defined over \({\mathbb Q}(\sqrt{2})\), and 3 from a Galois conjugacy class of newforms defined over the cubic field of discriminant 316. The curve 91b1 is not Eisenstein at 2 and has \(a_7=a_{13}=1\). The quadratic (resp. cubic) Galois conjugacy class has \(a_{13}=1\) (resp. \(a_7=1\)), so that, after localizing at the maximal ideal corresponding to the curve 91b1, we obtain a onedimensional space. The twovariable padic Lfunction we compute is
(the full data is available in Appendix 7.3).
Considering F(T, w) specialized to \(w=((1+T)^2 1)/p\), we obtain
Since this only vanishes at \(T=0\), the generic order of vanishing along \(s=k/2\) is 0, and in fact, the twovariable padic Lfunction only vanishes at \((s,k)=(0,0)\) on the line \(s=k/2\).
Given the family of overconvergent modular symbols, computing the padic Lfunction took 45 s and 24 MB.
Notes
Here, and throughout the paper, we are normalizing the weight variable k to correspond to forms in \(M_{k+2}(\Gamma )\).
To work with other discs in weight space, one needs to replace \({\mathbf {D}}\) with smaller spaces of distributions such as \({\mathbf {D}}[r]\) with \(r<1\).
The \({\mathbb Z}_p\)rank of \({{\mathrm{Symb}}}_{\Gamma _0}({{\mathrm{Sym}}}^k({\mathbb Z}_p^2))^{{{\mathrm{ord}}}}\) is simply given by the number of nonzero roots of the characteristic polynomial of \(U_p\) acting on \({{\mathrm{Symb}}}_{\Gamma _0}({{\mathrm{Sym}}}^k({\mathbb F}_p^2))\).
We note that this polynomial also arises as the characteristic polynomial of T acting on the space of pordinary modular symbols of weight k defined over \({\mathbb F}_p\) for any \(k \equiv m \pmod {p1}\) and is thus readily computed.
To verify if a symbol \(\Phi \) is actually in the ordinary subspace, one looks at \(\Phi \), \(\Phi  U_p\), \(\Phi U_p^2\), \(\ldots \), until there is a relation.
We note that this map would not be surjective if we simply looked at the span of \(z^j\) for \(j=0,\ldots ,k\). See, for instance, [8, Lemma A.4].
References
Andreatta, F., Iovita, A., Pilloni, V.: \(p\)adic families of Siegel modular cuspforms. Ann. Math. 181(2), 623–697 (2015)
Andreatta, F., Iovita, A., Stevens, G.: Overconvergent modular sheaves and modular forms for \({ GL}_{2/F}\). Israel. J. Math. 201(1), 299–359 (2014)
Ash, A., Stevens, G.: Modular forms in characteristic \(l\) and special values of their \(L\)functions. Duke. Math. J. 53(3), 849–868 (1986)
Ash, A., Stevens, G.: \(p\)adic deformations of arithmetic cohomology (2008). http://math.bu.edu/people/ghs/research.html (specifically, http://math.bu.edu/people/ghs/preprints/AshStevens0208.pdf
Bellaïche, J., Dasgupta, S.: The \(p\)adic \(L\)functions of evil Eisenstein series. Compos. Math. 151(6), 999–1040 (2015)
Bellaïche, J.: Critical \(p\)adic \(L\)functions. Invent. Math. 189(1), 1–60 (2012)
Bosch, S., Güntzer, U., Remmert R.: Nonarchimedean analysis. In: Grundlehren der Mathematischen Wissenschaften [Fundamental principles of mathematical sciences], A systematic approach to rigid analytic geometry, vol. 261, SpringerVerlag, Berlin (1984)
Bellaïche, J., Pollack, R.: On \(\mu \)invariants and congruences with Eisenstein series
BarréSirieix, K., Diaz, G., Gramain, F., Philibert, G.: Une preuve de la conjecture de Mahler–Manin. Invent. Math. 124(1–3), 1–9 (1996)
Buzzard, K.: Examples of Hida families. http://www2.imperial.ac.uk/~buzzard/maths/research/notes/. (2006). Accessed 24 Aug 2016
Coleman, R., Mazur, B.: The eigencurve. In: Scholl, A.J., Taylor, R.L. (eds.) Proceedings: Papers from the London Mathematical Society Symposium held in Durham, 9–18 July, 1996, pp. 1–113 (1998)
Coleman, R.F.: \(P\)adic Banach spaces and families of modular forms. Invent. Math. 127(3), 417–479 (1997)
Dasgupta, S.: Factorization of \(p\)adic Rankin \({L}\)series. Invent. Math. 205(1), 221–268 (2016)
Darmon, H., Pollack, R.: Efficient calculation of Stark–Heegner points via overconvergent modular symbols. Israel. J. Math. 153(1), 319–354 (2006)
Emerton, M.: On the interpolation of systems of eigenvalues attached to automorphic Hecke eigenforms. Invent. Math. 164(1), 1–84 (2006)
Ferrero, B., Greenberg, R.: On the behavior of \(p\)adic \(L\)functions at \(s\)=0. Invent. Math. 50(1), 91–102 (1978)
Greenberg, R.: Trivial zeros of \(p\)adic \(L\)functions, \(p\)adic monodromy and the Birch and SwinnertonDyer conjecture. Contemp. Math., Amer. Math. Soc. vol. 165, pp. 149–174, Boston, Providence, (1991)
Greenberg, R., Stevens, G.: \(p\)adic \(L\)functions and \(p\)adic periods of modular forms. Invent. Math. 111(2), 407–447 (1993)
Greenberg, R., Stevens, G.: On the conjecture of Mazur, Tate, and Teitelbaum, \(p\)adic monodromy and the Birch and SwinnertonDyer conjecture. Contemp. Math., Amer. Math. Soc. vol. 165, pp. 183–211, Boston, Providence, (1991)
Harron, R.W.: Linvariants of low symmetric powers of modular forms and Hida deformations (Thesis Ph.D.). ProQuest LLC, Princeton University, Ann Arbor (2009)
Hida, H.: Galois representations into \({\rm GL}_2({\bf Z}_p[[X]])\) attached to ordinary cusp forms. Invent. Math. 85(3), 545–613 (1986)
Hida, H.: Iwasawa modules attached to congruences of cusp forms. Ann. Sci. École Norm. Sup. 19(2), 231–273 (1986)
Hida, H.: Greenberg’s \(\cal L\)invariants of adjoint square Galois representations. Int. Math. Res. Not. 2004(59), 3177–3189 (2004)
Koji, K.: On standard \(p\)adic \(L\)functions of families of elliptic cusp forms, \(p\)adic monodromy and the Birch and SwinnertonDyer conjecture. Contemp. Math., Amer. Math. Soc. vol. 165, pp. 81–110, Boston, Providence, (1994)
Maurer, Y.: Zeros of \(p\)adic \({L}\)functions. Undergraduate project at Imperial College. http://wwwf.imperial.ac.uk/~buzzard/maths/research/notes/ (2000)
Mazur, B., Tate, J., Teitelbaum, J.: On \(p\)adic analogues of the conjectures of Birch and SwinnertonDyer. Invent. Math. 84(1), 1–48 (1986)
Nekovář, J., Plater, A.: On the parity of ranks of selmer groups. Asian. J. Math. 4(2), 437–497 (2000)
Pollack, R., Stevens, G.: Overconvergent modular symbols and \(p\)adic \(L\)functions. Ann. Sci. Éc. Norm. Supér. 44(1), 1–42 (2011)
Pollack, R., Stevens, G.: Critical slope \(p\)adic \(L\)functions. J. Lond. Math. Soc. 87(2), 428–452 (2013)
Stevens, G.: Rigid analytic modular symbols. http://www.math.bu.edu/people/ghs/research.html (1994)
Trifković, M.: Stark–Heegner points on elliptic curves defined over imaginary quadratic fields. Duke. Math. J. 135(3), 415–453 (2006)
The Sage Developers. SageMath, the Sage Mathematics Software System (Version 5.11). http://www.sagemath.org (2013)
Urban, E.: Eigenvarieties for reductive groups. Ann. Math. 174(3), 1685–1784 (2011)
Open Access
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Acknowledgements
We would like to thank the Southwest Center for Arithmetic Geometry for organizing the 2011 Arizona Winter School where the work on this article began as a student project. We would also like to thank the participants of Sage Days 44 for their work in porting the original Sage scripts into a full blown Sage package. We would like to thank Sage, as well as the SageMath Cloud, where we developed the algorithms and computed the examples in this article. We would like to thank Glenn Stevens for his support of this project and Frank Calegari for some very helpful conversations. Finally, our thanks go to the referee for some comments and suggestions that improved the clarity of this article.
Robert Harron was supported by NSA Grant #H982301310223 during part of this project. Robert Pollack was supported by NSF Grant DMS1303302.
Appendices
Appendix: Comparing \(X^{{{\mathrm{ord}}}}\) with Greenberg–Stevens modular symbols
We note that Hida theory implies that ordinary padic families of cuspidal eigenforms extend to all of weight space. We would thus hope to see that the Heckeeigenvalues occurring in the ordinary subspace of either \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0\hat{\otimes }R^0)\) or \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )\) extend to bounded functions on the entire open disc of radius 1. In particular, in this appendix, we will establish this fact by comparing \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )^{{{\mathrm{ord}}}}\) to the space of Greenberg–Stevens twovariable modular symbols.
1.1 Relevant measure spaces
Let \({\mathcal M}({\mathbb Z}_p^\times \times {\mathbb Z}_p)\) denote the space of \({\mathbb Z}_p\)valued measures on \({\mathbb Z}_p^\times \times {\mathbb Z}_p\); that is, the continuous dual of the space of continuous functions on \({\mathbb Z}_p^\times \times {\mathbb Z}_p\). We endow this space with a right action of \({{\mathrm{SL}}}_2({\mathbb Z})\) by
and with the structure of a \(\widetilde{\Lambda }:= {\mathbb Z}_p[[{\mathbb Z}_p^\times ]]\)module by:
where \(a \in {\mathbb Z}_p^\times \) and [a] is the natural image of a in \(\widetilde{\Lambda }\).
For \(\kappa \in {\mathcal W}\), there is a “specialization to weight \(\kappa \)” map:
where
for \(f \in {\mathbf A}_\kappa \).
Proposition 6.1
For \(\kappa \in {\mathcal W}\), the specialization to weight \(\kappa \) map
is \(\Sigma _0(p)\)equivariant. Further, this map is \(\widetilde{\Lambda }\)linear if \(\widetilde{\Lambda }\) acts on \({\mathbf {D}}^0_\kappa \) by \([a] \cdot \mu = \kappa (a) \mu \).
Proof
For \(\gamma = \bigl ( {\begin{matrix} a &{} b \\ c &{} d \end{matrix}} \bigr )\) in \(\Sigma _0(p)\) and \(f \in {\mathbf A}\), we have
while
as desired.
Also, for \(a \in {\mathbb Z}_p^\times \), we have
\(\square \)
Note that \({\mathbf {D}}^0 \hat{\otimes }\Lambda \) and \({\mathbf {D}}(\Lambda ) := {{\mathrm{Hom}}}_{{{\mathrm{cont}}}}({\mathbf A}^0,\Lambda )\) are both naturally \(\widetilde{\Lambda }:= {\mathbb Z}_p[[{\mathbb Z}_p^\times ]]\)modules. Indeed, \(\widetilde{\Lambda }\) is naturally identified with measures on \({\mathbb Z}_p^\times \), and the Amice transform identifies measures on \({\mathbb Z}_p^\times \) with bounded (rigid) functions on \({\mathcal W}\) which, by restriction to \(W_m\), naturally give elements of \(\Lambda \). Then \(\widetilde{\Lambda }\) acts on \({\mathbf {D}}(\Lambda )\) or on \({\mathbf {D}}^0 \hat{\otimes }\Lambda \) simply by scaling the values of the distribution.
1.2 Comparing modular symbols
We seek to compare \({\mathcal M}({\mathbb Z}_p^\times \times {\mathbb Z}_p)\)valued modular symbols with \({\mathbf {D}}^0 \hat{\otimes }\Lambda \)valued modular symbols. We begin with a map.
Proposition 6.2
There is a \(\widetilde{\Lambda }\)linear \(\Sigma _0(p)\)map
given by
That is, for \(\mu \in {\mathcal D}({\mathbb Z}_p^\times \times {\mathbb Z}_p)\), the moments of the distributions \(\mu _\kappa \) vary (rigid) analytically as \(\kappa \) varies over weight space.
Proof
That the moments of the \(\mu _\kappa \) vary analytically is standard. We leave the details the reader. To see that \(\alpha \) is \(\Sigma _0(p)\)equivariant, first note that \(\alpha \) commutes with specialization to weight \(\kappa \). That is,
commutes; this follows directly from the definitions of these maps. Since specialization to weight \(\kappa \) is \(\Sigma _0(p)\)equivariant with either \({\mathcal M}({\mathbb Z}_p^\times \times {\mathbb Z}_p)\) or \({\mathbf {D}}(\Lambda )\) as a source, for \(\mu \in {\mathcal M}({\mathbb Z}_p^\times \times {\mathbb Z}_p)\), we have
while
Thus \(\alpha (\mu )  \gamma \) and \(\alpha (\mu  \gamma )\) have the same specialization to weight \(\kappa \) for all \(\kappa \in W_m\). Thus, by definition, \(\alpha (\mu  \gamma ) = \alpha (\mu )  \gamma \) in \({\mathbf {D}}(\Lambda )\) as desired.
To see \(\widetilde{\Lambda }\)linearity, we can argue the same way since specialization to weight \(\kappa \) (with either source) is \(\widetilde{\Lambda }\)linear if \({\mathbf {D}}^0_\kappa \) is acted on by \(\widetilde{\Lambda }\) with [a] acting by \(\kappa (a)\). \(\square \)
For any \(\widetilde{\Phi }\in {{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}(\Lambda ))^{{{\mathrm{ord}}}}\), we have that \(\widetilde{\Phi }\) is in the image of \(U_p\), and thus as in Lemma 2.10, \(\widetilde{\Phi }\) takes values in \({\mathbf {D}}^0 \hat{\otimes }\Lambda \). Thus,
and we have a Heckeequivariant map
We aim to show that this map is an isomorphism, and thus the characteristic polynomial of a Hecke operator acting on the target is the same as the restriction to \(\Lambda \) of the characteristic polynomial of that Hecke operator on the source. From this, we can deduce that the coefficients of the characteristic polynomials of Hecke operators on the target (which a priori are in \(\Lambda \)) extend to an open disc of radius 1 in weight space.
1.3 Control theorems
Fix a nonnegative integer \(k \equiv m \text { }({\text {mod}}\text { }p1)\) and consider the map \(\widetilde{\Lambda }\rightarrow {\mathbb Z}_p\) given by evaluation at weight k. Let \({\mathfrak p}_k \subseteq \widetilde{\Lambda }\) denote the kernel of this map; it is a principal ideal. We now state several control theorems for spaces of Greenberg–Stevens modular symbols and for our spaces of families of modular symbols. We note that in the below theorems, p is not inverted.
Let \(\widetilde{{\mathcal P}}_k\) denote the \({\mathbb Z}_p\)span of \(\genfrac(){0.0pt}{}{z}{j}\), for \(j=0, \ldots , k\), in the space of \({\mathbb Z}_p\)valued continuous functions on \({\mathbb Z}_p\), and set \(\widetilde{{\mathcal P}}_k^\vee \) equal to the \({\mathbb Z}_p\)dual of \(\widetilde{{\mathcal P}}_k\). We note that \(\widetilde{{\mathcal P}}_k^\vee \) is isomorphic to a lattice in \({{\mathrm{Sym}}}^k({\mathbb Q}_p^2)\). We have a surjective \(\Sigma _0(p)\)equivariant map^{Footnote 6}
and thus a Heckeequivariant map
Lemma 6.3
The above map induces a Heckeequivariant isomorphism
on ordinary subspaces.
Proof
This isomorphism is implicitly given in [18] if we allow p to be inverted. The above integral version is given in [8, Corollary A.9]. \(\square \)
Now, set \({\mathcal P}_k\) equal to the span of 1, z, \(\ldots \), \(z^k\) in \({\mathbf A}^{\!0}_k\); we then have a surjective map \({\mathbf {D}}^0_k \longrightarrow {\mathcal P}_k^\vee \). We now state a control theorem for families of modular symbols.
Lemma 6.4
Specialization to weight k and the above map induce Heckeequivariant isomorphisms:
Proof
The first isomorphism is given in [6, Corollary 3.12] if p is inverted. Mimicking the argument there, we simply need to check that \(H_0(\Gamma _0,{\mathbf {D}}^0 \hat{\otimes }\Lambda )^{{{\mathrm{ord}}}}\) vanishes. Since
it suffices to see that \(({\mathbf {D}}^0 \hat{\otimes }\Lambda )^{{{\mathrm{ord}}}}\) vanishes. (Here, we let \(U_p\) act on \({\mathbf {D}}^0 \hat{\otimes }\Lambda \) by acting by \(\sum _{a=0}^{p1} \left( {\begin{matrix} 1 &{} a \\ 0 &{} p \end{matrix}} \right) \).)
To this end, take \(\widetilde{\mu }\in ({\mathbf {D}}^0 \hat{\otimes }\Lambda )^{{{\mathrm{ord}}}}\), and write \(\widetilde{\mu }= \widetilde{\nu }\big  U_p^n\) for some n. Then
by Lemma 6.5 below. Since this congruence holds for all n, we get our desired result.
For the second isomorphism, we note that this is proven in [29, Theorem 5.4] except that p is inverted. Mimicking the arguments there, but keeping everything integral, we need to show that \(H_0(\Gamma _0,K)^{{{\mathrm{ord}}}} = 0\) where \(K \subseteq {\mathbf {D}}^0_k\) are the distributions which vanish on \(z^j\) for \(0 \le j \le k\). Arguing as above, it suffices to see that \(K^{{{\mathrm{ord}}}} = 0\). For \(\mu \in K^{{{\mathrm{ord}}}}\), write \(\mu = \nu \big  U_p^n\), and thus
as \(\nu ({\mathbf 1}) = 0\). \(\square \)
Lemma 6.5
For all \(j\ge 0\) and \(n \ge 1\), we have \(p^{n1}\) divides \( \sum _{a=0}^{p^n1} a^j\).
Proof
We proceed by induction on n with \(n=1\) being vacuous. Then, for \(n>1\), we have
By induction, \(p^{n2}\) divides \(\sum _{a=0}^{p^{n1}1} a^j\), and thus \(p^{n1}\) divides \(\sum _{a=0}^{p^{n}1} a^j\) as desired. \(\square \)
Lemma 6.6
Let X and Y be \(\Lambda \)modules with Y free over \(\Lambda \). Assume there is a map \(\alpha : X \rightarrow Y\) such that the induced map \(X/{\mathfrak p}_kX \rightarrow Y/{\mathfrak p}_kY\) is an isomorphism for some k. Then \(\alpha \) is an isomorphism.
Proof
Let Z be the \(\Lambda \)module defined by the exact sequence
Thus we have an exact sequence,
Since \(X / {\mathfrak p}_k X \cong Y/{\mathfrak p}_kY\), we have \(Z/{\mathfrak p}_kZ = 0\). But then \(Z=0\) and \(\alpha \) is surjective.
Now let W be the \(\Lambda \)module defined by the exact sequence
By the snake lemma, we then have an exact sequence
Since Y is free, \(Y[{\mathfrak p}_k] = 0\), and since \(X / {\mathfrak p}_k X \cong Y/{\mathfrak p}_kY\), we have \(W/{\mathfrak p}_kW=0\). Thus, \(W=0\) and \(\alpha \) is an isomorphism. \(\square \)
Lemma 6.7
We have \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )^{{{\mathrm{ord}}}}\) is free over \(\Lambda \) with finite rank.
Proof
We use the fact that if Y is any \(\Lambda \)module such that \(Y/{\mathfrak p}_kY\) is a free \({\mathbb Z}_p\)module of finite rank, then Y is free over \(\Lambda \) with finite rank. Then note that by Lemma 6.4,
which is indeed free over \({\mathbb Z}_p\) with finite rank (as this last space is a classical space of modular symbols). \(\square \)
Theorem 6.8
The map \(\alpha \) induces a Heckeequivariant isomorphism
Proof
If we choose \(k \equiv m \text { }({\text {mod}}\text { }p1)\) such that \(0 \le k \le p2\), then it is easy to see that \(\widetilde{{\mathcal P}}_k = {\mathcal P}_k\). This theorem then follows from Lemmas 6.6, 6.3 and 6.4. \(\square \)
Corollary 6.9
The characteristic polynomial of any Hecke operator acting on \({{\mathrm{Symb}}}_{\Gamma _0}({\mathbf {D}}^0 \hat{\otimes }\Lambda )^{{{\mathrm{ord}}}}\) has coefficients which converge on all of \(W_m\).
Proof
This corollary follows immediately from Theorem 6.8 as characteristic polynomials of Hecke operators on \({{\mathrm{Symb}}}_{\Gamma _0}({\mathcal M}({\mathbb Z}_p^\times \times {\mathbb Z}_p))^{{{\mathrm{ord}}}}\) have this property. \(\square \)
Appendix: Some data
1.1 qexpansions
Full data for Example 5.2: \(p=11\), tame level \(N=1\), branch \(m=0\).
Full data for Example 5.3: \(p=5\), tame level \(N=3\), branch \(m=0\).
Full data for Example 5.4: \(p=5\), tame level \(N=19\), branch \(m=0\).
Full data for Example 5.5: \(p=5\), tame level \(N=19\), branch \(m=0\).
Full data for Example 5.6: \(p=11\), tame level \(N=31\), branch \(m=0\).
1.2 Linvariants
Full data for Example 5.11:
1.3 Twovariable padic Lfunctions
Full data for Example 5.13:
Full data for Example 5.14:
Full data for Example 5.15:
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Dummit, E.P., Hablicsek, M., Harron, R. et al. Explicit computations of Hida families via overconvergent modular symbols. Res. Number Theory 2, 25 (2016). https://doi.org/10.1007/s4099301600528
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s4099301600528