Explicit bounds on the coefficients of modular polynomials for the elliptic $j$-invariant
By Florian Breuer and Fabien Pazuki
Abstract
We obtain an explicit upper bound on the size of the coefficients of the elliptic modular polynomials $\Phi _N$ for any $N\geq 1$. These polynomials vanish at pairs of $j$-invariants of elliptic curves linked by cyclic isogenies of degree $N$. The main term in the bound is asymptotically optimal as $N$ tends to infinity.
1. Introduction
For any nonzero polynomial $P$ in one or more variables and complex coefficients, we define its height to be
$$\begin{equation*} h(P) \coloneq \log \max |c|, \quad \text{where $c$ ranges over all coefficients of $P$.} \end{equation*}$$
Let $N$ be a positive integer and denote by $\Phi _N = \Phi _N(X,Y) \in \mathbb{Z}[X,Y]$ the (classical) modular polynomial, which vanishes at pairs of $j$-invariants of elliptic curves linked by a cyclic $N$-isogeny; see Reference La87, Chapter 5. Alternatively, if we view $j$ as the function on the complex upper half-plane where $j(\tau )$ is the $j$-invariant of the complex elliptic curve $\mathbb{C}/(\mathbb{Z}+\tau \mathbb{Z})$, then $\Phi _N(X, j(\tau ))$ is the minimal polynomial of $j(N\tau )$ over $\mathbb{C}(j(\tau ))$.
Modular polynomials have important applications in cryptography and certain algorithms for computing $\Phi _N$ require explicit bounds on the size of the coefficients, so one is interested in explicit bounds on $h(\Phi _N)$.
Paula Cohen Tretkoff Reference Coh84 proved that when $N$ tends to $+\infty$
Inequality Equation 2 has the merit of being completely explicit for all $N\geq 1$, but the main term is slightly too big when compared with the asymptotic of Equation 1.
The goal of the present paper is to prove the following result, where we solve this issue and provide an upper bound with the correct main term for all $N$. Let us first define
$$\begin{equation*} \lambda _N \coloneq \sum _{p^n\|N}\frac{p^n-1}{p^{n-1}(p^2-1)}\log p. \end{equation*}$$
We prove this theorem using a different path than the one followed in Reference Paz19. The main new ingredient is a finer estimate of the Mahler measure of $j$-invariants, coming from previous work of Pascal Autissier Reference Aut03. We also use precise analytic estimates for the discriminant modular form on the fundamental domain of the upper half-plane (under the classical action of $\operatorname {SL}_2(\mathbb{Z})$), and a classical interpolation method to help us derive bounds on the height of a polynomial in two variables, from knowledge of the height of several specializations of this polynomial.
Let us now discuss the optimality of the bound. The main term is the expected one. For lower order terms, notice that
so one changes little replacing $\lambda _N$ by $\kappa _N$ in Theorem 1.1. On the other hand, one would like to get rid of the spurious $\log \log N$ term, but for practical purposes this might be less useful than keeping the constant as small as possible.
It is interesting to consider the functions $b_\lambda (N)$ and $b_\kappa (N)$ for which
These functions are plotted in Figure 1 for $N\leq 400$, based on computations of $\Phi _N$ by Andrew Sutherland Reference Suth using the algorithms in Reference BKS12 (for prime $N$) and Reference BOS16 (for composite $N$).
The content of CohenтАЩs theorem is that $b_\kappa (N)$ and thus also $b_\lambda (N)$ are bounded functions. Theorem 1.1 is equivalent to $b_\lambda (N) \leq \log \log N + 4.436$, which is clearly seen to hold for $N\leq 400$; in fact, $b_\lambda (N) < 2.1$ in this range.
In our proof of Theorem 1.1 we may thus assume that $N > 400$. We explain in Remark 3.2 and in Lemma 3.3 that more computations for $N > 400$ lead to minor improvements on the constant $4.436$.
From Figure 1 it appears that $b_\lambda (N)$ is bounded more tightly than $b_\kappa (N)$, thus suggesting that $\lambda _N$ is a more natural function to use in the bound for $h(\Phi _N)$ than is $\kappa _N$.
Every $\tau \in \mathbb{H}$ defines a lattice $\Lambda _\tau = \mathbb{Z}+ \tau \mathbb{Z}$ in $\mathbb{C}$, and it is well known that every complex elliptic curve is isomorphic to $\mathbb{C}/\Lambda _\tau$ for some $\tau \in \mathbb{H}$. If we denote the $j$-invariant of this elliptic curve by $j(\tau )$, then
We point out that the discriminant of the elliptic curve $E_\tau$ is given by $(2\pi )^{12}\Delta (\tau )$, which is why most sources (e.g., Reference La87) normalize $\Delta$ differently, multiplying the above product by the factor $(2\pi )^{12}$. We choose our normalization to be consistent with Reference Paz19, which contains estimates that we will use.
Let us denote, for $N\geq 1$,
$$\begin{equation*} C_N=\left\{\begin{pmatrix} a & b \\ 0 & d \end{pmatrix} : a,b,d\in \mathbb{Z}, ad=N, a\geq 1, 0\leq b\leq d-1, \gcd (a,b,d)=1 \right\}. \end{equation*}$$
Our goal is to bound the coefficients of the modular polynomial $\Phi _N(X,Y)$. By interpolation, it is enough to estimate the height of $\Phi _N(X, j(\tau ))$ for several carefully chosen $\tau \in \mathbb{H}$.
By Reference BrZu20, Lemma 1.6 the height of $\Phi _N(X,j(\tau ))$ is bounded in terms of its Mahler measure