Modular form theory is a special case of the more general theory of automorphic forms, which are functions defined on Lie groups that transform nicely with respect to the action of certain discrete subgroups, generalizing the example of the modular group . Every modular form is attached to a Galois representation.[1]
The term "modular form", as a systematic description, is usually attributed to Erich Hecke.
Automorphy condition: For any there is the equality[note 1]
Growth condition: For any the function is bounded for
where and the function is identified with the matrix The identification of such functions with such matrices causes composition of such functions to correspond to matrix multiplication. In addition, it is called a cusp form if it satisfies the following growth condition:
Cuspidal condition: For any the function as
As sections of a line bundle
Modular forms can also be interpreted as sections of a specific line bundle on modular varieties. For a modular form of level and weight can be defined as an element of
where is a canonical line bundle on the modular curve
A modular function is a function that is invariant with respect to the modular group, but without the condition that f (z) be holomorphic in the upper half-plane (among other requirements). Instead, modular functions are meromorphic: they are holomorphic on the complement of a set of isolated points, which are poles of the function.
For odd k, only the zero function can satisfy the second condition.
The third condition is also phrased by saying that f is "holomorphic at the cusp", a terminology that is explained below. Explicitly, the condition means that there exist some such that , meaning is bounded above some horizontal line.
The second condition for
reads
respectively. Since S and Tgenerate the modular group SL(2, Z), the second condition above is equivalent to these two equations.
Definition in terms of lattices or elliptic curves
A modular form can equivalently be defined as a function F from the set of lattices in C to the set of complex numbers which satisfies certain conditions:
If we consider the lattice Λ = Zα + Zz generated by a constant α and a variable z, then F(Λ) is an analytic function of z.
If α is a non-zero complex number and αΛ is the lattice obtained by multiplying each element of Λ by α, then F(αΛ) = α−kF(Λ) where k is a constant (typically a positive integer) called the weight of the form.
The absolute value of F(Λ) remains bounded above as long as the absolute value of the smallest non-zero element in Λ is bounded away from 0.
The key idea in proving the equivalence of the two definitions is that such a function F is determined, because of the second condition, by its values on lattices of the form Z + Zτ, where τ ∈ H.
Examples
I. Eisenstein series
The simplest examples from this point of view are the Eisenstein series. For each even integer k > 2, we define Gk(Λ) to be the sum of λ−k over all non-zero vectors λ of Λ:
Then Gk is a modular form of weight k. For Λ = Z + Zτ we have
and
The condition k > 2 is needed for convergence; for odd k there is cancellation between λ−k and (−λ)−k, so that such series are identically zero.
II. Theta functions of even unimodular lattices
An even unimodular latticeL in Rn is a lattice generated by n vectors forming the columns of a matrix of determinant 1 and satisfying the condition that the square of the length of each vector in L is an even integer. The so-called theta function
converges when Im(z) > 0, and as a consequence of the Poisson summation formula can be shown to be a modular form of weight n/2. It is not so easy to construct even unimodular lattices, but here is one way: Let n be an integer divisible by 8 and consider all vectors v in Rn such that 2v has integer coordinates, either all even or all odd, and such that the sum of the coordinates of v is an even integer. We call this lattice Ln. When n = 8, this is the lattice generated by the roots in the root system called E8. Because there is only one modular form of weight 8 up to scalar multiplication,
where q is the square of the nome. Then the modular discriminantΔ(z) = (2π)12η(z)24 is a modular form of weight 12. The presence of 24 is related to the fact that the Leech lattice has 24 dimensions. A celebrated conjecture of Ramanujan asserted that when Δ(z) is expanded as a power series in q, the coefficient of qp for any prime p has absolute value ≤ 2p11/2. This was confirmed by the work of Eichler, Shimura, Kuga, Ihara, and Pierre Deligne as a result of Deligne's proof of the Weil conjectures, which were shown to imply Ramanujan's conjecture.
The second and third examples give some hint of the connection between modular forms and classical questions in number theory, such as representation of integers by quadratic forms and the partition function. The crucial conceptual link between modular forms and number theory is furnished by the theory of Hecke operators, which also gives the link between the theory of modular forms and representation theory.
Modular functions
When the weight k is zero, it can be shown using Liouville's theorem that the only modular forms are constant functions. However, relaxing the requirement that f be holomorphic leads to the notion of modular functions. A function f : H → C is called modular if it satisfies the following properties:
The second condition implies that f is periodic, and therefore has a Fourier series. The third condition is that this series is of the form
It is often written in terms of (the square of the nome), as:
This is also referred to as the q-expansion of f (q-expansion principle). The coefficients are known as the Fourier coefficients of f, and the number m is called the order of the pole of f at i∞. This condition is called "meromorphic at the cusp", meaning that only finitely many negative-n coefficients are non-zero, so the q-expansion is bounded below, guaranteeing that it is meromorphic at q = 0. [note 2]
Sometimes a weaker definition of modular functions is used – under the alternative definition, it is sufficient that f be meromorphic in the open upper half-plane and that f be invariant with respect to a sub-group of the modular group of finite index.[4] This is not adhered to in this article.
Another way to phrase the definition of modular functions is to use elliptic curves: every lattice Λ determines an elliptic curveC/Λ over C; two lattices determine isomorphic elliptic curves if and only if one is obtained from the other by multiplying by some non-zero complex number α. Thus, a modular function can also be regarded as a meromorphic function on the set of isomorphism classes of elliptic curves. For example, the j-invariantj(z) of an elliptic curve, regarded as a function on the set of all elliptic curves, is a modular function. More conceptually, modular functions can be thought of as functions on the moduli space of isomorphism classes of complex elliptic curves.
A modular form f that vanishes at q = 0 (equivalently, a0 = 0, also paraphrased as z = i∞) is called a cusp form (Spitzenform in German). The smallest n such that an ≠ 0 is the order of the zero of f at i∞.
A modular unit is a modular function whose poles and zeroes are confined to the cusps.[5]
Modular forms for more general groups
The functional equation, i.e., the behavior of f with respect to can be relaxed by requiring it only for matrices in smaller groups.
The Riemann surface G\H∗
Let G be a subgroup of SL(2, Z) that is of finite index. Such a group Gacts on H in the same way as SL(2, Z). The quotient topological spaceG\H can be shown to be a Hausdorff space. Typically it is not compact, but can be compactified by adding a finite number of points called cusps. These are points at the boundary of H, i.e. in Q∪{∞},[note 3] such that there is a parabolic element of G (a matrix with trace ±2) fixing the point. This yields a compact topological space G\H∗. What is more, it can be endowed with the structure of a Riemann surface, which allows one to speak of holo- and meromorphic functions.
Important examples are, for any positive integer N, either one of the congruence subgroups
For G = Γ0(N) or Γ(N), the spaces G\H and G\H∗ are denoted Y0(N) and X0(N) and Y(N), X(N), respectively.
The geometry of G\H∗ can be understood by studying fundamental domains for G, i.e. subsets D ⊂ H such that D intersects each orbit of the G-action on H exactly once and such that the closure of D meets all orbits. For example, the genus of G\H∗ can be computed.[6]
Definition
A modular form for G of weight k is a function on H satisfying the above functional equation for all matrices in G, that is holomorphic on H and at all cusps of G. Again, modular forms that vanish at all cusps are called cusp forms for G. The C-vector spaces of modular and cusp forms of weight k are denoted Mk(G) and Sk(G), respectively. Similarly, a meromorphic function on G\H∗ is called a modular function for G. In case G = Γ0(N), they are also referred to as modular/cusp forms and functions of levelN. For G = Γ(1) = SL(2, Z), this gives back the afore-mentioned definitions.
Consequences
The theory of Riemann surfaces can be applied to G\H∗ to obtain further information about modular forms and functions. For example, the spaces Mk(G) and Sk(G) are finite-dimensional, and their dimensions can be computed thanks to the Riemann–Roch theorem in terms of the geometry of the G-action on H.[7] For example,
The modular functions constitute the field of functions of the Riemann surface, and hence form a field of transcendence degree one (over C). If a modular function f is not identically 0, then it can be shown that the number of zeroes of f is equal to the number of poles of f in the closure of the fundamental regionRΓ.It can be shown that the field of modular function of level N (N ≥ 1) is generated by the functions j(z) and j(Nz).[8]
Line bundles
The situation can be profitably compared to that which arises in the search for functions on the projective space P(V): in that setting, one would ideally like functions F on the vector space V which are polynomial in the coordinates of v ≠ 0 in V and satisfy the equation F(cv) = F(v) for all non-zero c. Unfortunately, the only such functions are constants. If we allow denominators (rational functions instead of polynomials), we can let F be the ratio of two homogeneous polynomials of the same degree. Alternatively, we can stick with polynomials and loosen the dependence on c, letting F(cv) = ckF(v). The solutions are then the homogeneous polynomials of degree k. On the one hand, these form a finite dimensional vector space for each k, and on the other, if we let k vary, we can find the numerators and denominators for constructing all the rational functions which are really functions on the underlying projective space P(V).
One might ask, since the homogeneous polynomials are not really functions on P(V), what are they, geometrically speaking? The algebro-geometric answer is that they are sections of a sheaf (one could also say a line bundle in this case). The situation with modular forms is precisely analogous.
Modular forms can also be profitably approached from this geometric direction, as sections of line bundles on the moduli space of elliptic curves.
For a subgroup Γ of the SL(2, Z), the ring of modular forms is the graded ring generated by the modular forms of Γ. In other words, if Mk(Γ) is the vector space of modular forms of weight k, then the ring of modular forms of Γ is the graded ring .
Rings of modular forms of congruence subgroups of SL(2, Z) are finitely generated due to a result of Pierre Deligne and Michael Rapoport. Such rings of modular forms are generated in weight at most 6 and the relations are generated in weight at most 12 when the congruence subgroup has nonzero odd weight modular forms, and the corresponding bounds are 5 and 10 when there are no nonzero odd weight modular forms.
More generally, there are formulas for bounds on the weights of generators of the ring of modular forms and its relations for arbitrary Fuchsian groups.
New forms are a subspace of modular forms[9] of a fixed level which cannot be constructed from modular forms of lower levels dividing . The other forms are called old forms. These old forms can be constructed using the following observations: if then giving a reverse inclusion of modular forms .
A cusp form is a modular form with a zero constant coefficient in its Fourier series. It is called a cusp form because the form vanishes at all cusps.
Generalizations
There are a number of other usages of the term "modular function", apart from this classical one; for example, in the theory of Haar measures, it is a function Δ(g) determined by the conjugation action.
Hilbert modular forms are functions in n variables, each a complex number in the upper half-plane, satisfying a modular relation for 2×2 matrices with entries in a totally real number field.
Siegel modular forms are associated to larger symplectic groups in the same way in which classical modular forms are associated to SL(2, R); in other words, they are related to abelian varieties in the same sense that classical modular forms (which are sometimes called elliptic modular forms to emphasize the point) are related to elliptic curves.
Jacobi forms are a mixture of modular forms and elliptic functions. Examples of such functions are very classical - the Jacobi theta functions and the Fourier coefficients of Siegel modular forms of genus two - but it is a relatively recent observation that the Jacobi forms have an arithmetic theory very analogous to the usual theory of modular forms.
Modular integrals of weight k are meromorphic functions on the upper half plane of moderate growth at infinity which fail to be modular of weight k by a rational function.
Automorphic factors are functions of the form which are used to generalise the modularity relation defining modular forms, so that
The function is called the nebentypus of the modular form. Functions such as the Dedekind eta function, a modular form of weight 1/2, may be encompassed by the theory by allowing automorphic factors.
In the 1960s, as the needs of number theory and the formulation of the modularity theorem in particular made it clear that modular forms are deeply implicated.
Taniyama and Shimura identified a 1-to-1 matching between certain modular forms and elliptic curves. Robert Langlands built on this idea in the construction of his expansive Langlands program, which has become one of the most far-reaching and consequential research programs in math.
In 1994 Andrew Wiles used modular forms to prove Fermat’s Last Theorem. In 2001 all elliptic curves were proven to be modular over the rational numbers. In 2013 elliptic curves were proven to be modular over real quadratic fields. In 2023 elliptic curves were proven to be modular over about half of imaginary quadratic fields, including fields formed by combining the rational numbers with the square root of integers down to −5.[1]
^A meromorphic function can only have a finite number of negative-exponent terms in its Laurent series, its q-expansion. It can only have at most a pole at q = 0, not an essential singularity as exp(1/q) has.
^Gunning, Robert C. (1962), Lectures on modular forms, Annals of Mathematics Studies, vol. 48, Princeton University Press, p. 13
^Shimura, Goro (1971), Introduction to the arithmetic theory of automorphic functions, Publications of the Mathematical Society of Japan, vol. 11, Tokyo: Iwanami Shoten, Theorem 2.33, Proposition 2.26
Diamond, Fred; Shurman, Jerry Michael (2005), A First Course in Modular Forms, Graduate Texts in Mathematics, vol. 228, New York: Springer-Verlag, ISBN978-0387232294Leads up to an overview of the proof of the modularity theorem.
Gelbart, Stephen S. (1975), Automorphic Forms on Adèle Groups, Annals of Mathematics Studies, vol. 83, Princeton, N.J.: Princeton University Press, MR0379375. Provides an introduction to modular forms from the point of view of representation theory.
Serre, Jean-Pierre (1973), A Course in Arithmetic, Graduate Texts in Mathematics, vol. 7, New York: Springer-Verlag. Chapter VII provides an elementary introduction to the theory of modular forms.