Asymptotics of Plancherel measures for symmetric groups
By Alexei Borodin, Andrei Okounkov, and Grigori Olshanski
Abstract
We consider the asymptotics of the Plancherel measures on partitions of $n$ as $n$ goes to infinity. We prove that the local structure of a Plancherel typical partition in the middle of the limit shape converges to a determinantal point process with the discrete sine kernel.
On the edges of the limit shape, we prove that the joint distribution of suitably scaled 1st, 2nd, and so on rows of a Plancherel typical diagram converges to the corresponding distribution for eigenvalues of random Hermitian matrices (given by the Airy kernel). This proves a conjecture due to Baik, Deift, and Johansson by methods different from the Riemann-Hilbert techniques used in their original papers and from the combinatorial proof given by the second author.
Our approach is based on an exact determinantal formula for the correlation functions of the poissonized Plancherel measures in terms of a new kernel involving Bessel functions. Our asymptotic analysis relies on the classical asymptotic formulas for the Bessel functions and depoissonization techniques.
1. Introduction
1.1. Plancherel measures
Given a finite group $G$, by the corresponding Plancherel measure we mean the probability measure on the set $G^\wedge$ of irreducible representations of $G$ which assigns to a representation $\pi \in G^\wedge$ the weight $(\dim \pi )^2/|G|$. For the symmetric group $S(n)$, the set $S(n)^\wedge$ is the set of partitions $\lambda$ of the number $n$, which we shall identify with Young diagrams with $n$ squares throughout this paper. The Plancherel measure on partitions $\lambda$ arises naturally in representation–theoretic, combinatorial, and probabilistic problems. For example, the Plancherel distribution of the first part of a partition coincides with the distribution of the longest increasing subsequence of a uniformly distributed random permutation Reference 31.
We denote the Plancherel measure on partitions of $n$ by $M_n$,
where $\dim \lambda$ is the dimension of the corresponding representation of $S(n)$. The asymptotic properties of these measures as $n\to \infty$ have been studied very intensively; see the References and below.
In the seventies, Logan and Shepp Reference 23 and, independently, Vershik and Kerov Reference 40Reference 42 discovered the following measure concentration phenomenon for $M_n$ as $n\to \infty$. Let $\lambda$ be a partition of $n$ and let $i$ and $j$ be the usual coordinates on the diagrams, namely, the row number and the column number. Introduce new coordinates $u$ and $v$ by
that is, we flip the diagram, rotate it $135^\circ$ as in Figure 1, and scale it by the factor of $n^{-1/2}$ in both directions.
After this scaling, the Plancherel measures $M_n$ converge as $n\to \infty$ (see Reference 23Reference 40Reference 42 for precise statements) to the delta measure supported on the following shape:
The function $\Omega$ is plotted in Figure 1. As explained in detail in Reference 22, this limit shape $\Omega$ is closely connected to Wigner’s semicircle law for distribution of eigenvalues of random matrices; see also Reference 19Reference 20Reference 21.
From a different point of view, the connection with random matrices was observed in Reference 3Reference 4, and also in earlier papers Reference 16Reference 28Reference 29. In Reference 3, Baik, Deift, and Johansson made the following conjecture. They conjectured that in the $n\to \infty$ limit and after proper scaling the joint distribution of $\lambda _i$,$i=1,2,\dots$, becomes identical to the joint distribution of properly scaled largest eigenvalues of a Gaussian random Hermitian matrix (which form the so-called Airy ensemble; see Section 1.4). They proved this for the individual distribution of $\lambda _1$ and $\lambda _2$ in Reference 3 and Reference 4, respectively. A combinatorial proof of the full conjecture was given by one of us in Reference 25. It was based on an interplay between maps on surfaces and ramified coverings of the sphere.
In this paper we study the local structure of a typical Plancherel diagram both in the bulk of the limit shape $\Omega$ and on its edge, where by the study of the edge we mean the study of the behavior of $\lambda _1$,$\lambda _2$, and so on.
We employ an analytic approach based on an exact formula in terms of Bessel functions for the correlation functions of the so-called poissonization of the Plancherel measures $M_n$ (see Theorem 1 in the following subsection), and the so-called depoissonization techniques (see Section 1.4).
The exact formula in Theorem 1 is a limit case of a formula from Reference 8; see also the recent papers Reference 26Reference 27 for more general results. The use of poissonization and depoissonization is very much in the spirit of Reference 3Reference 16Reference 39 and represents a well–known statistical mechanics principle of the equivalence of canonical and grand canonical ensembles.
Our main results are the following two. In the bulk of the limit shape $\Omega$, we prove that the local structure of a Plancherel typical partition converges to a determinantal point process with the discrete sine kernel; see Theorem 3. This result is parallel to the corresponding result for random matrices. On the edge of the limit shape, we give an analytic proof of the Baik-Deift-Johansson conjecture; see Theorem 4. These results will be stated in Sections 1.3 and 1.4 of the present Introduction, respectively.
Simultaneously and independently, results equivalent to our Theorems 2 and 4 were obtained by K. Johansson Reference 17.
1.2. Poissonization and correlation functions
For $\theta >0$, consider the poissonization$M^\theta$ of the measures $M_n$:
This is a probability measure on the set of all partitions. Our first result is the computation of the correlation functions of the measures $M^\theta$.
By correlation functions we mean the following. By definition, set
where $\triangle$ stands for the symmetric difference of two sets, $d$ is the number of squares on the diagonal of $\lambda$, and $p_i$’s and $q_i$’s are the usual Frobenius coordinates of $\lambda$. Recall that $p_i$ is the number of squares in the $i$th row to the right of the diagonal, and $q_i$ is number of squares in the $i$th column below the diagonal. The equality Equation 1.2 is a well–known combinatorial fact discovered by Frobenius; see Ex. I.1.15(a) in Reference 24. Note that, in contrast to $\operatorname {Fr}(\lambda )$, the set $\mathcal{D}(\lambda )$ is infinite and, moreover, it contains all but finitely many negative integers.
The sets $\mathcal{D}(\lambda )$ and $\operatorname {Fr}(\lambda )$ have the following nice geometric interpretation. Let the diagram $\lambda$ be flipped and rotated $135^\circ$ as in Figure 1, but not scaled. Denote by $\omega _\lambda$ a piecewise linear function with $\omega '_\lambda =\pm 1$ whose graph is given by the upper boundary of $\lambda$ completed by the lines
In other words, if we consider $\omega _\lambda$ as a history of a walk on $\mathbb{Z}$, then $\mathcal{D}(\lambda )$ are those moments when a step is made in the negative direction. It is therefore natural to call $\mathcal{D}(\lambda )$ the descent set of $\lambda$. As we shall see, the correspondence $\lambda \mapsto \mathcal{D}(\lambda )$ is a very convenient way to encode the local structure of the boundary of $\lambda$.
The halves in the definition of $\operatorname {Fr}(\lambda )$ have the following interpretation: one splits the diagonal squares in half and gives half to the rows and half to the columns.
Definition 1.1.
The correlation functions of $M^\theta$ are the probabilities that the sets $\operatorname {Fr}(\lambda )$ or, similarly, $\mathcal{D}(\lambda )$ contain a fixed subset $X$. More precisely, we set
where $J_x=J_x(2\sqrt {\theta })$ is the Bessel function of order $x$ and argument $2\sqrt {\theta }$. The diagonal values $\mathsf{K}(x,x)$ are determined by the l’Hospital rule.
This theorem is established in Section 2.1; see also Remark 1.2 below. By the complementation principle (see Sections A.3 and 2.2), Theorem 1 is equivalent to the following
Theorem 2.
For any $X=\{x_1,\dots ,x_s\}\subset \mathbb{Z}$ we have
where $J_x=J_x(2\sqrt \theta )$. The diagonal values $\mathsf{J}(x,x)$ are determined by the l’Hospital rule.
Remark 1.2.
Theorem 1 is a limit case of Theorem 3.3 of Reference 8. For the reader’s convenience a direct proof of it is given in Section 2. Various limit cases of the results of Reference 8 are discussed in Reference 9. By different methods, the formula Equation 1.8 was obtained by K. Johansson Reference 17.
Observe that all Bessel functions involved in the above formulas are of integer order. Also note that the ratios like $\mathsf{J}(x,y)$ are entire functions of $x$ and $y$ because $J_x$ is an entire function of $x$. In particular, the values $\mathsf{J}(x,x)$ are well defined. Various denominator–free formulas for the kernel $\mathsf{J}$ are given in Section 2.1.
exist, finite or infinite. Here $i,j=1,\dots ,s$. Observe that if $d_{ij}$ is finite, then $d_{ij}=x_i(n)-x_j(n)$ for $n\gg 0$.
In the case when $X(n)$ can be represented as $X(n)=X'(n)\cup X''(n)$ and the distance between $X'(n)$ and $X''(n)$ goes to $\infty$ as $n\to \infty$ we shall say that the sequence splits; otherwise, we call it nonsplit. Obviously, $X(n)$ is nonsplit if and only if all $x_i(n)$ stay at a finite distance from each other.
Define the correlation functions $\boldsymbol{\varrho }(n,\,\cdot \,)$ of the measures $M_n$ by the same rule as in Equation 1.4:
We are interested in the limit of $\boldsymbol{\varrho }(n,X(n))$ as $n\to \infty$. This limit will be computed in Theorem 3 below. As we shall see, if $X(n)$ splits, then the limit correlations factor accordingly.
Introduce the following discrete sine kernel which is a translation invariant kernel on the lattice $\mathbb{Z}$,
$$\begin{equation*} \mathsf{S}(k,a)= \begin{cases} 0\,,& \text{$a\ge 2$ or $a\le -2$ and $k\ne 0$}\,,\\ 1 \,, &\text{$a\le -2$ and $k= 0$}\,. \end{cases} \end{equation*}$$
The following result describes the local structure of a Plancherel typical partition.
Theorem 3.
Let $X(n)\subset \mathbb{Z}$ be a regular sequence and let the numbers $a_i$,$d_{ij}$ be defined by Equation 1.10, Equation 1.11. If $X(n)$ splits, that is, if $X(n)=X'(n)\cup X''(n)$ and the distance between $X'(n)$ and $X''(n)$ goes to $\infty$ as $n\to \infty$, then
Notice that, in particular, Theorem 3 implies that, as $n\to \infty$, the shape of a typical partition $\lambda$ near any point of the limit curve $\Omega$ is described by a stationary random process. For distinct points on the curve $\Omega$ these random processes are independent.
Remark 1.5.
By complementation (see Sections A.3 and 3.2), one obtains from Theorem 3 an equivalent statement about the asymptotics of the following correlation functions:
The discrete sine kernel was studied before (see Reference 44Reference 45), mainly as a model case for the continuous sine kernel. In particular, the asymptotics of Toeplitz determinants built from the discrete sine kernel was obtained by H. Widom Reference 45 who was answering a question of F. Dyson. We thank S. Kerov for pointing out this reference.
Remark 1.7.
Note that, in particular, Theorem 3 implies that the limit density (the 1-point correlation function) is given by
This is in agreement with the Logan-Shepp-Vershik-Kerov result about the limit shape $\Omega$. More concretely, the function $\Omega$ is related to the density Equation 1.14 by
Set $w=\dfrac{i}{\sqrt {n}}$. Then the above relation reads $\Delta w \approx \boldsymbol{\varrho }(\infty ,u) \, \Delta u$ and it should be satisfied on the boundary $v=\Omega (u)$ of the limit shape. Since $v=u+2w$, we conclude that
1.4. Behavior near the edge of the spectrum and the Airy ensemble
The discrete sine kernel $\mathsf{S}(k,a)$ vanishes if $a\ge 2$. Therefore, it follows from Theorem 3 that the limit correlations $\lim \boldsymbol{\varrho }(n,X(n))$ vanish if $a_i\ge 2$ for some $i$. However, as will be shown below in Proposition 4.1, after a suitable scaling near the edge $a=2$, the correlation functions $\boldsymbol{\varrho }^\theta$ converge to the correlation functions given by the Airy kernel Reference 12Reference 36
In fact, the following more precise statement is true about the behavior of the Plancherel measure near the edge $a=2$. By symmetry, everything we say about the edge $a=2$ applies to the opposite edge $a=-2$.
Consider the random point process on $\mathbb{R}$ whose correlation functions are given by the determinants
be its random configuration. We call the random variables $\zeta _i$’s the Airy ensemble. It is known Reference 12Reference 36 that the Airy ensemble describes the behavior of the (properly scaled) 1st, 2nd, and so on largest eigenvalues of a Gaussian random Hermitian matrix. The distribution of individual eigenvalues was obtained by Tracy and Widom in Reference 36 in terms of certain Painlevé transcendents.
It has been conjectured by Baik, Deift, and Johansson that the random variables
converge, in distribution and together with all moments, to the Airy ensemble. They verified this conjecture for individual distribution of $\lambda _1$ and $\lambda _2$ in Reference 3 and Reference 4, respectively. In particular, in the case of $\lambda _1$, this generalizes the result of Reference 40Reference 42 that $\frac{\lambda _1}{\sqrt n} \to 2$ in probability as $n\to \infty$. The computation of $\lim \frac{\lambda _1}{\sqrt n}$ was known as the Ulam problem; different solutions to this problem were given in Reference 1Reference 16Reference 32; see also the survey Reference 2.
to the corresponding quantities for the Airy ensemble was established in Reference 25. The proof in Reference 25 was based on a combinatorial interpretation of Equation 1.16 as the asymptotics in a certain enumeration problem for random surfaces.
In the present paper we use different ideas to prove the following
Theorem 4.
As $n\to \infty$, the random variables $\widetilde{\lambda }$ converge, in joint distribution, to the Airy ensemble.
This is done in Section 4 using methods described in the next subsection. The result stated in Theorem 4 was independently obtained by K. Johansson in Reference 17. See, for example, Reference 13 for an application of Theorem 4.
1.5. Poissonization and depoissonization
We obtain Theorems 3 and 4 from Theorem 1 using the so-called depoissonization techniques. We recall that the fundamental idea of depoissonization is the following.
Given a sequence $b_1,b_2,b_3,\dots$ its poissonization is, by definition, the function
Provided the $b_k$’s grow not too rapidly this is an entire function of $\theta$. In combinatorics, it is usually called the exponential generating function of the sequence $\{b_k\}$. Various methods of extracting asymptotics of sequences from their generating functions are classically known and widely used (see for example Reference 39 where such methods are used to obtain the limit shape of a typical partition under various measures on the set of partitions).
A probabilistic way to look at the generating function Equation 1.17 is the following. If $\theta \ge 0$, then $B(\theta )$ is the expectation of $b_\eta$ where $\eta \in \{0,1,2,\dots \}$ is a Poisson random variable with parameter $\theta$. Because $\eta$ has mean $\theta$ and standard deviation $\sqrt \theta$, one expects that
provided the variations of $b_k$ for $|k-n|\le \operatorname {const}\sqrt n$ are small. One possible regularity condition on $b_n$ which implies Equation 1.18 is monotonicity. In a very general and very convenient form, a depoissonization lemma for nonincreasing nonnegative $b_n$ was established by K. Johansson in Reference 16. We use this lemma in Section 4 to prove Theorem 4.
Another approach to depoissonization is to use a contour integral
where $C$ is any contour around $z=0$. Suppose, for a moment, that $b_n$ is constant, $b=b_n=B(z)$. The function $e^z/z^n=e^{z-n\ln z}$ has a unique critical point $z=n$. If we choose $|z|=n$ as the contour $C$, then only neighborhoods of size $|z-n|\le \operatorname {const}\sqrt {n}$ contribute to the asymptotics of Equation 1.19. Therefore, for general $\{b_n\}$, we still expect that provided the overall growth of $B(z)$ is under control and the variations of $B(z)$ for $|z-n| \le \operatorname {const}\sqrt n$ are small, the asymptotically significant contribution to Equation 1.19 will come from $z=n$. That is, we still expect Equation 1.18 to be valid. See, for example, Reference 15 for a comprehensive discussion and survey of this approach.
We use this approach to prove Theorem 3 in Section 3. The growth conditions on $B(z)$ which are suitable in our situation are spelled out in Lemma 3.1.
In our case, the functions $B(\theta )$ are combinations of the Bessel functions. Their asymptotic behavior as $\theta \approx n \to \infty$ can be obtained directly from the classical results on asymptotics of Bessel functions which are discussed, for example, in the fundamental Watson’s treatise Reference 43. These asymptotic formulas for Bessel functions are derived using the integral representations of Bessel functions and the steepest descent method. The different behavior of the asymptotics in the bulk $(-2,2)$ of the spectrum, near the edges $\pm 2$ of the spectrum, and outside of $[-2,2]$ is produced by the different location of the saddle point in these three cases.
1.6. Organization of the paper
Section 2 contains the proof of Theorems 1 and 2 and also various formulas for the kernels $\mathsf{K}$ and $\mathsf{J}$. We also discuss a difference operator which commutes with $\mathsf{J}$ and its possible applications.
Section 3 deals with the behavior of the Plancherel measure in the bulk of the spectrum; there we prove Theorem 3. Theorem 4 and a similar result (Theorem 5) for the poissonized measure $M^\theta$ are established in Section 4.
At the end of the paper there is an Appendix, where we collected some necessary results about Fredholm determinants, point processes, and convergence of trace class operators.
2. Correlation functions of the measures $M^\theta$
As noted above, Theorem 1 is a limit case of Theorem 3.3 of Reference 8. That theorem concerns a family $\{M^{(n)}_{zz'}\}$ of probability measures on partitions of $n$, where $z,z'$ are certain parameters. When the parameters go to infinity, $M^{(n)}_{zz'}$ tends to the Plancherel measure $M_n$. Theorem 3.3 in Reference 8 gives a determinantal formula for the correlation functions of the measure
in terms of a certain hypergeometric kernel. Here $t=zz'>0$ and $\xi \in (0,1)$ is an additional parameter. As $z,z'\to \infty$ and $\xi =\frac{\theta }{t}\to 0$, the negative binomial distribution in Equation 2.1 tends to the Poisson distribution with parameter $\theta$. In the same limit, the hypergeometric kernel becomes the kernel $\mathsf{K}$ of Theorem 1. The Bessel functions appear as a suitable degeneration of hypergeometric functions.
Recently, these results of Reference 8 were considerably generalized in Reference 26, where it was shown how this type of correlation functions can be computed using simple commutation relations in the infinite wedge space.
For the reader’s convenience, we present here a direct and elementary proof of Theorem 1 which uses the same ideas as in Reference 8 plus an additional technical trick, namely, differentiation with respect to $\theta$ which kills denominators. This trick yields a denominator–free integral formula for the kernel $\mathsf{K}$; see Proposition 2.7. Our proof here is a verification, not a derivation. For more conceptual approaches the reader is referred to Reference 26Reference 27Reference 7.
Let $x,y\in \mathbb{Z}+{\textstyle \frac{1}{2}}$. Introduce the following kernel $\mathsf{L}$:
We shall consider the kernels $\mathsf{K}$ and $\mathsf{L}$ as operators in the $\ell ^2$ space on $\mathbb{Z}+{\textstyle \frac{1}{2}}$.
We recall that simple multiplicative formulas (for example, the hook formula) are known for the number $\dim \lambda$ in Equation 1.1. For our purposes, it is convenient to rewrite the hook formula in the following determinantal form. Let $\lambda \!=\!(p_1,\dots ,p_d\,|\,q_1,\dots ,q_d)$ be the Frobenius coordinates of $\lambda$; see Section 1.2. We have
where $\operatorname {Fr}(\lambda )=\{x_1,\dots ,x_s\}\subset \mathbb{Z}+{\textstyle \frac{1}{2}}$ are the modified Frobenius coordinates of $\lambda$ defined in Equation 1.2.
Let $\operatorname {Fr}_*\left(M^\theta \right)$ be the push-forward of $M^\theta$ under the map $\operatorname {Fr}$. Note that the image of $\operatorname {Fr}$ consists of sets $X\subset \mathbb{Z}+{\textstyle \frac{1}{2}}$ having equally many positive and negative elements. For other $X\subset \mathbb{Z}+{\textstyle \frac{1}{2}}$, the right-hand side of Equation 2.3 can be easily seen to vanish. Therefore $\operatorname {Fr}_*\left(M^\theta \right)$ is a determinantal point process (see the Appendix) corresponding to $\mathsf{L}$, that is, its configuration probabilities are determinants of the form Equation 2.3.
Corollary 2.2.
$\det (1+\mathsf{L})=e^{\theta }$.
This follows from the fact that $M^\theta$ is a probability measure. This is explained in Propositions A.1 and A.4 in the Appendix. Note that, in general, one needs to check that $\mathsf{L}$ is a trace class operator.Footnote1 However, because of the special form of $\mathsf{L}$, it suffices to check a weaker claim – that $\mathsf{L}$ is a Hilbert–Schmidt operator, which is immediate.
1
Actually, $\mathsf{L}$ is of trace class because the sum of the absolute values of its matrix elements is finite. We are grateful to P. Deift for this remark.
Theorem 1 now follows from general properties of determinantal point processes (see Proposition A.6 in the Appendix) and the following
Proposition 2.3.
$\mathsf{K}=\mathsf{L}\,(1+\mathsf{L})^{-1}$.
We shall need three identities for Bessel functions which are degenerations of the identities (3.13–15) in Reference 8 for the hypergeometric function. The first identity is due to Lommel (see Reference 43, Section 3.2, or Reference 14, 7.2.(60)):
we find that $r(\nu +1,z)=r(\nu ,z)$. Hence for any $z$ it is a periodic function of $\nu$ and it suffices to show that $\lim _{\nu \to \infty } r(\nu ,z)=0$. Clearly, the left-hand side in Equation 2.6 goes to 0 as $\nu \to \infty$. From the defining series for $J_\nu$ it is clear that
Now the verification of Equation 2.10 becomes a straightforward application of the formulas Equation 2.5 and Equation 2.6, except for the occurrence of the singularity $\nu \in \mathbb{Z}_{\le 0}$ in those formulas. This singularity is resolved using Equation 2.4. This concludes the proof of Proposition 2.3 and Theorem 1.
It is clear that since the $\varepsilon$-factors cancel out of all determinantal formulas, this lemma and Proposition A.8 establish the equivalence of Theorems 1 and 2.
Examine the right-hand side. The terms with $n=0,\dots ,k-1$ vanish because then $1/\Gamma (-k+n+1)=0$. The term with $n=k$ is equal to 1, which corresponds to 1 in the left-hand side. Next, the terms with $n=k+1,\dots ,2k$ vanish because for these values of $n$, the expression $(-2k+n)_n$ vanishes. Finally, for $n\ge 2k+1$, set $n=2k+1+m$. Then the $n$th term in the second sum is equal to minus the $m$th term in the first sum. Indeed, this follows from the trivial relation
Recall that since $J_x$ is an entire function of $x$, the function $\mathsf{J}(x,y)$ is entire in $x$ and $y$. We shall now obtain several denominator–free formulas for the kernel $\mathsf{J}$.
Our argument is similar to an argument due to Tracy and Widom; see the proof of the formula (4.6) in Reference 36. The recurrence relation Equation 2.8 implies that
Consequently, the difference between the left-hand side and the right-hand side of Equation 2.15 is a function which depends only on $x-y$. Let $x$ and $y$ go to infinity in such a way that $x-y$ remains fixed. Because of the asymptotics Equation 2.9 both sides in Equation 2.15 tend to zero and, hence, the difference actually is 0.
■
In the same way as in Reference 36 this results in the following
Corollary 2.10.
For any $a\in \mathbb{Z}$, the restriction of the kernel $\mathsf{J}$ to the subset $\{a,a+1,a+2,\dots \}\subset \mathbb{Z}$ defines a nonnegative trace class operator in the $\ell ^2$ space on that subset.
Proof.
By Proposition 2.9, the restriction of $\mathsf{J}$ on $\{a,a+1,a+2,\dots \}$ is the square of the kernel $(x,y)\mapsto J_{x+y+1-a}(2\sqrt \theta )$. Since the latter kernel is real and symmetric, the kernel $\mathsf{J}$ is nonnegative. Hence, it remains to prove that its trace is finite. Again, by Proposition 2.9, this trace is equal to
$$\begin{equation*} \sum _{s=1}^\infty s \, (J_{a+s+1}(2\sqrt \theta ))^2. \end{equation*}$$
The kernel $\mathsf{J}$ resembles a Christoffel–Darboux kernel and, in fact, the operator in $\ell ^2(\mathbb{Z})$ defined by the kernel $\mathsf{J}$ is an Hermitian projection operator. Recall that $\mathsf{K}=\mathsf{L}(1+\mathsf{L})^{-1}$, where $\mathsf{L}$ is of the form
One can prove that this together with Lemma 2.5 imply that $\mathsf{J}$ is an Hermitian projection kernel. However, in contrast to a Christoffel–Darboux kernel, it projects to an infinite–dimensional subspace.
Note that in Reference 17 the restriction of the kernel $J$ to $\mathbb{Z}_+$ was obtained as a limit of Christoffel–Darboux kernels for Charlier polynomials.
2.4. Commuting difference operator
Consider the difference operators $\Delta$ and $\nabla$ on the lattice $\mathbb{Z}$,
where $\alpha$ and $\beta$ are operators of multiplication by certain functions $\alpha (k)$,$\beta (k)$. The operator Equation 2.17 is self–adjoint in $\ell ^2(\mathbb{Z})$. A straightforward computation shows that
It follows that if $\alpha (s)=0$ for a certain $s\in \mathbb{Z}$, then the space of functions $f(k)$ vanishing for $k<s$ is invariant under $D$.
Proposition 2.12.
Let $[\mathsf{J}]_s$ denote the operator in $\ell ^2(\{s,s+1,\dots \})$ obtained by restricting the kernel $\mathsf{J}$ to $\{s,s+1,\dots \}$. Then the difference Sturm–Liouville operator Equation 2.17 commutes with $[\mathsf{J}]_s$ provided
Since $[\mathsf{J}]_s$ is the square of the operator with the kernel $J_{k+l+1-s}$, it suffices to check that the latter operator commutes with $D$, with the above choice of $\alpha$ and $\beta$. But this is readily checked using Equation 2.18.
■
This proposition is a counterpart of a known fact about the Airy kernel; see Reference 36. Moreover, in the scaling limit when $\theta \to \infty$ and
which commutes with the Airy operator restricted to $(\varsigma ,+ \infty )$. The above differential operator is exactly that of Tracy and Widom Reference 36.
Remark 2.13.
Presumably, this commuting difference operator can be used to obtain, as was done in Reference 36 for the Airy kernel, asymptotic formulas for the eigenvalues of $[\mathsf{J}]_s$, where $s=2\sqrt \theta +\varsigma \,\theta ^{1/6}$ and $\varsigma \ll 0$. Such asymptotic formulas may be very useful if one wishes to refine Theorem 4 and to establish convergence of moments in addition to convergence of distribution functions. For individual distributions of $\lambda _1$ and $\lambda _2$ the convergence of moments was obtained, by other methods, in Reference 3Reference 4.
3. Correlation functions in the bulk of the spectrum
We refer the reader to Section 1.3 of the Introduction for the definition of a regular sequence $X(n)\subset \mathbb{Z}$ and the statement of Theorem 3. Also, in this section, we shall be working in the bulk of the spectrum, that is, we shall assume that all numbers $a_i$ defined in Equation 1.10 lie inside $(-2,2)$. The edges $\pm 2$ of the spectrum and its exterior will be treated in the next section.
In our proof, we shall follow the strategy explained in Section 1.5. Namely, in order to compute the limit of $\boldsymbol{\varrho }(n,X(n))$ we shall use the contour integral
compute the asymptotics of $\boldsymbol{\varrho }^\theta$ for $\theta \approx n$, and estimate $|\boldsymbol{\varrho }^\theta |$ away from $\theta =n$. Both tasks will be accomplished using classical results about the Bessel functions.
We start our proof with the following lemma which formalizes the above informal depoissonization argument. The hypothesis of this lemma is very far from optimal, but it is sufficient for our purposes. For the rest of this section, we fix a number $0<\alpha <1/4$ which shall play an auxiliary role.
imply that the integral $\int _{S_1}$ decays exponentially provided $C$ is large enough. On $S_2$, the inequality Equation 3.2 applies for sufficiently large $n$ and gives
Denote by $\mathcal{F}$ the algebra (with respect to term-wise addition and multiplication) of sequences $\{f_n(z)\}$ which satisfy the properties Equation 3.1 and Equation 3.2 for some, depending on the sequence, constants $f_\infty$ and $\gamma$. Introduce the map
In the proof of this proposition it will be convenient to allow $X\subset \mathbb{C}$. For complex sequences $X$ we shall require $a\in \mathbb{R}$; the number $d\in \mathbb{C}$ may be arbitrary.
Lemma 3.5.
Suppose that a sequence $X\subset \mathbb{C}$ is as above and, additionally, suppose that $\Im x_n$,$\Im y_n$ are bounded and $d\ne 0$. Then the sequence $\left\{\mathsf{J}(x_n,y_n;z)\right\}$ satisfies Equation 3.2 with $f_\infty = \mathsf{S}(d,a)$ and certain $\gamma$.
We shall use Debye’s asymptotic formulas for Bessel functions of complex order and large complex argument; see, for example, Section 8.6 in Reference 43. Introduce the function
provided that $z\to \infty$ in such a way that $u$ stays in some neighborhood of $(-2,2)$; the precise form of this neighborhood can be seen in Figure 22 in Section 8.61 of Reference 43. Because we assume that
and because $|z/n-1|<n^{-\alpha }$, the ratios ${x_n}/{\sqrt {z}}$,${y_n}/{\sqrt {z}}$ stay close to $(-2,2)$. For future reference, we also point out that the constant in $O\left(z^{-1/2}\right)$ in Equation 3.4 is uniform in $u$ provided $u$ is bounded away from the endpoints $\pm 2$.
First we estimate $\Im \left(\sqrt {z} \, G(u)\right)$. The function $G$ clearly takes real values on the real line. From the obvious estimate
Below we shall need this lemma for a variable sequence $X=\{x_n,y_n\}$. Therefore, let us spell out explicitly under what conditions on $X$ the estimates in Lemma 3.5 remain uniform. We need the sequences $\frac{x_n}{\sqrt {n}}$ and $\frac{y_n}{\sqrt {n}}$ to converge uniformly; then, in particular, the ratios $\frac{x_n}{\sqrt {n}}$ and $\frac{y_n}{\sqrt {n}}$ are uniformly bounded away from $\pm 2$. Also, we need $\Im x_n$ and $\Im y_n$ to be uniformly bounded. Finally, we need $|d|$ to be uniformly bounded from below.
First, we check the condition Equation 3.2. In the case $d\ne 0$ this was done in the previous lemma. Suppose, therefore, that $\{x_n\}$ is a regular sequence in $\mathbb{Z}_{\ge 0}$ and consider the asymptotics of $\mathsf{J}(x_n,x_n;z)$.
Because the function $\mathsf{J}(x,y;z)$ is an entire function of $x$ and $y$ we have
which is valid for $|\arg z|<\pi$ and even for $\arg z=\pm \pi$ provided $\Re x>0$ or $x\in \mathbb{Z}$.
If $x\in \mathbb{Z}$, then the second summand in Equation 3.8 vanishes and the first summand is $O\left(e^{\operatorname {const}|z|^{1/2}}\right)$ uniformly in $x\in \mathbb{Z}$. This implies the estimate Equation 3.1 provided $d\ne 0$.
It remains, therefore, to check Equation 3.1 for $\mathsf{J}(x_n,x_n;z)$ where $\{x_n\}\in \mathbb{Z}$ is a regular sequence. Again, we use Equation 3.7. Observe that since $\Re \sqrt {z}\ge 0$, the second summand in Equation 3.8 is uniformly small provided $\Im x$ is bounded from above and $\Re x$ is bounded from below. Therefore, Equation 3.7 produces the Equation 3.1 estimate for $x_n\ge 1$. For $x_n\le 0$ we use the relation Equation 2.13 and the reccurence Equation 2.16 to obtain the estimate.
Let $X(n)$ be a regular sequence and let the numbers $a_i$ and $d_{ij}$ be defined by Equation 1.10, Equation 1.11. We shall assume that $|a_i|<2$ for all $i$. The validity of the theorem in the case when $|a_i|\ge 2$ for some $i$ will be obvious from the results of the next section.
where the first line is the definition of $\boldsymbol{\varrho }^\theta$ and the second is Theorem 2. From Equation 3.9 it is obvious that $\boldsymbol{\varrho }^\theta$ is entire. Therefore, we can apply Lemma 3.1 to it. It is clear that Lemma 3.1, together with Proposition 3.4, implies Theorem 3. The factorization Equation 1.12 follows from the vanishing $\mathsf{S}(\infty ,a)=0$.
■
3.2. Asymptotics of $\rho (n,X)$
Recall that the correlation functions $\rho (n,X)$ were defined by
The asymptotics of these correlation functions can be easily obtained from Theorem 3 by complementation (see Sections A.3 and 2.2), and the result is the following.
Let $X(n)\subset \mathbb{Z}+{\textstyle \frac{1}{2}}$ be a regular sequence. If it splits, then $\lim _{n\to \infty } \rho (n,X(n))$ factors as in Equation 1.12. Suppose therefore, that $X(n)$ is nonsplit. Here one has to distinguish two cases. If $X(n)\subset \mathbb{Z}_{\ge 0}+{\textstyle \frac{1}{2}}$ or $X(n)\subset \mathbb{Z}_{\le 0}-{\textstyle \frac{1}{2}}$, then we shall say that this sequence is off-diagonal. Geometrically, it means that $X(n)$ corresponds to modified Frobenius coordinates of only one kind: either the row ones or the column ones. For off-diagonal sequences we obtain from Theorem 3 by complementation that
where $\mathsf{S}$ is the discrete sine kernel and $a=a_1=a_2=\dots$.
If $X(n)$ is nonsplit and diagonal, that is, if it is nonsplit and includes both positive and negative numbers, then one has to assume additionally that the number of positive and negative elements of $X(n)$ stabilizes for sufficiently large $n$. In this case the limit correlations are given by the kernel
provided $x$ and $x+1$ have the same sign and similarly for $y$. Therefore, if the subsets $X\subset \mathbb{Z}+{\textstyle \frac{1}{2}}$ and $X+m$,$m\in \mathbb{Z}$, have the same number of positive and negative elements, then
4. Edge of the spectrum: Convergence to the Airy ensemble
4.1. Results and strategy of proof
In this section we prove Theorem 4 which was stated in Section 1.4 of the Introduction. We refer the reader to Section 1.4 for a discussion of the relation between Theorem 4 and the results obtained in Reference 3Reference 4Reference 25.
where $A(x)$ is the Airy function Equation 1.15. The Airy ensemble is, by definition, a random point process on $\mathbb{R}$, whose correlation functions are given by
This ensemble was studied in Reference 36. We denote by $\zeta _1 > \zeta _2 > \dots$ a random configuration of the Airy ensemble. Theorem 4 says that after a proper scaling and normalization, the rows $\lambda _1,\lambda _2,\dots$ of a Plancherel random partition $\lambda$ converge in joint distribution to the Airy ensemble. Namely, the random variables $\widetilde{\lambda }$,
converge, in joint distribution, to the Airy ensemble as $n\to \infty$.
In the proof of Theorem 4, we shall follow the strategy explained in Section 1.5 of the Introduction. First, we shall prove that under the poissonized measure $M^\theta$ on the set of partitions $\lambda$, the random variables $\widetilde{\lambda }$ converge, in joint distribution, to the Airy ensemble as $\theta \approx n\to \infty$. This result is stated below as Theorem 5. From this, using certain monotonicity and Lemma 4.7 which is due to K. Johansson, we shall conclude that the same is true for the measures $M_n$ as $n\to \infty$.
The proof of Theorem 5 will be based on the analysis of the behavior of the correlation functions of $M^\theta$,$\theta \approx n \to \infty$, near the point $2\sqrt {n}$. From the expression for correlation functions of $M^\theta$ given in Theorem 1 it is clear that this amounts to the study of the asymptotics of $J_{2\sqrt {n}}(2\sqrt \theta )$ when $\theta \approx n \to \infty$. This asymptotics is classically known and from it we shall derive the following
where $\zeta _1>\zeta _2>\dots$ is the Airy ensemble.
Observe that the limit behavior of $\widetilde{\lambda }$ is, obviously, identical with the limit behavior of similarly scaled $1$st,$2$nd, and so on maximal Frobenius coordinates.
Proofs of Proposition 4.1 and Theorem 5 are given Section 4.2. In Section 4.3, using a depoissonization argument based on Lemma 4.7 we deduce Theorem 4.
Remark 4.2.
We consider the behavior of any number of first rows of $\lambda$, where $\lambda$ is a Plancherel random partition. By symmetry, same results describe the behavior of any number of first columns of $\lambda$.
for some kernel $K(x,y)$. Let $I$ be a possibly infinite interval $I\subset \mathbb{R}$. By $[K]_I$ we denote the operator in $L^2(I,dx)$ obtained by restricting the kernel on $I\times I$. Assume $[K]_I$ is a trace class operator. Then the intersection of the random configuration $X$ with $I$ is finite almost surely and
More generally, if $I_1,\dots ,I_m$ is a finite family of pairwise nonintersecting intervals such that the operators $[K]_{I_1},\dots ,[K]_{I_m}$ are trace class, then
are finite linear combinations of expressions of the form Equation 4.2. Therefore, in order to show the convergence in distribution of point processes with determinantal correlation functions, it suffices to show the convergence of expressions of the form Equation 4.2.
The formula Equation 4.2 is discussed, for example, in Reference 37. See also Theorem 2 in Reference 35. It remains valid for processes on a lattice such as $\mathbb{Z}$ in which case the kernel $K$ should be an operator in $\ell ^2(\mathbb{Z})$.
As verified, for example, in Proposition A.11 in the Appendix, the right-hand side of Equation 4.2 is continuous in each $[K]_{I_i}$ with respect to the trace norm. We shall show that after a suitable embedding of $\ell ^2(\mathbb{Z})$ into $L^2(\mathbb{R})$ the kernel $\mathsf{J}(x,y;\theta )$ converges to the Airy kernel $\mathsf{A}(x,y)$ as $\theta \to \infty$.
Namely, we shall consider a family of embeddings $\ell ^2(\mathbb{Z})\to L^2(\mathbb{R})$, indexed by a positive number $r>0$, which are defined by
where $\chi _k\in \ell ^2(\mathbb{Z})$ is the characteristic function of the point $k\in \mathbb{Z}$ and, similarly, the function on the right is the characteristic function of a segment of length $r^{-1/3}$. Observe that this embedding is isometric. Let $\mathsf{J}_r$ denote the kernel on $\mathbb{R}\times \mathbb{R}$ that is obtained from the kernel $\mathsf{J}(\,\cdot \,,\,\cdot \,,r^2)$ on $\mathbb{Z}\times \mathbb{Z}$ using the embedding Equation 4.3. We shall establish the following
and $a_i^-(r),a_i^+(r)\to a_i$ as $r\to \infty$. Then, on the one hand, the probability in the left-hand side of Equation 4.1 lies between the corresponding probabilities for $a_i^-(r)$ and $a_i^+(r)$. On the other hand, the probabilities for $a_i^-(r)$ and $a_i^+(r)$ can be expressed in the form Equation 4.2 for the kernel $\mathsf{J}_r$ and by Proposition 4.3 and continuity of the Airy kernel they converge to the corresponding probabilities given by the Airy kernel as $r\to \infty$.
■
Now we get to the proofs of Propositions 4.1 and 4.3 which will require some computations. Recall that the Airy function can be expressed in terms of Bessel functions as follows:
First suppose that $s\le \varepsilon r$. Set $\nu =r+Ar^{\frac{1}{3}}+s$. We shall use Equation 4.7 with $\alpha =\operatorname {arccosh}(\nu /r)$. Provided $\varepsilon$ is chosen small enough and $r$ is sufficiently large, $\alpha$ will be close to $0$ and we will be able to use Taylor expansions. For $r\gg 0$ we have
that is, one sum for $l\le \varepsilon r$ and the other for $l> \varepsilon r$, and apply Lemma 4.5 to these two sums. Note that $2r$ here corresponds to $r$ in Lemma 4.5; this produces factors of $2^{\frac{1}{3}}$ and does not affect the estimate.
Let the $c_i$’s stand for some positive constants not depending on $M$. From Equation 4.9 we obtain the following estimate for the first sum:
Let us fix $\delta >0$ and pick $M>0$ according to Lemma 4.6. Since, by assumption, $x$ and $y$ lie in a compact set of $\mathbb{R}$, we can fix $m$ such that $x,y \ge m$. Set
and it converges to this integral as $r\to +\infty$. Since the absolute value of the second term in the right-hand side of Equation 4.13 does not exceed $\delta r^{-\frac{1}{3}}$ by the choice of $N$, we get
as $r\to +\infty$, and this estimate is uniform on compact sets. Now let $\delta \to 0$ and $M\to +\infty$. By Equation 4.5 the integral Equation 4.12 converges uniformly in $x$ and $y$ on compact sets and we obtain the claim of the proposition.
It is clear that Proposition 4.1 implies the convergence of $[\mathsf{J}_r]_a$ to $[\mathsf{A}]_a$ in the weak operator topology. Therefore, by Proposition A.9, it remains to prove that $\operatorname {tr}[\mathsf{J}_r]_a\to \operatorname {tr}[\mathsf{A}]_a$ as $r\to +\infty$. We have
where the $o(1)$ correction comes from the fact that $a$ may not be a number of the form $\frac{k-2r}{r^{1/3}}$,$k\in \mathbb{Z}$. By Equation 4.11 we have
Since we already established the uniform convergence of kernels on compact sets, it is enough to show that both Equation 4.14 and Equation 4.15 go to zero as $a\to +\infty$ and $r\to +\infty$. For the Airy kernel this is clear from Equation 4.5. For the kernel $\mathsf{J}_r$ it is equivalent to the following statement: for any $\delta >0$ there exists $M_0>0$ such that for all $M>M_0$ and large enough $r$ we have
$$\begin{equation} \left|\sum _{l=1}^\infty l \, J_{2r+Mr^{\frac{1}{3}}+l}^2(2r)\right|<\delta \,. \cssId{e410}{\tag{4.16}} \end{equation}$$
We shall employ Lemma 4.5 for $A=M$. Again, we split the sum in Equation 4.16 into two parts:
This is the distribution function corresponding to the measure $M^\theta$.
The measures $M_n$ can be obtained as distribution at time $n$ of a certain random growth process of a Young diagram; see e.g. Reference 42. This implies that
Appendix A. General properties of determinantal point processes
In this Appendix, we collect some necessary facts about determinantal point processes, their correlation functions, Fredholm determinants, and convergence of trace class operators.
Let $\mathfrak{X}$ be a countable set, let $\operatorname {Conf}(\mathfrak{X})=2^{\mathfrak{X}}$ be the set of subsets of $\mathfrak{X}$ and denote by $\operatorname {Conf}(\mathfrak{X})_0\subset \operatorname {Conf}(\mathfrak{X})$ the set of finite subsets of $\mathfrak{X}$. We call elements of $\operatorname {Conf}(\mathfrak{X})$ configurations. Let $L$ be a kernel on $\mathfrak{X}$, that is, a function on