Harold Widom’s work in random matrix theory
Abstract
This is a survey of Harold Widom’s work in random matrices. We start with his pioneering papers on the sine-kernel determinant, continue with his and Craig Tracy’s groundbreaking results concerning the distribution functions of random matrix theory, touch on the remarkable universality of the Tracy–Widom distributions in mathematics and physics, and close with Tracy and Widom’s remarkable work on the asymmetric simple exclusion process.
1. Introduction
The distributions of random matrix theory govern the statistical properties of a wide variety of large systems which do not obey the usual laws of classical probability. Such systems appear in many different areas of applied science and technology, including heavy nuclei, polymer growth, high-dimensional data analysis, and certain percolation processes. The four distribution functions play a particularly important role in the mathematical apparatus of random matrices. The first one describes the emptiness formation probability in the bulk of the spectrum of a large random matrix, and it is explicitly given in terms of the sine-kernel Fredholm determinant. The second, the third, and the forth distributions are given in terms of the Airy kernel Fredholm determinant, and they describe the edge fluctuations of the eigenvalues in the large size limit of the matrices taken from the three classical Gaussian ensembles, unitary (GUE), orthogonal (GOE), and symplectic (GSE). The last three of these distributions are known now as the Tracy–Widom distribution functions.
A key analytical observation concerning the distribution functions of random matrix theory, which was made on many occasions in the papers Reference 5Reference 6Reference 7Reference 10Reference 11Reference 12, is that they satisfy certain nonlinear integrable PDEs. This property, which generalizes the first result of this type established in Reference JMMS for the sine-kernel determinant, follows in turn from a remarkable Fredholm determinant representation for the random matrix distributions. The existence of such representations in several important examples beyond the sine-kernel case was also first shown in the above-mentioned papers.
What follows is an overview of the principal contributions of Harold Widom to random matrix theory. We start with his papers Reference 3 and Reference 9 on the sine-kernel determinant, which brought mathematical rigor to the theory of this, the first universal distribution function of random matrix theory.
2. The sine-kernel determinant
Let be a union of disjoint intervals in Consider the Fredholm determinant .
where is the trace class operator in with kernel
The determinant plays a central role in random matrix theory. Indeed, it is the probability of finding no eigenvalues in the union of intervals for a random Hermitian matrix chosen from the Gaussian unitary ensemble (GUE), in the bulk scaling limit with mean spacing 1 (see Reference Meh and Figure 1). Moreover, in the one interval case, the second derivative of describes the distribution of normalized spacings of eigenvalues of large random GUE matrices (see, e.g., Reference Dei1). The determinant also appears in quantum and statistical mechanics. For instance, it describes the emptiness formation probability in the one-dimensional impenetrable Bose gas Reference Len and the gap probability in the one-dimensional Coulomb gas, at inverse temperature (see, e.g., Reference Dys1). The key analytical issue related to is its large behavior, i.e., the large gap asymptotics. Harold Widom made major contributions to the resolution of this question.
In the one interval case, after rescaling and translation, we may assume , For this case, in 1973, des Cloizeaux and Mehta .Reference dCM showed that as ,
for some constant In 1976 Dyson .Reference Dys2 showed that in fact has a full asymptotic expansion of the form
Dyson identified all the constants , , , Of particular interest is the constant . which he found to be ,
where is the Riemann zeta-function. It should be noted that Dyson obtained this result using one of the early results of Widom Reference 1 on the asymptotics of Toeplitz determinants with symbols supported on circular arcs.
The results in Reference dCM and Reference Dys2 were not fully rigorous. In Reference 3, using an adaptation of Szegő’s classical method to the continuous analogues of orthogonal polynomials (the so-called Krein functions), Widom gave the first rigorous proof of the leading asymptotics in Equation 2.1 in the form
as Actually, Widom proved a slightly stronger result, .
as In addition, Widom computed the leading asymptotics of the quantity . which is the ratio of the probability that there is at most one eigenvalue in the interval , to In the subsequent paper .Reference 9, Widom considered the multi-interval case, and showed that as , ,
where is a negative constant and is a certain bounded oscillatory function of which was described up to the solution of a Jacobi inversion problem. ,Footnote1 The method of Reference 9 is a further development of the approach in Reference 3. As in Reference 3, Widom also computes the leading asymptotics of .
An explicit formula for in terms of the Riemann theta-function, was obtained later in ,Reference DIZ.
The formula Equation 2.3 for in Reference Dys2 was in the form of conjecture: A rigorous proof of Equation 2.3 was only given in 2004. In fact, two proofs of this formula were given independently in Reference Kra and Reference Ehr. It is remarkable that the proof of Equation 2.3 in Reference Kra again uses (in conjunction with the nonlinear steepest descent method) Widom’s computation in Reference 1.
3. The Tracy–Widom distribution functions
The joint probability densities of the eigenvalues of random matrices from the GOE, GUE, and GSE ensembles are given by
where is a normalization constant (or partition function) and
The famous Tracy–Widom distribution functions, commonly denoted as describe the edge fluctuation of the eigenvalues in the large , limit. They are defined via the scaling limits,
where is the largest eigenvalue drawn from the ensembles with density Equation 3.1. The central theme of the series of papers Reference 5Reference 6Reference 7Reference 10Reference 12 is the following analytical description of these distributions, which links them to the theory of integrable systems.
Let be the trace-class operator in with kernel
where is the Airy function,
Then, the Tracy–Widom distribution, is given by the Airy-kernel Fredholm determinant (see ,Reference For),
Moreover, as shown in Reference 6, the following formula is valid for the Fredholm determinant on the right-hand side of Equation 3.4:
where is the Hastings–McLeod solution of the second Painlevé equation, i.e., the solution of the ODE
uniquely determined by the boundary condition,
Similar Painlevé representations for the other two Tracy–Widom distribution functions (see Reference 10) have the form
where
and is the same Hastings–McLeod solution to the second Painlevé transcendent.Footnote2 Painlevé equations are integrable in the sense of Lax pairs, which, in particular, implies that their solutions admit a Riemann–Hilbert (RH) representation. A Riemann–Hilbert representation can be viewed as the nonabelian analogue of the familiar integral representations of the classical special functions, such as the Bessel functions, the Airy function, etc. A key consequence of the RH representation is that the Painlevé functions possess one of the principal features of a classical special function—a mechanism, viz. the nonlinear steepest descent method, to evaluate explicitly relevant asymptotic connection formulae (see, e.g., Reference FIKN). Specifically, in the case of the Hastings–McLeod solution, the integrability of the second Painlevé equation underlies the fact that, in addition to the asymptotic behavior Equation 3.7 at one also knows the asymptotic behavior of , at which is described in ,Reference HaMcFootnote3 as
A similar representation for the sine-kernel determinant (involving, this time, a special solution of the fifth Painlevé equation) was given earlier in Reference JMMS.
Hastings and McLeod derived Equation 3.9 using the inverse scattering transform which was a precursor of the Riemann–Hilbert method.
Herein lies the importance of the Tracy–Widom formulae Equation 3.5 and Equation 3.8—they provide the key distribution functions of random matrix theory with explicit representations that are amenable to detailed asymptotic analysis.
An immediate corollary of formula Equation 3.5 (and the known asymptotics Equation 3.9 of the Painlevé function) is an explicit formula for the large negative behavior of the distribution function :
The value of the constant was conjectured by Tracy and Widom in the paper Reference 6 to be the same as for the sine-kernel determinant, which is given by
where is the Riemann zeta-function.Footnote4
This conjecture was independently proved in Reference DIK and Reference BBD. Also, in Reference BBD, similar asymptotic results were established for the other two Tracy–Widom distributions.
Another important advantage of the Tracy–Widom formula Equation 3.8 is that it makes it possible to design, using the exact connection formula Equation 3.7, Equation 3.8, Equation 3.9, a very efficient scheme (see Reference Die, Reference DBT) for the numerical evaluationFootnote5 of the distribution functions .
It should be noted that the Hastings–McLeod solution is very unstable; indeed, a small change in the pre-exponential numerical factor in the normalization condition Equation 3.7 yields a completely different type of behavior of for negative including the appearance of singularities (e.g., see again ,Reference FIKN). Herein lies the importance of the knowledge of the behavior Equation 3.9 in order to adjust the numerical procedure appropriately. Said differently, knowledge of the connection formulae makes it possible to transform an unstable ODE initial value problem into a stable ODE boundary value problem.
The appearance of the Painlevé equations in the Tracy–Widom formulae is not accidental. As follows from results in Reference 7, the connection to integrable systems is already encoded in the determinant formula Equation 3.4. Indeed, the Airy-kernel integral operator belongs to a class of integral operators with kernels of the form
acting in where , is a union of intervals,
and , are This class of integral operators has appeared frequently in many applications related to random matrices and statistical mechanics. -functions.Footnote6 In Reference 7, it is shown that if the functions and satisfy a linear differential equation,
An integral operator with kernel Equation 3.12 is a special case of a so-called integrable Fredholm operator, i.e., an integral operator whose kernel is of the form
with some functions and defined on a contour This type of integral operator was singled out as a distinguished class in .Reference IIKS (see also Reference Dei2; in a different context, unrelated to integrable systems, these operators were also studied in the earlier work Reference Sak). A crucial property of a kernel Equation 3.13 is that the associated resolvent kernel is again an integrable kernel. Moreover, the functions “ and ”“ corresponding to the resolvent are determined via an auxiliary matrix Riemann–Hilbert problem whose jump-matrix is explicitly constructed in terms of the original ” -functions.
where
and , , and , are polynomials, then the Fredholm determinant, can be expressed in terms of the solution to a certain system of nonlinear partial differential equations with the end points , as independent variables. This system is integrable in the sense of Lax,Footnote7 and in the case of the Airy kernel it reduces to a single ODE—the second Painlevé equation.
The Lax-integrability of the Tracy–Widom system associated with the kernel Equation 3.12 was proven in Reference Pal, where this system was identified as a special case of isomonodromy deformation equations and the Fredholm determinant was identified as the corresponding -function.
In Reference 8, the results of Reference 7 were extended to the general case, that is, to the kernels of the form
acting in Here, . is, as before, a union of intervals, the constant matrix is antisymmetric, and are satisfying the linear differential equation -functions
where
The main result of Reference 8 is the derivation, for such operators
4. Universality of the Tracy–Widom distributions
A remarkable fact is that the Tracy–Widom distribution functions
4.1. General invariant ensembles and the Wigner ensembles
The GUE, GOE, and GSE ensembles are, respectively, particular examples of unitary (UE), orthogonal (OE), and symplectic (SE) ensembles of random matrix theory. The joint probability density of the eigenvalues for these ensembles are given by (cf. Equation 3.1)
where
(More general potential functions are also allowed.) GUE, GOE, and GSE correspond to the choice
where
Complementary to the invariant ensembles (UE, OE, and SE) are the so-called Wigner matrix ensembles, i.e., random matrices with independent identically distributed entries. Tracy–Widom edge universality for these ensembles is also valid, and this important fact was proved by Soshnikov Reference Sos. In addition, universality in the bulk of the spectrum for the Wigner ensembles is now established; see, e.g., Reference Erd for a comprehensive survey of the results and a historical review.
The Tracy–Widom distribution appears also in Hermitian matrix models with varying weights. These are unitary ensembles with the potential
Under certain regularity conditions, the leading large
where
The formula for the free energy reads
As shown in Reference DKM, in the case that
where
The number of intervals in the support of the equilibrium measure depends on the values of the parameters
For
As
where
and
4.2. Random permutations
Let
where
In addition, the authors of Reference BDJ proved convergence of moments:
It is worth noting that Harold Widom suggested in Reference 14 an alternative proof of the convergence of the moments Equation 4.12 based on the Borodin–Okounkov Fredholm determinant formula for Toeplitz determinants. The latter was first derived in Reference BoOk, and has been proven extremely useful in the theory and application of Toeplitz determinants. There are now several different proofs of this formula and the first one (after the original proof of Borodin and Okounkov) was again suggested by Harold Widom jointly with Estelle Basor in Reference 13. It is remarkable that this proof in turn is again based on one of the old papers of Widom Reference 2. We also refer to the article by Basor, Böttcher, and Ehrhardt Reference BBE in this volume for more details on the Borodin–Okounkov formula and its history and connection with Widom’s work on Toeplitz determinants. It is important to note that the Borodin–Okounkov formula was in fact discovered earlier by Geronimo and Case in Reference GeCa, but its significance was overlooked at the time.
Random permutations appear in an enormous array of problems. Some of the most fundamental connections are with measures on Young diagrams, random growth processes, interacting particle systems, last passage percolation models, and with various tiling problems. Formula Equation 4.11 and its generalizations for more complex types of permutations, which involve
We close this section by mentioning one remarkable generalization of the Ulam problem. Consider
In joint work Reference 4 with Janko Gravner and Craig Tracy, Harold Widom proved that
5. The asymmetric simple exclusion process. Harold Widom’s papers Reference 15Reference 16Reference 17
The asymmetric simple exclusion process (ASEP) was introduced (in the probability literature) in Spitzer’s 1970 work Reference Spi. It is a continuous-time Markov process of interacting particles on the integer lattice
When
The extension of this result to ASEP when
This method dates back to the 1931 work of Hans Bethe Reference Bet in studying the Heisenberg XXX spin chain Reference Hei.
The starting point for Tracy and Widom’s work on ASEP is an exact formula for the transition probability of the
where
where
When
There is hope, however. The generator of ASEP (the operator on the right-hand side of the Master equation) is related by a similarity transform to the Hamiltonian of the XXZ quantum spin chain, itself a special limit of the six-vertex model. These models have a long history in the realm of exactly solvable models in statistical mechanics (e.g., Reference Bax), and it was known since Reference Lieb that the coordinate Bethe ansatz can be used to diagonalize the six-vertex model transfer matrix. For the six-vertex model and XXZ spin chain, one has to assume boundary conditions (e.g., work on a torus). However, since the generator of ASEP is stochastic, one can work directly on the full space
The following formula was proved in Reference 15. The
Here, the sum
where the product is of terms
This sum over the symmetric group may seem reminiscent of the expansion formula for determinants. In the special case when
which recovers one aspect of the determinantal structure for TASEP; see Reference Sch.
Returning to Equation 5.1, just as in random matrix eigenvalue measures, the challenge now becomes how to extract large
Let us now record the main result of the asymptotics performed in Reference 17. In fact, in Reference 18, a more general class of initial data was considered in which sites in
Assume that
Then Reference 17Reference 18 shows that (cf. Equation 4.3 and Equation 4.11),
and
We emphasize again that ASEP is not a determinantal process and hence none of the usual integrable techniques—such as the orthogonal polynomial approach, the Riemann–Hilbert method, the Borodin–Okounkov formula, or the theory of integrable Fredholm operators—are readily applicable. In spite of the absence of such tools, the Tracy–Widom distributions still arise!
Besides demonstrating that the Tracy–Widom distribution is universal for ASEP regardless of the choice of parameters (provided
Over the decade or so since Tracy and Widom’s initial work on ASEP, there have been a number of developments in the field of integrable probability in this direction. This is too vast a subject to try to survey here, so we simply refer to a few existing surveys Reference ABWReference Bor2Reference BoGoReference BoPe1Reference BoPe2Reference BoWhReference Cor1Reference Cor2Reference Cor3Reference Cor4Reference OConReference QuSpReference Zyg and mention some general topics: Kardar–Parisi–Zhang universality class, replica method, Markov duality, quantum Toda Hamiltonian, Macdonald processes (and limits to Whittaker, Jack, Hall–Littlewood, or Schur processes), stochastic vertex models, spin
About the authors
Ivan Corwin is professor of mathematics at Columbia University. The first paper he seriously read in his PhD was work of Tracy and Widom on the asymmetric simple exclusion process. He is honored to have the chance to memorialize Harold Widom by sharing some of his spectacular work with the mathematical community.
Percy Deift is professor of mathematics at the Courant Institute. He has been a student of Harold Widom’s seminal work on Toeplitz determinants and random matrix theory for many years. He is honored to coauthor this paper celebrating Widom’s outstanding contributions to mathematics and to analysis, in particular.
Alexander Its is professor of mathematics at Indiana University–Purdue University Indianapolis. A very substantial part of his research during the last 30+ years has been greatly influenced by Harold Widom’s fundamental work on analysis of Toeplitz operators and matrices and the numerous exciting applications of these works to mathematical physics and, specifically, to random matrices. He is honored to be a coauthor of the paper celebrating some of Harold’s classical results.