The conjugate gradient algorithm on well-conditioned Wishart matrices is almost deterministic

Deift, Percy; Trogdon, Thomas

doi:10.1090/qam/1574

Authors: Percy Deift and Thomas Trogdon
Journal: Quart. Appl. Math. 79 (2021), 125-161
MSC (2010): Primary 65F10, 60B20
DOI: https://doi.org/10.1090/qam/1574
Published electronically: July 9, 2020
MathSciNet review: 4188626
Full-text PDF

Abstract | References | Similar Articles | Additional Information

Abstract: We prove that the number of iterations required to solve a random positive definite linear system with the conjugate gradient algorithm is almost deterministic for large matrices. We treat the case of Wishart matrices $W = XX^*$ where $X$ is $n \times m$ and $n/m \sim d$ for $0 < d < 1$. Precisely, we prove that for most choices of error tolerance, as the matrix increases in size, the probability that the iteration count deviates from an explicit deterministic value tends to zero. In addition, for a fixed iteration count, we show that the norm of the error vector and the norm of the residual converge exponentially fast in probability, converge in mean, and converge almost surely.

References

Mark J. Ablowitz and Athanassios S. Fokas, Complex variables: introduction and applications, 2nd ed., Cambridge Texts in Applied Mathematics, Cambridge University Press, Cambridge, 2003. MR 1989049
Bernhard Beckermann and Arno B. J. Kuijlaars, Superlinear convergence of conjugate gradients, SIAM J. Numer. Anal. 39 (2001), no. 1, 300–329. MR 1860727, DOI https://doi.org/10.1137/S0036142999363188
Alex Bloemendal, Antti Knowles, Horng-Tzer Yau, and Jun Yin, On the principal components of sample covariance matrices, Probab. Theory Related Fields 164 (2016), no. 1-2, 459–552. MR 3449395, DOI https://doi.org/10.1007/s00440-015-0616-x
Z. D. Bai, B. Q. Miao, and G. M. Pan, On asymptotics of eigenvectors of large sample covariance matrix, Ann. Probab. 35 (2007), no. 4, 1532–1572. MR 2330979, DOI https://doi.org/10.1214/009117906000001079
Zhidong Bai and Jack W. Silverstein, Spectral analysis of large dimensional random matrices, 2nd ed., Springer Series in Statistics, Springer, New York, 2010. MR 2567175
Z. D. Bai and Y. Q. Yin, Limit of the smallest eigenvalue of a large-dimensional sample covariance matrix, Ann. Probab. 21 (1993), no. 3, 1275–1294. MR 1235416
Ioana Dumitriu and Alan Edelman, Matrix models for beta ensembles, J. Math. Phys. 43 (2002), no. 11, 5830–5847. MR 1936554, DOI https://doi.org/10.1063/1.1507823
P. Deift, L. C. Li, and C. Tomei, Toda flows with infinitely many variables, J. Funct. Anal. 64 (1985), no. 3, 358–402. MR 813206, DOI https://doi.org/10.1016/0022-1236%2885%2990065-5
Percy A. Deift, Govind Menon, Sheehan Olver, and Thomas Trogdon, Universality in numerical computations with random data, Proc. Natl. Acad. Sci. USA 111 (2014), no. 42, 14973–14978. MR 3276499, DOI https://doi.org/10.1073/pnas.1413446111
Percy A. Deift, Thomas Trogdon, and Govind Menon, On the condition number of the critically-scaled Laguerre unitary ensemble, Discrete Contin. Dyn. Syst. 36 (2016), no. 8, 4287–4347. MR 3479516, DOI https://doi.org/10.3934/dcds.2016.36.4287
Kenneth R. Davidson and Stanislaw J. Szarek, Local operator theory, random matrices and Banach spaces, Handbook of the geometry of Banach spaces, Vol. I, North-Holland, Amsterdam, 2001, pp. 317–366. MR 1863696, DOI https://doi.org/10.1016/S1874-5849%2801%2980010-3
Alan Edelman and Brian D. Sutton, Tails of condition number distributions, SIAM J. Matrix Anal. Appl. 27 (2005), no. 2, 547–560. MR 2179688, DOI https://doi.org/10.1137/040614256
Gerald B. Folland, Real analysis, 2nd ed., Pure and Applied Mathematics (New York), John Wiley & Sons, Inc., New York, 1999. Modern techniques and their applications; A Wiley-Interscience Publication. MR 1681462
P. J. Forrester, Log-gases and random matrices, London Mathematical Society Monographs Series, vol. 34, Princeton University Press, Princeton, NJ, 2010. MR 2641363
A. Greenbaum, Behavior of slightly perturbed Lanczos and conjugate-gradient recurrences, Linear Algebra Appl. 113 (1989), 7–63. MR 978581, DOI https://doi.org/10.1016/0024-3795%2889%2990285-1
Herman H. Goldstine and John von Neumann, Numerical inverting of matrices of high order. II, Proc. Amer. Math. Soc. 2 (1951), 188–202. MR 41539, DOI https://doi.org/10.1090/S0002-9939-1951-0041539-X
A. Guionnet and O. Zeitouni, Concentration of the spectral measure for large matrices, Electron. Comm. Probab. 5 (2000), 119–136. MR 1781846, DOI https://doi.org/10.1214/ECP.v5-1026
Magnus R. Hestenes and Eduard Stiefel, Methods of conjugate gradients for solving linear systems, J. Research Nat. Bur. Standards 49 (1952), 409–436 (1953). MR 0060307
Arno B. J. Kuijlaars, Convergence analysis of Krylov subspace iterations with methods from potential theory, SIAM Rev. 48 (2006), no. 1, 3–40. MR 2219308, DOI https://doi.org/10.1137/S0036144504445376
A. Lytova and L. Pastur, Central limit theorem for linear eigenvalue statistics of random matrices with independent entries, Ann. Probab. 37 (2009), no. 5, 1778–1840. MR 2561434, DOI https://doi.org/10.1214/09-AOP452
J. C. Mason and D. C. Handscomb, Chebyshev polynomials, Chapman & Hall/CRC, Boca Raton, FL, 2003. MR 1937591
Govind Menon and Thomas Trogdon, Smoothed analysis for the conjugate gradient algorithm, SIGMA Symmetry Integrability Geom. Methods Appl. 12 (2016), Paper No. 109, 22. MR 3568010, DOI https://doi.org/10.3842/SIGMA.2016.109

E. Paquette and T. Trogdon, Universality for the conjugate gradient and MINRES algorithms on sample covariance matrices, arXiv:2007.00640, 1–42, 2020.

Natesh S. Pillai and Jun Yin, Universality of covariance matrices, Ann. Appl. Probab. 24 (2014), no. 3, 935–1001. MR 3199978, DOI https://doi.org/10.1214/13-AAP939

Daniel Spielman and Shang-Hua Teng, Smoothed analysis of algorithms: why the simplex algorithm usually takes polynomial time, Proceedings of the Thirty-Third Annual ACM Symposium on Theory of Computing, ACM, New York, 2001, pp. 296–305. MR 2120328, DOI https://doi.org/10.1145/380752.380813

Lloyd N. Trefethen and David Bau III, Numerical linear algebra, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 1997. MR 1444820

Roman Vershynin, High-dimensional probability, Cambridge Series in Statistical and Probabilistic Mathematics, vol. 47, Cambridge University Press, Cambridge, 2018. An introduction with applications in data science; With a foreword by Sara van de Geer. MR 3837109

J Wishart, The generalised product moment distribution in samples from a normal multivariate population, Biometrika 20A (1928), no. 1-2, 32–52.