Skip to Main Content
Remote Access Theory of Probability and Mathematical Statistics

Theory of Probability and Mathematical Statistics

ISSN 1547-7363(online) ISSN 0094-9000(print)

   
 
 

 

A note on the prediction error of principal component regression in high dimensions


Authors: Laura Hucker and Martin Wahl
Journal: Theor. Probability and Math. Statist. 109 (2023), 37-53
MSC (2020): Primary 62H25
DOI: https://doi.org/10.1090/tpms/1196
Published electronically: October 3, 2023
Full-text PDF

Abstract | References | Similar Articles | Additional Information

Abstract: We analyze the prediction error of principal component regression (PCR) and prove high probability bounds for the corresponding squared risk conditional on the design. Our first main result shows that PCR performs comparably to the oracle method obtained by replacing empirical principal components by their population counterparts, provided that an effective rank condition holds. On the other hand, if the latter condition is violated, then empirical eigenvalues start to have a significant upward bias, resulting in a self-induced regularization of PCR. Our approach relies on the behavior of empirical eigenvalues, empirical eigenvectors and the excess risk of principal component analysis in high-dimensional regimes.


References [Enhancements On Off] (What's this?)

References

Similar Articles

Retrieve articles in Theory of Probability and Mathematical Statistics with MSC (2020): 62H25

Retrieve articles in all journals with MSC (2020): 62H25


Additional Information

Laura Hucker
Affiliation: Institut für Mathematik, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany
Email: huckerla@math.hu-berlin.de

Martin Wahl
Affiliation: Fakultät für Mathematik, Universität Bielefeld, Postfach 100131, 33615 Bielefeld, Germany
MR Author ID: 1054755
Email: martin.wahl@math.uni-bielefeld.de

Keywords: Principal component regression, prediction error, principal component analysis, excess risk, eigenvalue upward bias, benign overfitting
Received by editor(s): June 28, 2022
Accepted for publication: January 10, 2023
Published electronically: October 3, 2023
Additional Notes: The first author was supported by Deutsche Forschungsgemeinschaft (DFG) - FOR5381 - 460867398.
The research of the second author has been partially funded by Deutsche Forschungsgemeinschaft (DFG) - SFB1294 - 318763901.
Article copyright: © Copyright 2023 Taras Shevchenko National University of Kyiv