Data Dependent Concentration Bounds for Sequential Prediction Algorithms

Zhang, Tong

doi:10.1007/11503415_12

Tong Zhang²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3559))

Included in the following conference series:

International Conference on Computational Learning Theory

3517 Accesses
11 Citations

Abstract

We investigate the generalization behavior of sequential prediction (online) algorithms, when data are generated from a probability distribution. Using some newly developed probability inequalities, we are able to bound the total generalization performance of a learning algorithm in terms of its observed total loss. Consequences of this analysis will be illustrated with examples.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Azuma, K.: Weighted sums of certain dependent random variables. Tohoku Math. Journal 3, 357–367 (1967)
Article MathSciNet Google Scholar
Bennett, G.: Probability inequalities for the sum of independent random variables. Journal of the American Statistical Association 57, 33–45 (1962)
Article MATH Google Scholar
Blum, A., Kalai, A., Langford, J.: Beating the hold-out: Bounds for k-fold and progressive cross-validation. In: COLT 1999, pp. 203–208 (1999)
Google Scholar
Cesa-Bianchi, N., Conconi, A., Gentile, C.: On the generalization ability of on-line learning algorithms. IEEE Transactions on Information Theory, 2050–2057 (2004)
Google Scholar
Collins, M.: Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In: Proc. EMNLP 2000 (2002)
Google Scholar
de la Pẽna V.H.: A general class of exponential inequalities for martingales and ratios. The Annals of Probability 27, 537–564 (1999)
Google Scholar
Freedman, D.A.: On tail probabilities for martingales. The Annals of Probability 3, 100–118 (1975)
Article MATH Google Scholar
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)
Article MATH MathSciNet Google Scholar
Hoeffding, W.: Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association 58(301), 13–30 (1963)
Article MATH MathSciNet Google Scholar
Littlestone, N.: From on-line to batch learning. In: COLT 1989, pp. 269–284 (1989)
Google Scholar
Littlestone, N., Warmuth, M.K.: The weighted majority algorithm. Information and Computation 108, 212–261 (1994)
Article MATH MathSciNet Google Scholar
Vovk, V.: Aggregating strategies. In: COLT 1990, pp. 371–383 (1990)
Google Scholar

Download references

Author information

Authors and Affiliations

IBM T.J. Watson Research Center, Yorktown Heights, NY, 10598, USA
Tong Zhang

Authors

Tong Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Leoben, A-8700, Leoben, Austria
Peter Auer
Department of Electrical Engineering, Technion, P.O. Box, 3200, Haifa, Israel
Ron Meir

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, T. (2005). Data Dependent Concentration Bounds for Sequential Prediction Algorithms. In: Auer, P., Meir, R. (eds) Learning Theory. COLT 2005. Lecture Notes in Computer Science(), vol 3559. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11503415_12

Download citation

DOI: https://doi.org/10.1007/11503415_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26556-6
Online ISBN: 978-3-540-31892-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics