An alternating direction method of multipliers with a worst-case $O(1/n^2)$ convergence rate
HTML articles powered by AMS MathViewer
- by Wenyi Tian and Xiaoming Yuan;
- Math. Comp. 88 (2019), 1685-1713
- DOI: https://doi.org/10.1090/mcom/3388
- Published electronically: December 12, 2018
- HTML | PDF | Request permission
Abstract:
The alternating direction method of multipliers (ADMM) is being widely used for various convex programming models with separable structures arising in many scientific computing areas. The ADMM’s worst-case $O(1/n)$ convergence rate measured by the iteration complexity has been established in the literature when its penalty parameter is a constant, where $n$ is the iteration counter. Research on ADMM’s worst-case $O(1/n^2)$ convergence rate, however, is still in its infancy. In this paper, we suggest applying a rule proposed recently by Chambolle and Pock to iteratively update the penalty parameter and show that the ADMM with this adaptive penalty parameter has a worst-case $O(1/n^2)$ convergence rate in the ergodic sense. Without a strong convexity requirement on the objective function, our assumptions on the model are mild and can be satisfied by some representative applications. We test the least absolute shrinkage and selection operator (LASSO) model and numerically verify the significant acceleration effectiveness of the faster ADMM with a worst-case $O(1/n^2)$ convergence rate. Moreover, the faster ADMM is more favorable than the ADMM with a constant penalty parameter, as the former is much less sensitive to the initial value of the penalty parameter and can sometimes produce very high-accuracy solutions.References
- Heinz H. Bauschke and Patrick L. Combettes, Convex analysis and monotone operator theory in Hilbert spaces, CMS Books in Mathematics/Ouvrages de Mathématiques de la SMC, Springer, New York, 2011. With a foreword by Hédy Attouch. MR 2798533, DOI 10.1007/978-1-4419-9467-7
- Daniel Boley, Local linear convergence of the alternating direction method of multipliers on quadratic or linear programs, SIAM J. Optim. 23 (2013), no. 4, 2183–2207. MR 3123832, DOI 10.1137/120878951
- S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, Distributed optimization and statistical learning via the alternating direction method of multipliers, Found. Trends Mach. Learning 3 (2010), no. 1, 1–122.
- Antonin Chambolle and Thomas Pock, A first-order primal-dual algorithm for convex problems with applications to imaging, J. Math. Imaging Vision 40 (2011), no. 1, 120–145. MR 2782122, DOI 10.1007/s10851-010-0251-1
- Antonin Chambolle and Thomas Pock, On the ergodic convergence rates of a first-order primal-dual algorithm, Math. Program. 159 (2016), no. 1-2, Ser. A, 253–287. MR 3535925, DOI 10.1007/s10107-015-0957-3
- T. F. Chan and R. Glowinski, Finite element approximation and iterative solution of a class of mildly nonlinear elliptic equations, Technical Report, Computer Science Department, Stanford University, CA, 1978.
- Etienne Corman and Xiaoming Yuan, A generalized proximal point algorithm and its convergence rate, SIAM J. Optim. 24 (2014), no. 4, 1614–1638. MR 3268621, DOI 10.1137/130940402
- Damek Davis and Wotao Yin, Convergence rate analysis of several splitting schemes, Splitting methods in communication, imaging, science, and engineering, Sci. Comput., Springer, Cham, 2016, pp. 115–163. MR 3617562
- Damek Davis and Wotao Yin, Faster convergence rates of relaxed Peaceman-Rachford and ADMM under regularity assumptions, Math. Oper. Res. 42 (2017), no. 3, 783–805. MR 3685266, DOI 10.1287/moor.2016.0827
- David L. Donoho and Yaakov Tsaig, Fast solution of $l_1$-norm minimization problems when the solution may be sparse, IEEE Trans. Inform. Theory 54 (2008), no. 11, 4789–4812. MR 2589865, DOI 10.1109/TIT.2008.929958
- Jim Douglas Jr. and H. H. Rachford Jr., On the numerical solution of heat conduction problems in two and three space variables, Trans. Amer. Math. Soc. 82 (1956), 421–439. MR 84194, DOI 10.1090/S0002-9947-1956-0084194-4
- J. Eckstein, Some saddle-function splitting methods for convex programming, Optim. Methods Softw. 4 (1994), no. 1, 75–83.
- Jonathan Eckstein and Dimitri P. Bertsekas, On the Douglas-Rachford splitting method and the proximal point algorithm for maximal monotone operators, Math. Programming 55 (1992), no. 3, Ser. A, 293–318. MR 1168183, DOI 10.1007/BF01581204
- Jonathan Eckstein and Wang Yao, Understanding the convergence of the alternating direction method of multipliers: theoretical and computational perspectives, Pac. J. Optim. 11 (2015), no. 4, 619–644. MR 3420805
- Michel Fortin and Roland Glowinski, Augmented Lagrangian methods, Studies in Mathematics and its Applications, vol. 15, North-Holland Publishing Co., Amsterdam, 1983. Applications to the numerical solution of boundary value problems; Translated from the French by B. Hunt and D. C. Spicer. MR 724072
- D. Gabay and B. Mercier, A dual algorithm for the solution of nonlinear variational problems via finite element approximation, Comput. Math. Appl. 2 (1976), no. 1, 17–40.
- Roland Glowinski, On alternating direction methods of multipliers: a historical perspective, Modeling, simulation and optimization for science and technology, Comput. Methods Appl. Sci., vol. 34, Springer, Dordrecht, 2014, pp. 59–82. MR 3330832, DOI 10.1007/978-94-017-9054-3_{4}
- R. Glowinski and A. Marrocco, Sur l’approximation, par éléments finis d’ordre un, et la résolution, par pénalisation-dualité, d’une classe de problèmes de Dirichlet non linéaires, Rev. Française Automat. Informat. Recherche Opérationnelle Sér. Rouge Anal. Numér. 9 (1975), no. R-2, 41–76 (French, with English summary). MR 388811
- Tom Goldstein, Brendan O’Donoghue, Simon Setzer, and Richard Baraniuk, Fast alternating direction optimization methods, SIAM J. Imaging Sci. 7 (2014), no. 3, 1588–1623. MR 3240850, DOI 10.1137/120896219
- Deren Han and Xiaoming Yuan, Local linear convergence of the alternating direction method of multipliers for quadratic programs, SIAM J. Numer. Anal. 51 (2013), no. 6, 3446–3457. MR 3143838, DOI 10.1137/120886753
- Bingsheng He, Li-Zhi Liao, Deren Han, and Hai Yang, A new inexact alternating directions method for monotone variational inequalities, Math. Program. 92 (2002), no. 1, Ser. A, 103–118. MR 1892298, DOI 10.1007/s101070100280
- Bingsheng He, Yanfei You, and Xiaoming Yuan, On the convergence of primal-dual hybrid gradient algorithm, SIAM J. Imaging Sci. 7 (2014), no. 4, 2526–2537. MR 3284566, DOI 10.1137/140963467
- Bingsheng He and Xiaoming Yuan, Convergence analysis of primal-dual algorithms for a saddle-point problem: from contraction perspective, SIAM J. Imaging Sci. 5 (2012), no. 1, 119–149. MR 2902659, DOI 10.1137/100814494
- Bingsheng He and Xiaoming Yuan, On the $O(1/n)$ convergence rate of the Douglas-Rachford alternating direction method, SIAM J. Numer. Anal. 50 (2012), no. 2, 700–709. MR 2914282, DOI 10.1137/110836936
- Bingsheng He and Xiaoming Yuan, On non-ergodic convergence rate of Douglas-Rachford alternating direction method of multipliers, Numer. Math. 130 (2015), no. 3, 567–577. MR 3347463, DOI 10.1007/s00211-014-0673-6
- Magnus R. Hestenes, Multiplier and gradient methods, J. Optim. Theory Appl. 4 (1969), 303–320. MR 271809, DOI 10.1007/BF00927673
- P.-L. Lions and B. Mercier, Splitting algorithms for the sum of two nonlinear operators, SIAM J. Numer. Anal. 16 (1979), no. 6, 964–979. MR 551319, DOI 10.1137/0716071
- B. Martinet, Régularisation d’inéquations variationnelles par approximations successives, Rev. Française Informat. Recherche Opérationnelle 4 (1970), no. Sér, Sér. R-3, 154–158 (French). MR 298899
- Renato D. C. Monteiro and Benar F. Svaiter, Iteration-complexity of block-decomposition algorithms and the alternating direction method of multipliers, SIAM J. Optim. 23 (2013), no. 1, 475–507. MR 3033116, DOI 10.1137/110849468
- Jean-Jacques Moreau, Proximité et dualité dans un espace hilbertien, Bull. Soc. Math. France 93 (1965), 273–299 (French). MR 201952
- Arkadi Nemirovski, Prox-method with rate of convergence $O(1/t)$ for variational inequalities with Lipschitz continuous monotone operators and smooth convex-concave saddle point problems, SIAM J. Optim. 15 (2004), no. 1, 229–251. MR 2112984, DOI 10.1137/S1052623403425629
- Yu. Nesterov, Gradient methods for minimizing composite functions, Math. Program. 140 (2013), no. 1, Ser. B, 125–161. MR 3071865, DOI 10.1007/s10107-012-0629-5
- Yu. E. Nesterov, A method for solving the convex programming problem with convergence rate $O(1/k^{2})$, Dokl. Akad. Nauk SSSR 269 (1983), no. 3, 543–547 (Russian). MR 701288
- M. J. D. Powell, A method for nonlinear constraints in minimization problems, Optimization (Sympos., Univ. Keele, Keele, 1968) Academic Press, London-New York, 1969, pp. 283–298. MR 272403
- R. Tyrrell Rockafellar, Convex analysis, Princeton Landmarks in Mathematics, Princeton University Press, Princeton, NJ, 1997. Reprint of the 1970 original; Princeton Paperbacks. MR 1451876
- R. Shefi, Rate of convergence analysis for convex nonsmooth optimization algorithms, PhD Thesis, Tel Aviv University, Israel, 2015.
- Min Tao and Xiaoming Yuan, On the optimal linear convergence rate of a generalized proximal point algorithm, J. Sci. Comput. 74 (2018), no. 2, 826–850. MR 3761279, DOI 10.1007/s10915-017-0477-9
- WenYi Tian and Xiaoming Yuan, Convergence analysis of primal-dual based methods for total variation minimization with finite element approximation, J. Sci. Comput. 76 (2018), no. 1, 243–274. MR 3812969, DOI 10.1007/s10915-017-0623-4
- Robert Tibshirani, Regression shrinkage and selection via the lasso, J. Roy. Statist. Soc. Ser. B 58 (1996), no. 1, 267–288. MR 1379242
- Xiangfeng Wang and Xiaoming Yuan, The linearized alternating direction method of multipliers for Dantzig selector, SIAM J. Sci. Comput. 34 (2012), no. 5, A2792–A2811. MR 3023726, DOI 10.1137/110833543
- Junfeng Yang and Xiaoming Yuan, Linearized augmented Lagrangian and alternating direction methods for nuclear norm minimization, Math. Comp. 82 (2013), no. 281, 301–329. MR 2983026, DOI 10.1090/S0025-5718-2012-02598-1
- K. Yang, Z. P. Cai, J. Z. Li, and G. H. Lin, A stable gene selection in microarray data analysis, BMC Bioinformatics 7 (2006), no. 1, 228.
- Xiaoqun Zhang, Martin Burger, and Stanley Osher, A unified primal-dual algorithm framework based on Bregman iteration, J. Sci. Comput. 46 (2011), no. 1, 20–46. MR 2753250, DOI 10.1007/s10915-010-9408-8
Bibliographic Information
- Wenyi Tian
- Affiliation: Center for Applied Mathematics, Tianjin University, Tianjin 300072, People’s Republic of China
- MR Author ID: 999800
- Email: twymath@gmail.com
- Xiaoming Yuan
- Affiliation: Department of Mathematics, The University of Hong Kong, Hong Kong, People’s Republic of China
- MR Author ID: 729439
- Email: xmyuan@hku.hk
- Received by editor(s): July 20, 2016
- Received by editor(s) in revised form: September 3, 2017, May 9, 2018, and June 8, 2018
- Published electronically: December 12, 2018
- Additional Notes: The first author was partially supported by the National Natural Science Foundation of China (No. 11701416).
The second author was partially supported by the General Research Fund from Hong Kong Research Grants Council: 12313516. - © Copyright 2018 American Mathematical Society
- Journal: Math. Comp. 88 (2019), 1685-1713
- MSC (2010): Primary 90C25, 65K10
- DOI: https://doi.org/10.1090/mcom/3388
- MathSciNet review: 3925481