Machine learning of continuous and discrete variational ODEs with convergence guarantee and uncertainty quantification
HTML articles powered by AMS MathViewer
- by Christian Offen;
- Math. Comp.
- DOI: https://doi.org/10.1090/mcom/4120
- Published electronically: June 26, 2025
Abstract:
The article introduces a method to learn dynamical systems that are governed by Euler–Lagrange equations from data. The method is based on Gaussian process regression and identifies continuous or discrete Lagrangians and is, therefore, structure preserving by design. A rigorous proof of convergence as the distance between observation data points converges to zero and lower bounds for convergence rates are provided. Next to convergence guarantees, the method allows for quantification of model uncertainty, which can provide a basis of adaptive sampling techniques. We provide efficient uncertainty quantification of any observable that is linear in the Lagrangian, including of Hamiltonian functions (energy) and symplectic structures, which is of interest in the context of system identification. The article overcomes major practical and theoretical difficulties related to the ill-posedness of the identification task of (discrete) Lagrangians through a careful design of geometric regularisation strategies and through an exploit of a relation to convex minimisation problems in reproducing kernel Hilbert spaces.References
- Robert A. Adams and John J. F. Fournier, Sobolev spaces, 2nd ed., Pure and Applied Mathematics (Amsterdam), vol. 140, Elsevier/Academic Press, Amsterdam, 2003. MR 2424078
- T. Aoshima, T. Matsubara, and T. Yaguchi, Deep discrete-time Lagrangian mechanics, ICLR SimDL, vol. 49, 2021, pp. 1–8.
- Rémi Arcangéli, María Cruz López de Silanes, and Juan José Torrens, An extension of a bound for functions in Sobolev spaces, with applications to $(m,s)$-spline interpolation and smoothing, Numer. Math. 107 (2007), no. 2, 181–211. MR 2328845, DOI 10.1007/s00211-007-0092-z
- V. I. Arnold, Mathematical Methods of Classical Mechanics, Graduate Texts in Mathematics, Springer New York, New York, NY, 1989, DOI 10.1007/978-1-4757-2063-1.
- Victor Barwell and Alan George, A comparison of algorithms for solving symmetric indefinite systems of linear equations, ACM Trans. Math. Software 2 (1976), no. 3, 242–251. MR 418432, DOI 10.1145/355694.355697
- Pau Batlle, Yifan Chen, Bamdad Hosseini, Houman Owhadi, and Andrew M. Stuart, Error analysis of kernel/GP methods for nonlinear and parametric PDEs, J. Comput. Phys. 520 (2025), Paper No. 113488, 23. MR 4813995, DOI 10.1016/j.jcp.2024.113488
- Tom Bertalan, Felix Dietrich, Igor Mezić, and Ioannis G. Kevrekidis, On learning Hamiltonian systems from data, Chaos 29 (2019), no. 12, 121107, 9. MR 4045468, DOI 10.1063/1.5128231
- Jeff Bezanson, Alan Edelman, Stefan Karpinski, and Viral B. Shah, Julia: a fresh approach to numerical computing, SIAM Rev. 59 (2017), no. 1, 65–98. MR 3605826, DOI 10.1137/141000671
- James R. Bunch and Linda Kaufman, Some stable methods for calculating inertia and solving symmetric linear systems, Math. Comp. 31 (1977), no. 137, 163–179. MR 428694, DOI 10.1090/S0025-5718-1977-0428694-0
- José F. Cariñena and Luis A. Ibort, Non-Noether constants of motion, J. Phys. A 16 (1983), no. 1, 1–7. MR 700177
- R. Chen and M. Tao, Data-driven prediction of general Hamiltonian dynamics via learning exactly-symplectic maps, Proceedings of the 38th International Conference on Machine Learning, Proceedings of Machine Learning Research, vol. 139, PMLR, 18–24 July 2021, pp. 1717–1727.
- Yifan Chen, Bamdad Hosseini, Houman Owhadi, and Andrew M. Stuart, Solving and learning nonlinear PDEs with Gaussian processes, J. Comput. Phys. 447 (2021), Paper No. 110668, 29. MR 4311012, DOI 10.1016/j.jcp.2021.110668
- Y. Chen, B. Xu, T. Matsubara, and T. Yaguchi, Variational principle and variational integrators for neural symplectic forms, ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, 2023.
- A. Christmann and I. Steinwart, Kernels and reproducing kernel Hilbert spaces, Springer New York, New York, NY, 2008, pp. 110–163.
- N. Da Costa, M. Pförtner, L. Da Costa, and P. Hennig, Sample path regularity of Gaussian processes from the covariance kernel, 2024, arXiv:2312.14886.
- M. Cranmer, S. Greydanus, S. Hoyer, P. Battaglia, D. Spergel, and S. Ho, Lagrangian neural networks, 2020, arXiv:2003.04630.
- Marco David and Florian Méhats, Symplectic learning for Hamiltonian neural networks, J. Comput. Phys. 494 (2023), Paper No. 112495, 13. MR 4652800, DOI 10.1016/j.jcp.2023.112495
- Eva Dierkes, Christian Offen, Sina Ober-Blöbaum, and Kathrin Flaßkamp, Hamiltonian neural networks with automatic symmetry detection, Chaos 33 (2023), no. 6, Paper No. 063115, 12. MR 4596708, DOI 10.1063/5.0142969
- Joseph Diestel, Sequences and series in Banach spaces, Graduate Texts in Mathematics, vol. 92, Springer-Verlag, New York, 1984. MR 737004, DOI 10.1007/978-1-4612-5200-9
- Morris L. Eaton, Multivariate statistics, Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics, John Wiley & Sons, Inc., New York, 1983. A vector space approach. MR 716321
- Sølve Eidnes and Kjetil Olsen Lye, Pseudo-Hamiltonian neural networks for learning partial differential equations, J. Comput. Phys. 500 (2024), Paper No. 112738, 25. MR 4685886, DOI 10.1016/j.jcp.2023.112738
- G. Evangelisti and S. Hirche, Physically consistent learning of conservative Lagrangian systems with Gaussian processes, 2022 IEEE 61st Conference on Decision and Control (CDC), IEEE, 2022.
- Jinchao Feng, Charles Kulick, Yunxiang Ren, and Sui Tang, Learning particle swarming models from data with Gaussian processes, Math. Comp. 93 (2024), no. 349, 2391–2437. MR 4759379, DOI 10.1090/mcom/3915
- I. M. Gelfand, S. V. Fomin, and R. A. Silverman, Calculus of variations, Dover Books on Mathematics, Dover Publications, 2000.
- S. Greydanus, M. Dzamba, and J. Yosinski, Hamiltonian neural networks, Advances in Neural Information Processing Systems, vol. 32, Curran Associates, Inc., 2019.
- Michael Hardy, Combinatorics of partial derivatives, Electron. J. Combin. 13 (2006), no. 1, Research Paper 1, 13. MR 2200529, DOI 10.37236/1027
- Marc Henneaux, Equations of motion, commutation relations and ambiguities in the Lagrangian formalism, Ann. Physics 140 (1982), no. 1, 45–64. MR 660925, DOI 10.1016/0003-4916(82)90334-7
- J. Hu, J.-P. Ortega, and D. Yin, A structure-preserving kernel method for learning Hamiltonian systems, Mathematics of Computation, 2025, DOI 10.1090/mcom/4106.
- P. Jin, Z. Zhang, A. Zhu, Y. Tang, and G. E. Karniadakis, SympNets: intrinsic structure-preserving symplectic networks for identifying Hamiltonian systems, Neural Net. 132 (2020), 166–179.
- Y. Lishkova, P. Scherer, S. Ridderbusch, M. Jamnik, P. Liò, S. Ober-Blöbaum, and C. Offen, Discrete Lagrangian neural networks with automatic symmetry discovery, IFAC-PapersOnLine 56 (2023), no. 2, 3203–3210, 22nd IFAC World Congress.
- Fei Lu, Ming Zhong, Sui Tang, and Mauro Maggioni, Nonparametric inference of interaction laws in systems of agents from trajectory data, Proc. Natl. Acad. Sci. USA 116 (2019), no. 29, 14424–14433. MR 3984488, DOI 10.1073/pnas.1822012116
- G. Marmo and G. Morandi, On the inverse problem with symmetries, and the appearance of cohomologies in classical Lagrangian dynamics, Rep. Math. Phys. 28 (1989), no. 3, 389–410. MR 1136761, DOI 10.1016/0034-4877(89)90071-2
- G. Marmo and C. Rubano, On the uniqueness of the Lagrangian description for charged particles in external magnetic field, Il Nuovo Cimento A 98 (1987), no. 4, 387–399.
- Jerrold E. Marsden and Tudor S. Ratiu, Introduction to mechanics and symmetry, 2nd ed., Texts in Applied Mathematics, vol. 17, Springer-Verlag, New York, 1999. A basic exposition of classical mechanical systems. MR 1723696, DOI 10.1007/978-0-387-21792-5
- J. E. Marsden and M. West, Discrete mechanics and variational integrators, Acta Numer. 10 (2001), 357–514. MR 2009697, DOI 10.1017/S096249290100006X
- Francis J. Narcowich, Joseph D. Ward, and Holger Wendland, Sobolev bounds on functions with scattered zeros, with applications to radial basis function surface fitting, Math. Comp. 74 (2005), no. 250, 743–763. MR 2114646, DOI 10.1090/S0025-5718-04-01708-9
- Sina Ober-Blöbaum and Christian Offen, Variational learning of Euler-Lagrange dynamics from data, J. Comput. Appl. Math. 421 (2023), Paper No. 114780, 18. MR 4496876, DOI 10.1016/j.cam.2022.114780
- C. Offen, Machine learning of discrete field theories with guaranteed convergence and uncertainty quantification, 2024, arXiv:2407.07642.
- C. Offen, Software: Christian-Offen/Lagrangian_GP: initial release of GitHub repository, 2024, DOI 10.5281/zenodo.11093645.
- C. Offen and S. Ober-Blöbaum, Symplectic integration of learned Hamiltonian systems, Chaos 32 (2022), no. 1, Paper No. 013122, 13. MR 4363449, DOI 10.1063/5.0065913
- Christian Offen and Sina Ober-Blöbaum, Learning discrete Lagrangians for variational PDEs from data and detection of travelling waves, Geometric science of information. Part I, Lecture Notes in Comput. Sci., vol. 14071, Springer, Cham, [2023] ©2023, pp. 569–579. MR 4638297, DOI 10.1007/978-3-031-38271-0_{5}7
- C. Offen and S. Ober-Blöbaum, Learning of discrete models of variational PDEs from data, Chaos 34 (2024), 013104.
- Juan-Pablo Ortega and Daiying Yin, Learnability of linear port-Hamiltonian systems, J. Mach. Learn. Res. 25 (2024), Paper No. [68], 56. MR 4749104
- Houman Owhadi and Clint Scovel, Operator-adapted wavelets, fast solvers, and numerical homogenization, Cambridge Monographs on Applied and Computational Mathematics, vol. 35, Cambridge University Press, Cambridge, 2019. From a game theoretic approach to numerical approximation and algorithm design. MR 3971243
- M. Pförtner, I. Steinwart, P. Hennig, and J. Wenger, Physics-informed Gaussian process regression generalizes linear PDE solvers, 2024, arXiv:2212.12474.
- H. Qin, Machine learning and serving of discrete field theories, Sci. Rep. 10 (2020), no. 1, 19329.
- Joaquin Quiñonero-Candela and Carl Edward Rasmussen, A unifying view of sparse approximate Gaussian process regression, J. Mach. Learn. Res. 6 (2005), 1939–1959. MR 2249877
- C. Rackauckas and Q. Nie, Differentialequations.jl–a performant and feature-rich ecosystem for solving differential equations in Julia, J. Open Res. Softw. 5 (2017), no. 1, 15.
- Carl Edward Rasmussen and Christopher K. I. Williams, Gaussian processes for machine learning, Adaptive Computation and Machine Learning, MIT Press, Cambridge, MA, 2006. MR 2514435
- Katharina Rath, Christopher G. Albert, Bernd Bischl, and Udo von Toussaint, Symplectic Gaussian process regression of maps in Hamiltonian systems, Chaos 31 (2021), no. 5, Paper No. 053121, 24. MR 4259865, DOI 10.1063/5.0048129
- T. Roubíček, Calculus of Variations, John Wiley & Sons Ltd., 2015, pp. 1–38.
- Walter Rudin, Functional analysis, 2nd ed., International Series in Pure and Applied Mathematics, McGraw-Hill, Inc., New York, 1991. MR 1157815
- Robert Schaback and Holger Wendland, Kernel techniques: from machine learning to meshless methods, Acta Numer. 15 (2006), 543–639. MR 2269747, DOI 10.1017/S0962492906270016
- Florian Schäfer, Matthias Katzfuss, and Houman Owhadi, Sparse Cholesky factorization by Kullback-Leibler minimization, SIAM J. Sci. Comput. 43 (2021), no. 3, A2019–A2046. MR 4267493, DOI 10.1137/20M1336254
- Florian Schäfer, T. J. Sullivan, and Houman Owhadi, Compression, inversion, and approximate PCA of dense kernel matrices at near-linear computational complexity, Multiscale Model. Simul. 19 (2021), no. 2, 688–730. MR 4243658, DOI 10.1137/19M129526X
- M. Titsias, Variational learning of inducing variables in sparse Gaussian processes, Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics (Hilton Clearwater Beach Resort, Clearwater Beach, Florida), Proceedings of Machine Learning Research, vol. 5, PMLR, 16–18 April 2009, pp. 567–574.
- Mats Vermeeren, Modified equations for variational integrators, Numer. Math. 137 (2017), no. 4, 1001–1037. MR 3719050, DOI 10.1007/s00211-017-0896-4
- Holger Wendland and Christian Rieger, Approximate interpolation with applications to selecting smoothing parameters, Numer. Math. 101 (2005), no. 4, 729–748. MR 2195325, DOI 10.1007/s00211-005-0637-y
Bibliographic Information
- Christian Offen
- Affiliation: Department of Mathematics, Paderborn University, Warburger Str. 100, 33098 Paderborn, Germany
- MR Author ID: 1275998
- ORCID: 0000-0002-5940-8057
- Email: christian.offen@uni-paderborn.de
- Received by editor(s): April 30, 2024
- Received by editor(s) in revised form: February 19, 2025, and May 2, 2025
- Published electronically: June 26, 2025
- © Copyright 2025 by the author
- Journal: Math. Comp.
- MSC (2020): Primary 34A55; Secondary 35R30, 65L09, 37J99, 70F17
- DOI: https://doi.org/10.1090/mcom/4120