Open AccessPosted Content
Machine Learning as Statistical Data Assimilation.
TL;DR: A strong equivalence is identified between neural network based machine learning methods and the formulation of statistical data assimilation, known to be a problem in statistical physics, and this provides a design method for optimal networks for ML applications.
read more
Abstract: We identify a strong equivalence between neural network based machine learning (ML) methods and the formulation of statistical data assimilation (DA), known to be a problem in statistical physics. DA, as used widely in physical and biological sciences, systematically transfers information in observations to a model of the processes producing the observations. The correspondence is that layer label in the ML setting is the analog of time in the data assimilation setting. Utilizing aspects of this equivalence we discuss how to establish the global minimum of the cost functions in the ML context, using a variational annealing method from DA. This provides a design method for optimal networks for ML applications and may serve as the basis for understanding the success of "deep learning". Results from an ML example are presented.
When the layer label is taken to be continuous, the Euler-Lagrange equation for the ML optimization problem is an ordinary differential equation, and we see that the problem being solved is a two point boundary value problem. The use of continuous layers is denoted "deepest learning". The Hamiltonian version provides a direct rationale for back propagation as a solution method for the canonical momentum; however, it suggests other solution methods are to be preferred.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Machine Learning of Time Series Using Time-Delay Embedding and Precision Annealing
TL;DR: Two techniques used in statistical data assimilation are borrowed: time-delay embedding to prepare input data and precision annealing as a training method to identify the number of training pairs required to produce good generalizations for the time series.
9
•Posted Content
Machine Learning of Time Series Using Time-delay Embedding and Precision Annealing
TL;DR: In this paper, the equivalence between statistical data assimilation and supervised machine learning is used to predict segments of a time series using a precision annealing approach to identify the global minimum of the action.
Marine data assimilation in the UK: the past, the present and the vision for the future
Jozef Skákala,David Ford,Keith Haines,Amos S. Lawless,Matthew J. Martin,Philip Browne,Marcin Chrust,Stefano Ciavatta,Alison Fowler,Daniel Lea,Matthew D. Palmer,Andrea Rochner,Jennifer Waters,Hao Zuo,Michael Bell,D. M. Carneiro,Yumeng Chen,Susan Kay,Dale Partridge,Martin F. Price,Richard Renshaw,Georgy Shapiro,James While +22 more
- 20 Jun 2024
TL;DR: The UK has been a leading force in marine data assimilation (MDA) research, with significant advancements in the past two decades. Future trends include rapid growth of machine learning and digital twin applications, increased computational power, and the need for integrated approaches.
References
Deep learning
TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
67K
Finding Structure in Time
TL;DR: A proposal along these lines first described by Jordan (1986) which involves the use of recurrent links in order to provide networks with a dynamic memory and suggests a method for representing lexical categories and the type/token distinction is developed.
11.8K
Dermatologist-level classification of skin cancer with deep neural networks
Andre Esteva,Brett Kuprel,Roberto A. Novoa,Justin M. Ko,Susan M. Swetter,Susan M. Swetter,Helen M. Blau,Sebastian Thrun +7 more
TL;DR: This work demonstrates an artificial intelligence capable of classifying skin cancer with a level of competence comparable to dermatologists, trained end-to-end from images directly, using only pixels and disease labels as inputs.
11.8K
Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs
Varun Gulshan,Lily Peng,Marc Coram,Martin C. Stumpe,Derek Wu,Arunachalam Narayanaswamy,Subhashini Venugopalan,Kasumi Widner,Tom Madams,Jorge Cuadros,Ramasamy Kim,Rajiv Raman,Philip C. Nelson,Jessica L. Mega,Dale R. Webster +14 more
TL;DR: An algorithm based on deep machine learning had high sensitivity and specificity for detecting referable diabetic retinopathy and diabetic macular edema in retinal fundus photographs from adults with diabetes.
A limited memory algorithm for bound constrained optimization
TL;DR: An algorithm for solving large nonlinear optimization problems with simple bounds is described, based on the gradient projection method and uses a limited memory BFGS matrix to approximate the Hessian of the objective function.