Simulation-based inference as a paradigm for scientific machine learning in the cryosphere and beyond
<jats:p>Mechanistic or dynamical models based on governing equations are ubiquitous throughout science and engineering. Such models, also referred to as &#8220;simulators&#8221;, are typically characterized by a forward mapping from some set of inputs or parameters to one or more output quantities of interest. In many cases, the inputs required for the forward model are either unknown or represent approximations of unresolved processes. Bayesian inference provides a natural framework for constraining this model uncertainty using observed data. Such a framework is especially valuable in cryospheric application domains such as permafrost research, where direct observations of many quantities of interest, e.g. subsurface temperature and soil moisture, are only sparsely available. Mechanistic models based on known physics therefore play an indispensable role in filling these gaps. However, virtually all methods for Bayesian inference require repeated evaluation of the forward model which is often computationally challenging, especially for dynamical systems. As a result, computational requirements of statistical inference very quickly become intractable even for systems of only moderate complexity. The burgeoning field of &#8220;simulation-based inference&#8221; (SBI) aims to leverage modern computational methods from machine learning (ML) and data assimilation (DA) to overcome these challenges and facilitate large scale uncertainty quantification in complex scientific models. In this work, we show how SBI can be seen as a unifying theoretical framework that bridges the gap between existing DA methods (e.g. variants of the ensemble Kalman filter) and full-fledged Bayesian inference with the goal of facilitating hybrid statistical-physical modeling of complex systems. We present a novel set of software tools for making SBI more accessible to researchers along with benchmarks of several methods drawn from both the ML and DA literature. Two of these benchmarks are based on use cases from Arctic land surface modeling: degree day approximation of snowmelt and geothermal inversion of historical climate change from boreholes in Arctic permafrost. We highlight the contributions that SBI can make to solving such inverse problems and discuss further potential applications in the cryosphere and beyond.</jats:p>