Search | arXiv e-print repository

CQUESST: A dynamical stochastic framework for predicting soil-carbon sequestration

Authors: Dan Pagendam, Jeff Baldock, David Clifford, Ryan Farquharson, Lawrence Murray, Mike Beare, Denis Curtin, Noel Cressie

Abstract: A statistical framework we call CQUESST (Carbon Quantification and Uncertainty from Evolutionary Soil STochastics), which models carbon sequestration and cycling in soils, is applied to a long-running agricultural experiment that controls for crop type, tillage, and season. The experiment, known as the Millenium Tillage Trial (MTT), ran on 42 field-plots for ten years from 2000-2010; here CQUESST is used to model soil carbon dynamically in six pools, in each of the 42 agricultural plots, and on a monthly time step for a decade. We show how CQUESST can be used to estimate soil-carbon cycling rates under different treatments. Our methods provide much-needed statistical tools for quantitatively inferring the effectiveness of different experimental treatments on soil-carbon sequestration. The decade-long data are of multiple observation types, and these interacting time series are ingested into a fully Bayesian model that has a dynamic stochastic model of multiple pools of soil carbon at its core. CQUESST's stochastic model is motivated by the deterministic RothC soil-carbon model based on nonlinear difference equations. We demonstrate how CQUESST can estimate soil-carbon fluxes for different experimental treatments while acknowledging uncertainties in soil-carbon dynamics, in physical parameters, and in observations. CQUESST is implemented efficiently in the probabilistic programming language Stan using its MapReduce parallelization, and it scales well for large numbers of field-plots, using software libraries that allow for computation to be shared over multiple nodes of high-performance computing clusters.

Submitted 9 November, 2024; originally announced November 2024.

arXiv:2201.01564 [pdf, other]

The effect of biologically mediated decay rates on modelling soil carbon sequestration in agricultural settings

Authors: Mohammad Javad Davoudabadi, Daniel Pagendam, Christopher Drovandi, Jeff Baldock, Gentry White

Abstract: Microbial biomass carbon (MBC), a crucial soil labile carbon fraction, is the most active component of the soil organic carbon (SOC) that regulates bio-geochemical processes in terrestrial ecosystems. Some studies in the literature ignore the effect of microbial population growth on carbon decomposition rates. In reality, we might expect the decomposition rate to be related to the population of microbes in the soil and to have a positive relationship with the size of the microbial biomass pool. In this study, we explore the effect of microbial population growth on the accuracy of modelling soil carbon sequestration by developing and comparing two soil carbon models that consider a carrying capacity and a limit to the growth of the microbial pool. We apply our models to three datasets, two small and one large, and select the model with the best predictive performance using two model selection methods. Through this analysis we reveal that commonly used complex soil carbon models can over-fit in the presence of both small and large time-series datasets, and our simpler model can produce more accurate predictions. We conclude that considering microbial population growth in a soil carbon model improves the accuracy of the model in the presence of a large dataset.

Submitted 5 January, 2022; originally announced January 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2105.04789

arXiv:2106.09647 [pdf, other]

Deep Learning Through the Lens of Example Difficulty

Authors: Robert J. N. Baldock, Hartmut Maennel, Behnam Neyshabur

Abstract: Existing work on understanding deep learning often employs measures that compress all data-dependent information into a few numbers. In this work, we adopt a perspective based on the role of individual examples. We introduce a measure of the computational difficulty of making a prediction for a given input: the (effective) prediction depth. Our extensive investigation reveals surprising yet simple relationships between the prediction depth of a given input and the model's uncertainty, confidence, accuracy and speed of learning for that data point. We further categorize difficult examples into three interpretable groups, demonstrate how these groups are processed differently inside deep models and showcase how this understanding allows us to improve prediction accuracy. Insights from our study lead to a coherent view of a number of separately reported phenomena in the literature: early layers generalize while later layers memorize; early layers converge faster and networks learn easy data and simple functions first.

Submitted 18 June, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

Comments: Main paper: 15 pages, 8 figures. Appendix: 31 pages, 40 figures

arXiv:2105.04789 [pdf, other]

doi 10.1038/s41598-024-53516-z

Innovative Approaches in Soil Carbon Sequestration Modelling for Better Prediction with Limited Data

Authors: Mohammad Javad Davoudabadi, Daniel Pagendam, Christopher Drovandi, Jeff Baldock, Gentry White

Abstract: Soil carbon accounting and prediction play a key role in building decision support systems for land managers selling carbon credits, in the spirit of the Paris and Kyoto protocol agreements. Land managers typically rely on computationally complex models fit using sparse datasets to make these accounts and predictions. The model complexity and sparsity of the data can lead to over-fitting, and hence to inaccurate results when making predictions with new data. Modellers address over-fitting by simplifying their models and reducing the number of parameters, and in the current context this could involve neglecting some soil organic carbon (SOC) components. In this study, we introduce two novel SOC models and a new RothC-like model and investigate how the SOC components and complexity of the SOC models affect the SOC prediction in the presence of small and sparse time series data. We develop model selection methods that can identify the soil carbon model with the best predictive performance, in light of the available data. Through this analysis we reveal that commonly used complex soil carbon models can over-fit in the presence of sparse time series data, and our simpler models can produce more accurate predictions. The published version of this study is available in Scientific Reports (https://www.nature.com/articles/s41598-024-53516-z, doi:10.1038/s41598-024-53516-z).

Submitted 10 February, 2024; v1 submitted 11 May, 2021; originally announced May 2021.

Comments: 28 pages, 7 figures

Journal ref: Sci Rep 14, 3191 (2024)

arXiv:2006.10455 [pdf, other]

What Do Neural Networks Learn When Trained With Random Labels?

Authors: Hartmut Maennel, Ibrahim Alabdulmohsin, Ilya Tolstikhin, Robert J. N. Baldock, Olivier Bousquet, Sylvain Gelly, Daniel Keysers

Abstract: We study deep neural networks (DNNs) trained on natural image data with entirely random labels. Despite its popularity in the literature, where it is often used to study memorization, generalization, and other phenomena, little is known about what DNNs learn in this setting. In this paper, we show analytically for convolutional and fully connected networks that an alignment between the principal components of network parameters and data takes place when training with random labels. We study this alignment effect by investigating neural networks pre-trained on randomly labelled image data and subsequently fine-tuned on disjoint datasets with random or real labels. We show how this alignment produces a positive transfer: networks pre-trained with random labels train faster downstream compared to training from scratch even after accounting for simple effects, such as weight scaling. We analyze how competing effects, such as specialization at later layers, may hide the positive transfer. These effects are studied in several network architectures, including VGG16 and ResNet18, on CIFAR10 and ImageNet.

Submitted 11 November, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

Comments: Accepted, NeurIPS2020

arXiv:1904.04154 [pdf, other]

Bayesian Neural Networks at Finite Temperature

Authors: Robert J. N. Baldock, Nicola Marzari

Abstract: We recapitulate the Bayesian formulation of neural network based classifiers and show that, while sampling from the posterior does indeed lead to better generalisation than is obtained by standard optimisation of the cost function, even better performance can in general be achieved by sampling finite temperature ($T$) distributions derived from the posterior. Taking the example of two different deep (3 hidden layers) classifiers for MNIST data, we find quite different $T$ values to be appropriate in each case. In particular, for a typical neural network classifier a clear minimum of the test error is observed at $T>0$. This suggests an early stopping criterion for full batch simulated annealing: cool until the average validation error starts to increase, then revert to the parameters with the lowest validation error. As $T$ is increased, classifiers transition from accurate classifiers to classifiers that have higher training error than assigning equal probability to each class. Efficient studies of these temperature-induced effects are enabled using a replica-exchange Hamiltonian Monte Carlo simulation technique. Finally, we show how thermodynamic integration can be used to perform model selection for deep neural networks. Similar to the Laplace approximation, this approach assumes that the posterior is dominated by a single mode. Crucially, however, no assumption is made about the shape of that mode and it is not required to precisely compute and invert the Hessian.

Submitted 8 April, 2019; originally announced April 2019.

Comments: 11 pages, 4 figures
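
The early-stopping criterion stated in the abstract above (cool until the average validation error starts to increase, then revert to the parameters with the lowest validation error) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `anneal_with_early_stopping` and the `step` callback are hypothetical, and `step` is assumed to run the sampler at one temperature and return the resulting parameters together with their average validation error.

```python
from typing import Callable, Iterable, Optional, Tuple, TypeVar

P = TypeVar("P")  # whatever object represents the network parameters


def anneal_with_early_stopping(
    temperatures: Iterable[float],
    step: Callable[[float], Tuple[P, float]],
) -> Tuple[P, float, float]:
    """Cool through `temperatures` (ordered high to low) with early stopping.

    `step(T)` is a hypothetical callback: it samples at temperature T and
    returns (parameters, average validation error).
    Returns (best parameters, temperature they came from, their error).
    """
    best: Optional[Tuple[P, float, float]] = None
    previous_error = float("inf")

    for temperature in temperatures:
        params, error = step(temperature)

        # Remember the parameters with the lowest validation error so far.
        if best is None or error < best[2]:
            best = (params, temperature, error)

        # Stop cooling as soon as the validation error starts to increase.
        if error > previous_error:
            break
        previous_error = error

    if best is None:
        raise ValueError("temperatures must contain at least one value")
    return best
```

Because the best parameters are tracked at every temperature, "reverting" needs no extra work: the function simply returns the stored best triple once cooling stops.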

Showing 1–6 of 6 results for author: Baldock, J