-
Joint Alignment of Multivariate Quasi-Periodic Functional Data Using Deep Learning
Authors:
Vi Thanh Pham,
Jonas Bille Nielsen,
Klaus Fuglsang Kofoed,
Jørgen Tobias Kühl,
Andreas Kryger Jensen
Abstract:
The joint alignment of multivariate functional data plays an important role in various fields such as signal processing, neuroscience and medicine, including the statistical analysis of data from wearable devices. Traditional methods often ignore the phase variability and instead focus on the variability in the observed amplitude. We present a novel method for joint alignment of multivariate quasi…
▽ More
The joint alignment of multivariate functional data plays an important role in various fields such as signal processing, neuroscience and medicine, including the statistical analysis of data from wearable devices. Traditional methods often ignore the phase variability and instead focus on the variability in the observed amplitude. We present a novel method for joint alignment of multivariate quasi-periodic functions using deep neural networks, decomposing, but retaining all the information in the data by preserving both phase and amplitude variability. Our proposed neural network uses a special activation of the output that builds on the unit simplex transformation, and we utilize a loss function based on the Fisher-Rao metric to train our model. Furthermore, our method is unsupervised and can provide an optimal common template function as well as subject-specific templates. We demonstrate our method on two simulated datasets and one real example, comprising data from 12-lead 10s electrocardiogram recordings.
△ Less
Submitted 14 November, 2023;
originally announced December 2023.
-
Monitoring a developing pandemic with available data
Authors:
María Luz Gámiz,
Enno Mammen,
María Dolores Martínez-Miranda,
Jens Perch Nielsen
Abstract:
This paper addresses pandemic statistics from a management perspective. Both input and output are easy to understand. Focus is on operations and cross border communication. To be able to work with simple available data some new missing data issues have to be solved from a mathematical statistical point of view. We illustrate our approach with data from France collected during the recent Covid-19 p…
▽ More
This paper addresses pandemic statistics from a management perspective. Both input and output are easy to understand. Focus is on operations and cross border communication. To be able to work with simple available data some new missing data issues have to be solved from a mathematical statistical point of view. We illustrate our approach with data from France collected during the recent Covid-19 pandemic. Our new benchmark method also introduces a potential new division of labour while working with pandemic statistics allowing crucial input to be fed to the model via prior knowledge from external experts.
△ Less
Submitted 19 August, 2023;
originally announced August 2023.
-
Low quality exposure and point processes with a view to the first phase of a pandemic
Authors:
María Luz Gámiz,
Enno Mammen,
María Dolores Martínez-Miranda,
Jens Perch Nielsen
Abstract:
In the early days of development of a pandemic there is no time for complicated data collection. One needs a simple cross-country benchmark approach based on robust data that is easy to understand and easy to collect. The recent pandemic has shown us what early available pandemic data might look like, because statistical data was published every day in standard news outlets in many countries. This…
▽ More
In the early days of development of a pandemic there is no time for complicated data collection. One needs a simple cross-country benchmark approach based on robust data that is easy to understand and easy to collect. The recent pandemic has shown us what early available pandemic data might look like, because statistical data was published every day in standard news outlets in many countries. This paper provides new methodology for the analysis data where exposure is only vaguely understood and where the very definition of exposure might change over time. The exposure of poor quality is used to analyse and forecast events. Our example of such exposure is daily infections during a pandemic and the events are number of new infected patients in hospitals every day. Examples are given with French Covid-19 data on hospitalized patients and numbers of infected.
△ Less
Submitted 19 August, 2023;
originally announced August 2023.
-
Optimal Inspection and Maintenance Planning for Deteriorating Structural Components through Dynamic Bayesian Networks and Markov Decision Processes
Authors:
P. G. Morato,
K. G. Papakonstantinou,
C. P. Andriotis,
J. S. Nielsen,
P. Rigo
Abstract:
Civil and maritime engineering systems, among others, from bridges to offshore platforms and wind turbines, must be efficiently managed as they are exposed to deterioration mechanisms throughout their operational life, such as fatigue or corrosion. Identifying optimal inspection and maintenance policies demands the solution of a complex sequential decision-making problem under uncertainty, with th…
▽ More
Civil and maritime engineering systems, among others, from bridges to offshore platforms and wind turbines, must be efficiently managed as they are exposed to deterioration mechanisms throughout their operational life, such as fatigue or corrosion. Identifying optimal inspection and maintenance policies demands the solution of a complex sequential decision-making problem under uncertainty, with the main objective of efficiently controlling the risk associated with structural failures. Addressing this complexity, risk-based inspection planning methodologies, supported often by dynamic Bayesian networks, evaluate a set of pre-defined heuristic decision rules to reasonably simplify the decision problem. However, the resulting policies may be compromised by the limited space considered in the definition of the decision rules. Avoiding this limitation, Partially Observable Markov Decision Processes (POMDPs) provide a principled mathematical methodology for stochastic optimal control under uncertain action outcomes and observations, in which the optimal actions are prescribed as a function of the entire, dynamically updated, state probability distribution. In this paper, we combine dynamic Bayesian networks with POMDPs in a joint framework for optimal inspection and maintenance planning, and we provide the formulation for developing both infinite and finite horizon POMDPs in a structural reliability context. The proposed methodology is implemented and tested for the case of a structural component subject to fatigue deterioration, demonstrating the capability of state-of-the-art point-based POMDP solvers for solving the underlying planning optimization problem. Within the numerical experiments, POMDP and heuristic-based policies are thoroughly compared, and results showcase that POMDPs achieve substantially lower costs as compared to their counterparts, even for traditional problem settings.
△ Less
Submitted 28 November, 2021; v1 submitted 9 September, 2020;
originally announced September 2020.
-
Wasserstein Neural Processes
Authors:
Andrew Carr,
Jared Nielsen,
David Wingate
Abstract:
Neural Processes (NPs) are a class of models that learn a mapping from a context set of input-output pairs to a distribution over functions. They are traditionally trained using maximum likelihood with a KL divergence regularization term. We show that there are desirable classes of problems where NPs, with this loss, fail to learn any reasonable distribution. We also show that this drawback is sol…
▽ More
Neural Processes (NPs) are a class of models that learn a mapping from a context set of input-output pairs to a distribution over functions. They are traditionally trained using maximum likelihood with a KL divergence regularization term. We show that there are desirable classes of problems where NPs, with this loss, fail to learn any reasonable distribution. We also show that this drawback is solved by using approximations of Wasserstein distance which calculates optimal transport distances even for distributions of disjoint support. We give experimental justification for our method and demonstrate performance. These Wasserstein Neural Processes (WNPs) maintain all of the benefits of traditional NPs while being able to approximate a new class of function mappings.
△ Less
Submitted 9 January, 2020; v1 submitted 1 October, 2019;
originally announced October 2019.
-
Non-Smooth Backfitting for Excess Risk Additive Regression Model with Two Survival Time-Scales
Authors:
Munir Hiabu,
Jens P. Nielsen,
Thomas H. Scheike
Abstract:
We present a new backfitting algorithm estimating the complex structured non-parametric survival model of Scheike (2001) without having to use smoothing. The considered model is a non-parametric survival model with two time-scales that are equivalent up to a constant that varies over the subjects. Covariate effects are modelled linearly on each time scale by additive Aalen models. Estimators of th…
▽ More
We present a new backfitting algorithm estimating the complex structured non-parametric survival model of Scheike (2001) without having to use smoothing. The considered model is a non-parametric survival model with two time-scales that are equivalent up to a constant that varies over the subjects. Covariate effects are modelled linearly on each time scale by additive Aalen models. Estimators of the cumulative intensities on the two time-scales are suggested by solving local estimating equations jointly on the two time-scales. We are able to estimate the cumulative intensities solving backfitting estimating equations without using smoothing methods and we provide large sample properties and simultaneous confidence bands. The model is applied to data on myocardial infarction providing a separation of the two effects stemming from time since diagnosis and age.
△ Less
Submitted 1 April, 2019;
originally announced April 2019.
-
Fast, Precise Myelin Water Quantification using DESS MRI and Kernel Learning
Authors:
Gopal Nataraj,
Jon-Fredrik Nielsen,
Mingjie Gao,
Jeffrey A. Fessler
Abstract:
Purpose: To investigate the feasibility of myelin water content quantification using fast dual-echo steady-state (DESS) scans and machine learning with kernels.
Methods: We optimized combinations of steady-state (SS) scans for precisely estimating the fast-relaxing signal fraction ff of a two-compartment signal model, subject to a scan time constraint. We estimated ff from the optimized DESS acq…
▽ More
Purpose: To investigate the feasibility of myelin water content quantification using fast dual-echo steady-state (DESS) scans and machine learning with kernels.
Methods: We optimized combinations of steady-state (SS) scans for precisely estimating the fast-relaxing signal fraction ff of a two-compartment signal model, subject to a scan time constraint. We estimated ff from the optimized DESS acquisition using a recently developed method for rapid parameter estimation via regression with kernels (PERK). We compared DESS PERK ff estimates to conventional myelin water fraction (MWF) estimates from a longer multi-echo spin-echo (MESE) acquisition in simulation, in vivo, and ex vivo studies.
Results: Simulations demonstrate that DESS PERK ff estimators and MESE MWF estimators achieve comparable error levels. In vivo and ex vivo experiments demonstrate that MESE MWF and DESS PERK ff estimates are quantitatively comparable measures of WM myelin water content. To our knowledge, these experiments are the first to demonstrate myelin water images from a SS acquisition that are quantitatively similar to conventional MESE MWF images.
Conclusion: Combinations of fast DESS scans can be designed to enable precise ff estimation. PERK is well-suited for ff estimation. DESS PERK ff and MESE MWF estimates are quantitatively similar measures of WM myelin water content.
△ Less
Submitted 24 September, 2018;
originally announced September 2018.
-
Multiplicative local linear hazard estimation and best one-sided cross-validation
Authors:
Maria Luz Gamiz,
Maria Dolores Martinez-Miranda,
Jens Perch Nielsen
Abstract:
This paper develops detailed mathematical statistical theory of a new class of cross-validation techniques of local linear kernel hazards and their multiplicative bias corrections. The new class of cross-validation combines principles of local information and recent advances in indirect cross-validation. A few applications of cross-validating multiplicative kernel hazard estimation do exist in the…
▽ More
This paper develops detailed mathematical statistical theory of a new class of cross-validation techniques of local linear kernel hazards and their multiplicative bias corrections. The new class of cross-validation combines principles of local information and recent advances in indirect cross-validation. A few applications of cross-validating multiplicative kernel hazard estimation do exist in the literature. However, detailed mathematical statistical theory and small sample performance are introduced via this paper and further upgraded to our new class of best one-sided cross-validation. Best one-sided cross-validation turns out to have excellent performance in its practical illustrations, in its small sample performance and in its mathematical statistical theoretical performance.
△ Less
Submitted 16 October, 2017;
originally announced October 2017.
-
Dictionary-Free MRI PERK: Parameter Estimation via Regression with Kernels
Authors:
Gopal Nataraj,
Jon-Fredrik Nielsen,
Clayton Scott,
Jeffrey A. Fessler
Abstract:
This paper introduces a fast, general method for dictionary-free parameter estimation in quantitative magnetic resonance imaging (QMRI) via regression with kernels (PERK). PERK first uses prior distributions and the nonlinear MR signal model to simulate many parameter-measurement pairs. Inspired by machine learning, PERK then takes these parameter-measurement pairs as labeled training points and l…
▽ More
This paper introduces a fast, general method for dictionary-free parameter estimation in quantitative magnetic resonance imaging (QMRI) via regression with kernels (PERK). PERK first uses prior distributions and the nonlinear MR signal model to simulate many parameter-measurement pairs. Inspired by machine learning, PERK then takes these parameter-measurement pairs as labeled training points and learns from them a nonlinear regression function using kernel functions and convex optimization. PERK admits a simple implementation as per-voxel nonlinear lifting of MRI measurements followed by linear minimum mean-squared error regression. We demonstrate PERK for $T_1,T_2$ estimation, a well-studied application where it is simple to compare PERK estimates against dictionary-based grid search estimates. Numerical simulations as well as single-slice phantom and in vivo experiments demonstrate that PERK and grid search produce comparable $T_1,T_2$ estimates in white and gray matter, but PERK is consistently at least $23\times$ faster. This acceleration factor will increase by several orders of magnitude for full-volume QMRI estimation problems involving more latent parameters per voxel.
△ Less
Submitted 6 October, 2017;
originally announced October 2017.
-
On Local Optima in Learning Bayesian Networks
Authors:
Jens D. Nielsen,
Tomas Kocka,
Jose M. Pena
Abstract:
This paper proposes and evaluates the k-greedy equivalence search algorithm (KES) for learning Bayesian networks (BNs) from complete data. The main characteristic of KES is that it allows a trade-off between greediness and randomness, thus exploring different good local optima. When greediness is set at maximum, KES corresponds to the greedy equivalence search algorithm (GES). When greediness is k…
▽ More
This paper proposes and evaluates the k-greedy equivalence search algorithm (KES) for learning Bayesian networks (BNs) from complete data. The main characteristic of KES is that it allows a trade-off between greediness and randomness, thus exploring different good local optima. When greediness is set at maximum, KES corresponds to the greedy equivalence search algorithm (GES). When greediness is kept at minimum, we prove that under mild assumptions KES asymptotically returns any inclusion optimal BN with nonzero probability. Experimental results for both synthetic and real data are reported showing that KES often finds a better local optima than GES. Moreover, we use KES to experimentally confirm that the number of different local optima is often huge.
△ Less
Submitted 19 October, 2012;
originally announced December 2012.
-
A comparative study of new cross-validated bandwidth selectors for kernel density estimation
Authors:
Enno Mammen,
Maria Dolores Martinez Miranda,
Jens Perch Nielsen,
Stefan Sperlich
Abstract:
Recent contributions to kernel smoothing show that the performance of cross-validated bandwidth selectors improve significantly from indirectness. Indirect crossvalidation first estimates the classical cross-validated bandwidth from a more rough and difficult smoothing problem than the original one and then rescales this indirect bandwidth to become a bandwidth of the original problem. The motivat…
▽ More
Recent contributions to kernel smoothing show that the performance of cross-validated bandwidth selectors improve significantly from indirectness. Indirect crossvalidation first estimates the classical cross-validated bandwidth from a more rough and difficult smoothing problem than the original one and then rescales this indirect bandwidth to become a bandwidth of the original problem. The motivation for this approach comes from the observation that classical crossvalidation tends to work better when the smoothing problem is difficult. In this paper we find that the performance of indirect crossvalidation improves theoretically and practically when the polynomial order of the indirect kernel increases, with the Gaussian kernel as limiting kernel when the polynomial order goes to infinity. These theoretical and practical results support the often proposed choice of the Gaussian kernel as indirect kernel. However, for do-validation our study shows a discrepancy between asymptotic theory and practical performance. As for indirect crossvalidation, in asymptotic theory the performance of indirect do-validation improves with increasing polynomial order of the used indirect kernel. But this theoretical improvements do not carry over to practice and the original do-validation still seems to be our preferred bandwidth selector. We also consider plug-in estimation and combinations of plug-in bandwidths and crossvalidated bandwidths. These latter bandwidths do not outperform the original do-validation estimator either.
△ Less
Submitted 20 September, 2012;
originally announced September 2012.