Search | arXiv e-print repository

Joint Alignment of Multivariate Quasi-Periodic Functional Data Using Deep Learning

Authors: Vi Thanh Pham, Jonas Bille Nielsen, Klaus Fuglsang Kofoed, Jørgen Tobias Kühl, Andreas Kryger Jensen

Abstract: The joint alignment of multivariate functional data plays an important role in various fields such as signal processing, neuroscience and medicine, including the statistical analysis of data from wearable devices. Traditional methods often ignore the phase variability and instead focus on the variability in the observed amplitude. We present a novel method for joint alignment of multivariate quasi… ▽ More The joint alignment of multivariate functional data plays an important role in various fields such as signal processing, neuroscience and medicine, including the statistical analysis of data from wearable devices. Traditional methods often ignore the phase variability and instead focus on the variability in the observed amplitude. We present a novel method for joint alignment of multivariate quasi-periodic functions using deep neural networks, decomposing, but retaining all the information in the data by preserving both phase and amplitude variability. Our proposed neural network uses a special activation of the output that builds on the unit simplex transformation, and we utilize a loss function based on the Fisher-Rao metric to train our model. Furthermore, our method is unsupervised and can provide an optimal common template function as well as subject-specific templates. We demonstrate our method on two simulated datasets and one real example, comprising data from 12-lead 10s electrocardiogram recordings. △ Less

Submitted 14 November, 2023; originally announced December 2023.

Comments: 28 pages, 6 figures

arXiv:2308.09919 [pdf, other]

Monitoring a developing pandemic with available data

Authors: María Luz Gámiz, Enno Mammen, María Dolores Martínez-Miranda, Jens Perch Nielsen

Abstract: This paper addresses pandemic statistics from a management perspective. Both input and output are easy to understand. Focus is on operations and cross border communication. To be able to work with simple available data some new missing data issues have to be solved from a mathematical statistical point of view. We illustrate our approach with data from France collected during the recent Covid-19 p… ▽ More This paper addresses pandemic statistics from a management perspective. Both input and output are easy to understand. Focus is on operations and cross border communication. To be able to work with simple available data some new missing data issues have to be solved from a mathematical statistical point of view. We illustrate our approach with data from France collected during the recent Covid-19 pandemic. Our new benchmark method also introduces a potential new division of labour while working with pandemic statistics allowing crucial input to be fed to the model via prior knowledge from external experts. △ Less

Submitted 19 August, 2023; originally announced August 2023.

Comments: 11 figures

MSC Class: 62G05 ACM Class: G.3

arXiv:2308.09918 [pdf, other]

Low quality exposure and point processes with a view to the first phase of a pandemic

Authors: María Luz Gámiz, Enno Mammen, María Dolores Martínez-Miranda, Jens Perch Nielsen

Abstract: In the early days of development of a pandemic there is no time for complicated data collection. One needs a simple cross-country benchmark approach based on robust data that is easy to understand and easy to collect. The recent pandemic has shown us what early available pandemic data might look like, because statistical data was published every day in standard news outlets in many countries. This… ▽ More In the early days of development of a pandemic there is no time for complicated data collection. One needs a simple cross-country benchmark approach based on robust data that is easy to understand and easy to collect. The recent pandemic has shown us what early available pandemic data might look like, because statistical data was published every day in standard news outlets in many countries. This paper provides new methodology for the analysis data where exposure is only vaguely understood and where the very definition of exposure might change over time. The exposure of poor quality is used to analyse and forecast events. Our example of such exposure is daily infections during a pandemic and the events are number of new infected patients in hospitals every day. Examples are given with French Covid-19 data on hospitalized patients and numbers of infected. △ Less

Submitted 19 August, 2023; originally announced August 2023.

Comments: 5 figures

MSC Class: 62G05 ACM Class: G.3

arXiv:2009.04547 [pdf, other]

doi 10.1016/j.strusafe.2021.102140

Optimal Inspection and Maintenance Planning for Deteriorating Structural Components through Dynamic Bayesian Networks and Markov Decision Processes

Authors: P. G. Morato, K. G. Papakonstantinou, C. P. Andriotis, J. S. Nielsen, P. Rigo

Abstract: Civil and maritime engineering systems, among others, from bridges to offshore platforms and wind turbines, must be efficiently managed as they are exposed to deterioration mechanisms throughout their operational life, such as fatigue or corrosion. Identifying optimal inspection and maintenance policies demands the solution of a complex sequential decision-making problem under uncertainty, with th… ▽ More Civil and maritime engineering systems, among others, from bridges to offshore platforms and wind turbines, must be efficiently managed as they are exposed to deterioration mechanisms throughout their operational life, such as fatigue or corrosion. Identifying optimal inspection and maintenance policies demands the solution of a complex sequential decision-making problem under uncertainty, with the main objective of efficiently controlling the risk associated with structural failures. Addressing this complexity, risk-based inspection planning methodologies, supported often by dynamic Bayesian networks, evaluate a set of pre-defined heuristic decision rules to reasonably simplify the decision problem. However, the resulting policies may be compromised by the limited space considered in the definition of the decision rules. Avoiding this limitation, Partially Observable Markov Decision Processes (POMDPs) provide a principled mathematical methodology for stochastic optimal control under uncertain action outcomes and observations, in which the optimal actions are prescribed as a function of the entire, dynamically updated, state probability distribution. In this paper, we combine dynamic Bayesian networks with POMDPs in a joint framework for optimal inspection and maintenance planning, and we provide the formulation for developing both infinite and finite horizon POMDPs in a structural reliability context. The proposed methodology is implemented and tested for the case of a structural component subject to fatigue deterioration, demonstrating the capability of state-of-the-art point-based POMDP solvers for solving the underlying planning optimization problem. Within the numerical experiments, POMDP and heuristic-based policies are thoroughly compared, and results showcase that POMDPs achieve substantially lower costs as compared to their counterparts, even for traditional problem settings. △ Less

Submitted 28 November, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

Journal ref: Structural Safety, Volume 94, 2022,

arXiv:1910.00668 [pdf, other]

Wasserstein Neural Processes

Authors: Andrew Carr, Jared Nielsen, David Wingate

Abstract: Neural Processes (NPs) are a class of models that learn a mapping from a context set of input-output pairs to a distribution over functions. They are traditionally trained using maximum likelihood with a KL divergence regularization term. We show that there are desirable classes of problems where NPs, with this loss, fail to learn any reasonable distribution. We also show that this drawback is sol… ▽ More Neural Processes (NPs) are a class of models that learn a mapping from a context set of input-output pairs to a distribution over functions. They are traditionally trained using maximum likelihood with a KL divergence regularization term. We show that there are desirable classes of problems where NPs, with this loss, fail to learn any reasonable distribution. We also show that this drawback is solved by using approximations of Wasserstein distance which calculates optimal transport distances even for distributions of disjoint support. We give experimental justification for our method and demonstrate performance. These Wasserstein Neural Processes (WNPs) maintain all of the benefits of traditional NPs while being able to approximate a new class of function mappings. △ Less

Submitted 9 January, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

arXiv:1904.01202 [pdf, other]

Non-Smooth Backfitting for Excess Risk Additive Regression Model with Two Survival Time-Scales

Authors: Munir Hiabu, Jens P. Nielsen, Thomas H. Scheike

Abstract: We present a new backfitting algorithm estimating the complex structured non-parametric survival model of Scheike (2001) without having to use smoothing. The considered model is a non-parametric survival model with two time-scales that are equivalent up to a constant that varies over the subjects. Covariate effects are modelled linearly on each time scale by additive Aalen models. Estimators of th… ▽ More We present a new backfitting algorithm estimating the complex structured non-parametric survival model of Scheike (2001) without having to use smoothing. The considered model is a non-parametric survival model with two time-scales that are equivalent up to a constant that varies over the subjects. Covariate effects are modelled linearly on each time scale by additive Aalen models. Estimators of the cumulative intensities on the two time-scales are suggested by solving local estimating equations jointly on the two time-scales. We are able to estimate the cumulative intensities solving backfitting estimating equations without using smoothing methods and we provide large sample properties and simultaneous confidence bands. The model is applied to data on myocardial infarction providing a separation of the two effects stemming from time since diagnosis and age. △ Less

Submitted 1 April, 2019; originally announced April 2019.

arXiv:1809.08908 [pdf, ps, other]

Fast, Precise Myelin Water Quantification using DESS MRI and Kernel Learning

Authors: Gopal Nataraj, Jon-Fredrik Nielsen, Mingjie Gao, Jeffrey A. Fessler

Abstract: Purpose: To investigate the feasibility of myelin water content quantification using fast dual-echo steady-state (DESS) scans and machine learning with kernels. Methods: We optimized combinations of steady-state (SS) scans for precisely estimating the fast-relaxing signal fraction ff of a two-compartment signal model, subject to a scan time constraint. We estimated ff from the optimized DESS acq… ▽ More Purpose: To investigate the feasibility of myelin water content quantification using fast dual-echo steady-state (DESS) scans and machine learning with kernels. Methods: We optimized combinations of steady-state (SS) scans for precisely estimating the fast-relaxing signal fraction ff of a two-compartment signal model, subject to a scan time constraint. We estimated ff from the optimized DESS acquisition using a recently developed method for rapid parameter estimation via regression with kernels (PERK). We compared DESS PERK ff estimates to conventional myelin water fraction (MWF) estimates from a longer multi-echo spin-echo (MESE) acquisition in simulation, in vivo, and ex vivo studies. Results: Simulations demonstrate that DESS PERK ff estimators and MESE MWF estimators achieve comparable error levels. In vivo and ex vivo experiments demonstrate that MESE MWF and DESS PERK ff estimates are quantitatively comparable measures of WM myelin water content. To our knowledge, these experiments are the first to demonstrate myelin water images from a SS acquisition that are quantitatively similar to conventional MESE MWF images. Conclusion: Combinations of fast DESS scans can be designed to enable precise ff estimation. PERK is well-suited for ff estimation. DESS PERK ff and MESE MWF estimates are quantitatively similar measures of WM myelin water content. △ Less

Submitted 24 September, 2018; originally announced September 2018.

arXiv:1710.05575 [pdf, ps, other]

Multiplicative local linear hazard estimation and best one-sided cross-validation

Authors: Maria Luz Gamiz, Maria Dolores Martinez-Miranda, Jens Perch Nielsen

Abstract: This paper develops detailed mathematical statistical theory of a new class of cross-validation techniques of local linear kernel hazards and their multiplicative bias corrections. The new class of cross-validation combines principles of local information and recent advances in indirect cross-validation. A few applications of cross-validating multiplicative kernel hazard estimation do exist in the… ▽ More This paper develops detailed mathematical statistical theory of a new class of cross-validation techniques of local linear kernel hazards and their multiplicative bias corrections. The new class of cross-validation combines principles of local information and recent advances in indirect cross-validation. A few applications of cross-validating multiplicative kernel hazard estimation do exist in the literature. However, detailed mathematical statistical theory and small sample performance are introduced via this paper and further upgraded to our new class of best one-sided cross-validation. Best one-sided cross-validation turns out to have excellent performance in its practical illustrations, in its small sample performance and in its mathematical statistical theoretical performance. △ Less

Submitted 16 October, 2017; originally announced October 2017.

arXiv:1710.02441 [pdf, ps, other]

doi 10.1109/TMI.2018.2817547

Dictionary-Free MRI PERK: Parameter Estimation via Regression with Kernels

Authors: Gopal Nataraj, Jon-Fredrik Nielsen, Clayton Scott, Jeffrey A. Fessler

Abstract: This paper introduces a fast, general method for dictionary-free parameter estimation in quantitative magnetic resonance imaging (QMRI) via regression with kernels (PERK). PERK first uses prior distributions and the nonlinear MR signal model to simulate many parameter-measurement pairs. Inspired by machine learning, PERK then takes these parameter-measurement pairs as labeled training points and l… ▽ More This paper introduces a fast, general method for dictionary-free parameter estimation in quantitative magnetic resonance imaging (QMRI) via regression with kernels (PERK). PERK first uses prior distributions and the nonlinear MR signal model to simulate many parameter-measurement pairs. Inspired by machine learning, PERK then takes these parameter-measurement pairs as labeled training points and learns from them a nonlinear regression function using kernel functions and convex optimization. PERK admits a simple implementation as per-voxel nonlinear lifting of MRI measurements followed by linear minimum mean-squared error regression. We demonstrate PERK for $T_1,T_2$ estimation, a well-studied application where it is simple to compare PERK estimates against dictionary-based grid search estimates. Numerical simulations as well as single-slice phantom and in vivo experiments demonstrate that PERK and grid search produce comparable $T_1,T_2$ estimates in white and gray matter, but PERK is consistently at least $23\times$ faster. This acceleration factor will increase by several orders of magnitude for full-volume QMRI estimation problems involving more latent parameters per voxel. △ Less

Submitted 6 October, 2017; originally announced October 2017.

Comments: submitted to IEEE Transactions on Medical Imaging

Journal ref: IEEE Transactions on Medical Imaging 37(9):2103-14 Sep 2018

arXiv:1212.2500 [pdf]

On Local Optima in Learning Bayesian Networks

Authors: Jens D. Nielsen, Tomas Kocka, Jose M. Pena

Abstract: This paper proposes and evaluates the k-greedy equivalence search algorithm (KES) for learning Bayesian networks (BNs) from complete data. The main characteristic of KES is that it allows a trade-off between greediness and randomness, thus exploring different good local optima. When greediness is set at maximum, KES corresponds to the greedy equivalence search algorithm (GES). When greediness is k… ▽ More This paper proposes and evaluates the k-greedy equivalence search algorithm (KES) for learning Bayesian networks (BNs) from complete data. The main characteristic of KES is that it allows a trade-off between greediness and randomness, thus exploring different good local optima. When greediness is set at maximum, KES corresponds to the greedy equivalence search algorithm (GES). When greediness is kept at minimum, we prove that under mild assumptions KES asymptotically returns any inclusion optimal BN with nonzero probability. Experimental results for both synthetic and real data are reported showing that KES often finds a better local optima than GES. Moreover, we use KES to experimentally confirm that the number of different local optima is often huge. △ Less

Submitted 19 October, 2012; originally announced December 2012.

Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

Report number: UAI-P-2003-PG-435-442

arXiv:1209.4495 [pdf, other]

A comparative study of new cross-validated bandwidth selectors for kernel density estimation

Authors: Enno Mammen, Maria Dolores Martinez Miranda, Jens Perch Nielsen, Stefan Sperlich

Abstract: Recent contributions to kernel smoothing show that the performance of cross-validated bandwidth selectors improve significantly from indirectness. Indirect crossvalidation first estimates the classical cross-validated bandwidth from a more rough and difficult smoothing problem than the original one and then rescales this indirect bandwidth to become a bandwidth of the original problem. The motivat… ▽ More Recent contributions to kernel smoothing show that the performance of cross-validated bandwidth selectors improve significantly from indirectness. Indirect crossvalidation first estimates the classical cross-validated bandwidth from a more rough and difficult smoothing problem than the original one and then rescales this indirect bandwidth to become a bandwidth of the original problem. The motivation for this approach comes from the observation that classical crossvalidation tends to work better when the smoothing problem is difficult. In this paper we find that the performance of indirect crossvalidation improves theoretically and practically when the polynomial order of the indirect kernel increases, with the Gaussian kernel as limiting kernel when the polynomial order goes to infinity. These theoretical and practical results support the often proposed choice of the Gaussian kernel as indirect kernel. However, for do-validation our study shows a discrepancy between asymptotic theory and practical performance. As for indirect crossvalidation, in asymptotic theory the performance of indirect do-validation improves with increasing polynomial order of the used indirect kernel. But this theoretical improvements do not carry over to practice and the original do-validation still seems to be our preferred bandwidth selector. We also consider plug-in estimation and combinations of plug-in bandwidths and crossvalidated bandwidths. These latter bandwidths do not outperform the original do-validation estimator either. △ Less

Submitted 20 September, 2012; originally announced September 2012.

Comments: 19 pages, 8 tables, 3 figures

Showing 1–11 of 11 results for author: Nielsen, J