-
FactsR: A Safer Method for Producing High Quality Healthcare Documentation
Authors:
Victor Petrén Bach Hansen,
Lasse Krogsbøll,
Jonas Lyngsø,
Mathias Baltzersen,
Andreas Motzfeldt,
Kevin Pelgrims,
Lars Maaløe
Abstract:
There are now a multitude of AI-scribing solutions for healthcare promising the utilization of large language models for ambient documentation. However, these AI scribes still rely on one-shot, or few-shot prompts for generating notes after the consultation has ended, employing little to no reasoning. This risks long notes with an increase in hallucinations, misrepresentation of the intent of the…
▽ More
There are now a multitude of AI-scribing solutions for healthcare promising the utilization of large language models for ambient documentation. However, these AI scribes still rely on one-shot, or few-shot prompts for generating notes after the consultation has ended, employing little to no reasoning. This risks long notes with an increase in hallucinations, misrepresentation of the intent of the clinician, and reliance on the proofreading of the clinician to catch errors. A dangerous combination for patient safety if vigilance is compromised by workload and fatigue. In this paper, we introduce a method for extracting salient clinical information in real-time alongside the healthcare consultation, denoted Facts, and use that information recursively to generate the final note. The FactsR method results in more accurate and concise notes by placing the clinician-in-the-loop of note generation, while opening up new use cases within real-time decision support.
△ Less
Submitted 4 June, 2025; v1 submitted 15 May, 2025;
originally announced May 2025.
-
Moments by Integrating the Moment-Generating Function
Authors:
Peter Reinhard Hansen,
Chen Tong
Abstract:
We introduce a novel method for obtaining a wide variety of moments of a random variable with a well-defined moment-generating function (MGF). We derive new expressions for fractional moments and fractional absolute moments, both central and non-central moments. The new moment expressions are relatively simple integrals that involve the MGF, but do not require its derivatives. We label the new met…
▽ More
We introduce a novel method for obtaining a wide variety of moments of a random variable with a well-defined moment-generating function (MGF). We derive new expressions for fractional moments and fractional absolute moments, both central and non-central moments. The new moment expressions are relatively simple integrals that involve the MGF, but do not require its derivatives. We label the new method CMGF because it uses a complex extension of the MGF and can be used to obtain complex moments. We illustrate the new method with three applications where the MGF is available in closed-form, while the corresponding densities and the derivatives of the MGF are either unavailable or very difficult to obtain.
△ Less
Submitted 30 March, 2025; v1 submitted 30 October, 2024;
originally announced October 2024.
-
Robust Estimation of Realized Correlation: New Insight about Intraday Fluctuations in Market Betas
Authors:
Peter Reinhard Hansen,
Yiyao Luo
Abstract:
Time-varying volatility is an inherent feature of most economic time-series, which causes standard correlation estimators to be inconsistent. The quadrant correlation estimator is consistent but very inefficient. We propose a novel subsampled quadrant estimator that improves efficiency while preserving consistency and robustness. This estimator is particularly well-suited for high-frequency financ…
▽ More
Time-varying volatility is an inherent feature of most economic time-series, which causes standard correlation estimators to be inconsistent. The quadrant correlation estimator is consistent but very inefficient. We propose a novel subsampled quadrant estimator that improves efficiency while preserving consistency and robustness. This estimator is particularly well-suited for high-frequency financial data and we apply it to a large panel of US stocks. Our empirical analysis sheds new light on intra-day fluctuations in market betas by decomposing them into time-varying correlations and relative volatility changes. Our results show that intraday variation in betas is primarily driven by intraday variation in correlations.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Characterizing Correlation Matrices that Admit a Clustered Factor Representation
Authors:
Chen Tong,
Peter Reinhard Hansen
Abstract:
The Clustered Factor (CF) model induces a block structure on the correlation matrix and is commonly used to parameterize correlation matrices. Our results reveal that the CF model imposes superfluous restrictions on the correlation matrix. This can be avoided by a different parametrization, involving the logarithmic transformation of the block correlation matrix.
The Clustered Factor (CF) model induces a block structure on the correlation matrix and is commonly used to parameterize correlation matrices. Our results reveal that the CF model imposes superfluous restrictions on the correlation matrix. This can be avoided by a different parametrization, involving the logarithmic transformation of the block correlation matrix.
△ Less
Submitted 10 August, 2023;
originally announced August 2023.
-
A New Method for Generating Random Correlation Matrices
Authors:
Ilya Archakov,
Peter Reinhard Hansen,
Yiyao Luo
Abstract:
We propose a new method for generating random correlation matrices that makes it simple to control both location and dispersion. The method is based on a vector parameterization, gamma = g(C), which maps any distribution on R^d, d = n(n-1)/2 to a distribution on the space of non-singular nxn correlation matrices. Correlation matrices with certain properties, such as being well-conditioned, having…
▽ More
We propose a new method for generating random correlation matrices that makes it simple to control both location and dispersion. The method is based on a vector parameterization, gamma = g(C), which maps any distribution on R^d, d = n(n-1)/2 to a distribution on the space of non-singular nxn correlation matrices. Correlation matrices with certain properties, such as being well-conditioned, having block structures, and having strictly positive elements, are simple to generate. We compare the new method with existing methods.
△ Less
Submitted 14 October, 2022;
originally announced October 2022.
-
Horseshoe priors for edge-preserving linear Bayesian inversion
Authors:
Felipe Uribe,
Yiqiu Dong,
Per Christian Hansen
Abstract:
In many large-scale inverse problems, such as computed tomography and image deblurring, characterization of sharp edges in the solution is desired. Within the Bayesian approach to inverse problems, edge-preservation is often achieved using Markov random field priors based on heavy-tailed distributions. Another strategy, popular in statistics, is the application of hierarchical shrinkage priors. An…
▽ More
In many large-scale inverse problems, such as computed tomography and image deblurring, characterization of sharp edges in the solution is desired. Within the Bayesian approach to inverse problems, edge-preservation is often achieved using Markov random field priors based on heavy-tailed distributions. Another strategy, popular in statistics, is the application of hierarchical shrinkage priors. An advantage of this formulation lies in expressing the prior as a conditionally Gaussian distribution depending of global and local hyperparameters which are endowed with heavy-tailed hyperpriors. In this work, we revisit the shrinkage horseshoe prior and introduce its formulation for edge-preserving settings. We discuss a sampling framework based on the Gibbs sampler to solve the resulting hierarchical formulation of the Bayesian inverse problem. In particular, one of the conditional distributions is high-dimensional Gaussian, and the rest are derived in closed form by using a scale mixture representation of the heavy-tailed hyperpriors. Applications from imaging science show that our computational procedure is able to compute sharp edge-preserving posterior point estimates with reduced uncertainty.
△ Less
Submitted 19 July, 2022;
originally announced July 2022.
-
Relative Contagiousness of Emerging Virus Variants: An Analysis of the Alpha, Delta, and Omicron SARS-CoV-2 Variants
Authors:
Peter Reinhard Hansen
Abstract:
We propose a simple dynamic model for estimating the relative contagiousness of two virus variants. Maximum likelihood estimation and inference is conveniently invariant to variation in the total number of cases over the sample period and can be expressed as a logistic regression. We apply the model to Danish SARS-CoV-2 variant data. We estimate the reproduction numbers of Alpha and Delta to be la…
▽ More
We propose a simple dynamic model for estimating the relative contagiousness of two virus variants. Maximum likelihood estimation and inference is conveniently invariant to variation in the total number of cases over the sample period and can be expressed as a logistic regression. We apply the model to Danish SARS-CoV-2 variant data. We estimate the reproduction numbers of Alpha and Delta to be larger than that of the ancestral variant by a factor of 1.51 [CI 95%: 1.50, 1.53] and 3.28 [CI 95%: 3.01, 3.58], respectively. In a predominately vaccinated population, we estimate Omicron to be 3.15 [CI 95%: 2.83, 3.50] times more infectious than Delta. Forecasting the proportion of an emerging virus variant is straight forward and we proceed to show how the effective reproduction number for a new variant can be estimated without contemporary sequencing results. This is useful for assessing the state of the pandemic in real time as we illustrate empirically with the inferred effective reproduction number for the Alpha variant.
△ Less
Submitted 20 January, 2022; v1 submitted 1 October, 2021;
originally announced October 2021.
-
Diffusion Means in Geometric Spaces
Authors:
Benjamin Eltzner,
Pernille Hansen,
Stephan F. Huckemann,
Stefan Sommer
Abstract:
We introduce a location statistic for distributions on non-linear geometric spaces, the diffusion mean, serving as an extension and an alternative to the Fréchet mean. The diffusion mean arises as the generalization of Gaussian maximum likelihood analysis to non-linear spaces by maximizing the likelihood of a Brownian motion. The diffusion mean depends on a time parameter $t$, which admits the int…
▽ More
We introduce a location statistic for distributions on non-linear geometric spaces, the diffusion mean, serving as an extension and an alternative to the Fréchet mean. The diffusion mean arises as the generalization of Gaussian maximum likelihood analysis to non-linear spaces by maximizing the likelihood of a Brownian motion. The diffusion mean depends on a time parameter $t$, which admits the interpretation of the allowed variance of the diffusion. The diffusion $t$-mean of a distribution $X$ is the most likely origin of a Brownian motion at time $t$, given the end-point distribution $X$. We give a detailed description of the asymptotic behavior of the diffusion estimator and provide sufficient conditions for the diffusion estimator to be strongly consistent. Particularly, we present a smeary central limit theorem for diffusion means and we show that joint estimation of the mean and diffusion variance rules out smeariness in all directions simultaneously in general situations. Furthermore, we investigate properties of the diffusion mean for distributions on the sphere $\mathbb S^n$. Experimentally, we consider simulated data and data from magnetic pole reversals, all indicating similar or improved convergence rate compared to the Fréchet mean. Here, we additionally estimate $t$ and consider its effects on smeariness and uniqueness of the diffusion mean for distributions on the sphere.
△ Less
Submitted 4 December, 2022; v1 submitted 25 May, 2021;
originally announced May 2021.
-
A hybrid Gibbs sampler for edge-preserving tomographic reconstruction with uncertain view angles
Authors:
Felipe Uribe,
Johnathan M. Bardsley,
Yiqiu Dong,
Per Christian Hansen,
Nicolai A. B. Riis
Abstract:
In computed tomography, data consist of measurements of the attenuation of X-rays passing through an object. The goal is to reconstruct the linear attenuation coefficient of the object's interior. For each position of the X-ray source, characterized by its angle with respect to a fixed coordinate system, one measures a set of data referred to as a view. A common assumption is that these view angle…
▽ More
In computed tomography, data consist of measurements of the attenuation of X-rays passing through an object. The goal is to reconstruct the linear attenuation coefficient of the object's interior. For each position of the X-ray source, characterized by its angle with respect to a fixed coordinate system, one measures a set of data referred to as a view. A common assumption is that these view angles are known, but in some applications they are known with imprecision. We propose a framework to solve a Bayesian inverse problem that jointly estimates the view angles and an image of the object's attenuation coefficient. We also include a few hyperparameters that characterize the likelihood and the priors. Our approach is based on a Gibbs sampler where the associated conditional densities are simulated using different sampling schemes - hence the term hybrid. In particular, the conditional distribution associated with the reconstruction is nonlinear in the image pixels, non-Gaussian and high-dimensional. We approach this distribution by constructing a Laplace approximation that represents the target conditional locally at each Gibbs iteration. This enables sampling of the attenuation coefficients in an efficient manner using iterative reconstruction algorithms. The numerical results show that our algorithm is able to jointly identify the image and the view angles, while also providing uncertainty estimates of both. We demonstrate our method with 2D X-ray computed tomography problems using fan beam configurations.
△ Less
Submitted 14 April, 2021;
originally announced April 2021.
-
Diffusion Means and Heat Kernel on Manifolds
Authors:
Pernille Hansen,
Benjamin Eltzner,
Stefan Sommer
Abstract:
We introduce diffusion means as location statistics on manifold data spaces. A diffusion mean is defined as the starting point of an isotropic diffusion with a given diffusivity. They can therefore be defined on all spaces on which a Brownian motion can be defined and numerical calculation of sample diffusion means is possible on a variety of spaces using the heat kernel expansion. We present seve…
▽ More
We introduce diffusion means as location statistics on manifold data spaces. A diffusion mean is defined as the starting point of an isotropic diffusion with a given diffusivity. They can therefore be defined on all spaces on which a Brownian motion can be defined and numerical calculation of sample diffusion means is possible on a variety of spaces using the heat kernel expansion. We present several classes of spaces, for which the heat kernel is known and sample diffusion means can therefore be calculated. As an example, we investigate a classic data set from directional statistics, for which the sample Fréchet mean exhibits finite sample smeariness.
△ Less
Submitted 28 February, 2021;
originally announced March 2021.
-
Currents and K-functions for Fiber Point Processes
Authors:
Pernille EH. Hansen,
Rasmus Waagepetersen,
Anne Marie Svane,
Jon Sporring,
Hans JT. Stephensen,
Stine Hasselholt,
Stefan Sommer
Abstract:
Analysis of images of sets of fibers such as myelin sheaths or skeletal muscles must account for both the spatial distribution of fibers and differences in fiber shape. This necessitates a combination of point process and shape analysis methodology. In this paper, we develop a K-function for shape-valued point processes by embedding shapes as currents, thus equipping the point process domain with…
▽ More
Analysis of images of sets of fibers such as myelin sheaths or skeletal muscles must account for both the spatial distribution of fibers and differences in fiber shape. This necessitates a combination of point process and shape analysis methodology. In this paper, we develop a K-function for shape-valued point processes by embedding shapes as currents, thus equipping the point process domain with metric structure inherited from a reproducing kernel Hilbert space. We extend Ripley's K-function which measures deviations from spatial homogeneity of point processes to fiber data. The paper provides a theoretical account of the statistical foundation of the K-function and its extension to fiber data, and we test the developed K-function on simulated as well as real data sets. This includes a fiber data set consisting of myelin sheaths, visualizing the spatial and fiber shape behavior of myelin configurations at different debts.
△ Less
Submitted 10 February, 2021;
originally announced February 2021.
-
A Canonical Representation of Block Matrices with Applications to Covariance and Correlation Matrices
Authors:
Ilya Archakov,
Peter Reinhard Hansen
Abstract:
We obtain a canonical representation for block matrices. The representation facilitates simple computation of the determinant, the matrix inverse, and other powers of a block matrix, as well as the matrix logarithm and the matrix exponential. These results are particularly useful for block covariance and block correlation matrices, where evaluation of the Gaussian log-likelihood and estimation are…
▽ More
We obtain a canonical representation for block matrices. The representation facilitates simple computation of the determinant, the matrix inverse, and other powers of a block matrix, as well as the matrix logarithm and the matrix exponential. These results are particularly useful for block covariance and block correlation matrices, where evaluation of the Gaussian log-likelihood and estimation are greatly simplified. We illustrate this with an empirical application using a large panel of daily asset returns. Moreover, the representation paves new ways to regularizing large covariance/correlation matrices, test block structures in matrices, and estimate regressions with many variables.
△ Less
Submitted 15 November, 2021; v1 submitted 4 December, 2020;
originally announced December 2020.
-
A New Parametrization of Correlation Matrices
Authors:
Ilya Archakov,
Peter Reinhard Hansen
Abstract:
We introduce a novel parametrization of the correlation matrix. The reparametrization facilitates modeling of correlation and covariance matrices by an unrestricted vector, where positive definiteness is an innate property. This parametrization can be viewed as a generalization of Fisther's Z-transformation to higher dimensions and has a wide range of potential applications. An algorithm for recon…
▽ More
We introduce a novel parametrization of the correlation matrix. The reparametrization facilitates modeling of correlation and covariance matrices by an unrestricted vector, where positive definiteness is an innate property. This parametrization can be viewed as a generalization of Fisther's Z-transformation to higher dimensions and has a wide range of potential applications. An algorithm for reconstructing the unique n x n correlation matrix from any d-dimensional vector (with d = n(n-1)/2) is provided, and we derive its numerical complexity.
△ Less
Submitted 3 December, 2020;
originally announced December 2020.
-
FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning
Authors:
Paul N. Whatmough,
Chuteng Zhou,
Patrick Hansen,
Shreyas Kolala Venkataramanaiah,
Jae-sun Seo,
Matthew Mattina
Abstract:
The computational demands of computer vision tasks based on state-of-the-art Convolutional Neural Network (CNN) image classification far exceed the energy budgets of mobile devices. This paper proposes FixyNN, which consists of a fixed-weight feature extractor that generates ubiquitous CNN features, and a conventional programmable CNN accelerator which processes a dataset-specific CNN. Image class…
▽ More
The computational demands of computer vision tasks based on state-of-the-art Convolutional Neural Network (CNN) image classification far exceed the energy budgets of mobile devices. This paper proposes FixyNN, which consists of a fixed-weight feature extractor that generates ubiquitous CNN features, and a conventional programmable CNN accelerator which processes a dataset-specific CNN. Image classification models for FixyNN are trained end-to-end via transfer learning, with the common feature extractor representing the transfered part, and the programmable part being learnt on the target dataset. Experimental results demonstrate FixyNN hardware can achieve very high energy efficiencies up to 26.6 TOPS/W ($4.81 \times$ better than iso-area programmable accelerator). Over a suite of six datasets we trained models via transfer learning with an accuracy loss of $<1\%$ resulting in up to 11.2 TOPS/W - nearly $2 \times$ more efficient than a conventional programmable CNN accelerator of the same area.
△ Less
Submitted 26 February, 2019;
originally announced February 2019.
-
Energy Efficient Hardware for On-Device CNN Inference via Transfer Learning
Authors:
Paul Whatmough,
Chuteng Zhou,
Patrick Hansen,
Matthew Mattina
Abstract:
On-device CNN inference for real-time computer vision applications can result in computational demands that far exceed the energy budgets of mobile devices. This paper proposes FixyNN, a co-designed hardware accelerator platform which splits a CNN model into two parts: a set of layers that are fixed in the hardware platform as a front-end fixed-weight feature extractor, and the remaining layers wh…
▽ More
On-device CNN inference for real-time computer vision applications can result in computational demands that far exceed the energy budgets of mobile devices. This paper proposes FixyNN, a co-designed hardware accelerator platform which splits a CNN model into two parts: a set of layers that are fixed in the hardware platform as a front-end fixed-weight feature extractor, and the remaining layers which become a back-end classifier running on a conventional programmable CNN accelerator. The common front-end provides ubiquitous CNN features for all FixyNN models, while the back-end is programmable and specific to a given dataset. Image classification models for FixyNN are trained end-to-end via transfer learning, with front-end layers fixed for the shared feature extractor, and back-end layers fine-tuned for a specific task. Over a suite of six datasets, we trained models via transfer learning with an accuracy loss of <1%, resulting in a FixyNN hardware platform with nearly 2 times better energy efficiency than a conventional programmable CNN accelerator of the same silicon area (i.e. hardware cost).
△ Less
Submitted 26 February, 2019; v1 submitted 4 December, 2018;
originally announced December 2018.
-
Nonlinear Principal Components and Long-run Implications of Multivariate Diffusions
Authors:
Xioahong Chen,
Lars Peter Hansen,
Jose Scheinkman
Abstract:
We investigate a method for extracting nonlinear principal components (NPCs). These NPCs maximize variation subject to smoothness and orthogonality constraints; but we allow for a general class of constraints and multivariate probability densities, including densities without compact support and even densities with algebraic tails. We provide primitive sufficient conditions for the existence of…
▽ More
We investigate a method for extracting nonlinear principal components (NPCs). These NPCs maximize variation subject to smoothness and orthogonality constraints; but we allow for a general class of constraints and multivariate probability densities, including densities without compact support and even densities with algebraic tails. We provide primitive sufficient conditions for the existence of these NPCs. By exploiting the theory of continuous-time, reversible Markov diffusion processes, we give a different interpretation of these NPCs and the smoothness constraints. When the diffusion matrix is used to enforce smoothness, the NPCs maximize long-run variation relative to the overall variation subject to orthogonality constraints. Moreover, the NPCs behave as scalar autoregressions with heteroskedastic innovations; this supports semiparametric identification and estimation of a multivariate reversible diffusion process and tests of the overidentifying restrictions implied by such a process from low frequency data. We also explore implications for stationary, possibly non-reversible diffusion processes. Finally, we suggest a sieve method to estimate the NPCs from discretely-sampled data.
△ Less
Submitted 4 August, 2009;
originally announced August 2009.