Search | arXiv e-print repository

Data Assimilation for Sign-indefinite Priors: A generalization of Sinkhorn's algorithm

Authors: Anqi Dong, Tryphon T. Georgiou, Allen Tannenbaum

Abstract: The purpose of this work is to develop a framework to calibrate signed datasets so as to be consistent with specified marginals by suitably extending the Schrödinger-Fortet-Sinkhorn paradigm. Specifically, we seek to revise sign-indefinite multi-dimensional arrays in a way that the updated values agree with specified marginals. Our approach follows the rationale in Schrödinger's problem, aimed at… ▽ More The purpose of this work is to develop a framework to calibrate signed datasets so as to be consistent with specified marginals by suitably extending the Schrödinger-Fortet-Sinkhorn paradigm. Specifically, we seek to revise sign-indefinite multi-dimensional arrays in a way that the updated values agree with specified marginals. Our approach follows the rationale in Schrödinger's problem, aimed at updating a "prior" probability measure to agree with marginal distributions. The celebrated Sinkhorn's algorithm (established earlier by R.\ Fortet) that solves Schrödinger's problem found early applications in calibrating contingency tables in statistics and, more recently, multi-marginal problems in machine learning and optimal transport. Herein, we postulate a sign-indefinite prior in the form of a multi-dimensional array, and propose an optimization problem to suitably update this prior to ensure consistency with given marginals. The resulting algorithm generalizes the Sinkhorn algorithm in that it amounts to iterative scaling of the entries of the array along different coordinate directions. The scaling is multiplicative but also, in contrast to Sinkhorn, inverse-multiplicative depending on the sign of the entries. Our algorithm reduces to the classical Sinkhorn algorithm when the entries of the prior are positive. △ Less

Submitted 22 August, 2023; originally announced August 2023.

Comments: 11 pages, 4 figures

MSC Class: 49M29; 90C25

arXiv:2104.04005 [pdf, other]

The Challenge of Small Data: Dynamic Mode Decomposition, Redux

Authors: Amirhossein Karimi, Tryphon T. Georgiou

Abstract: We revisit the setting and the assumptions that underlie the methodology of Dynamic Mode Decomposition (DMD) in order to highlight caveats as well as potential measures of when the applicability is warranted. We revisit the setting and the assumptions that underlie the methodology of Dynamic Mode Decomposition (DMD) in order to highlight caveats as well as potential measures of when the applicability is warranted. △ Less

Submitted 3 May, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

arXiv:2005.09152 [pdf, other]

Lasso formulation of the shortest path problem

Authors: Anqi Dong, Amirhossein Taghvaei, Tryphon T. Georgiou

Abstract: The shortest path problem is formulated as an $l_1$-regularized regression problem, known as lasso. Based on this formulation, a connection is established between Dijkstra's shortest path algorithm and the least angle regression (LARS) for the lasso problem. Specifically, the solution path of the lasso problem, obtained by varying the regularization parameter from infinity to zero (the regularizat… ▽ More The shortest path problem is formulated as an $l_1$-regularized regression problem, known as lasso. Based on this formulation, a connection is established between Dijkstra's shortest path algorithm and the least angle regression (LARS) for the lasso problem. Specifically, the solution path of the lasso problem, obtained by varying the regularization parameter from infinity to zero (the regularization path), corresponds to shortest path trees that appear in the bi-directional Dijkstra algorithm. Although Dijkstra's algorithm and the LARS formulation provide exact solutions, they become impractical when the size of the graph is exceedingly large. To overcome this issue, the alternating direction method of multipliers (ADMM) is proposed to solve the lasso formulation. The resulting algorithm produces good and fast approximations of the shortest path by sacrificing exactness that may not be absolutely essential in many applications. Numerical experiments are provided to illustrate the performance of the proposed approach. △ Less

Submitted 22 May, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

Comments: 17 pages

MSC Class: 05C38 (Primary) 62J07; 68R10; 90C25; 90C06(Secondary)

arXiv:1904.06762 [pdf, other]

Probabilistic Kernel Support Vector Machines

Authors: Yongxin Chen, Tryphon T. Georgiou, Allen R. Tannenbaum

Abstract: We propose a probabilistic enhancement of standard kernel Support Vector Machines for binary classification, in order to address the case when, along with given data sets, a description of uncertainty (e.g., error bounds) may be available on each datum. In the present paper, we specifically consider Gaussian distributions to model uncertainty. Thereby, our data consist of pairs $(x_i,Σ_i)$,… ▽ More We propose a probabilistic enhancement of standard kernel Support Vector Machines for binary classification, in order to address the case when, along with given data sets, a description of uncertainty (e.g., error bounds) may be available on each datum. In the present paper, we specifically consider Gaussian distributions to model uncertainty. Thereby, our data consist of pairs $(x_i,Σ_i)$, $i\in\{1,\ldots,N\}$, along with an indicator $y_i\in\{-1,1\}$ to declare membership in one of two categories for each pair. These pairs may be viewed to represent the mean and covariance, respectively, of random vectors $ξ_i$ taking values in a suitable linear space (typically $\mathbb R^n$). Thus, our setting may also be viewed as a modification of Support Vector Machines to classify distributions, albeit, at present, only Gaussian ones. We outline the formalism that allows computing suitable classifiers via a natural modification of the standard "kernel trick." The main contribution of this work is to point out a suitable kernel function for applying Support Vector techniques to the setting of uncertain data for which a detailed uncertainty description is also available (herein, "Gaussian points"). △ Less

Submitted 18 March, 2020; v1 submitted 14 April, 2019; originally announced April 2019.

Comments: 6 pages, 6 figures

MSC Class: 62G05; 93A30

Showing 1–4 of 4 results for author: Georgiou, T T