-
Consistent causal discovery with equal error variances: a least-squares perspective
Authors:
Anamitra Chaudhuri,
Yang Ni,
Anirban Bhattacharya
Abstract:
We consider the problem of recovering the true causal structure among a set of variables, generated by a linear acyclic structural equation model (SEM) with the error terms being independent and having equal variances. It is well-known that the true underlying directed acyclic graph (DAG) encoding the causal structure is uniquely identifiable under this assumption. In this work, we establish that the sum of minimum expected squared errors for every variable, while predicted by the best linear combination of its parent variables, is minimised if and only if the causal structure is represented by any supergraph of the true DAG. This property is further utilised to design a Bayesian DAG selection method that recovers the true graph consistently.
Submitted 18 September, 2025;
originally announced September 2025.
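The least-squares property stated in this abstract can be checked numerically on a toy model. The sketch below is illustrative only and is not the authors' method: it assumes a three-variable chain X1 -> X2 -> X3 with hypothetical coefficients 0.8 and -0.5 and unit error variance, it works with the population covariance rather than with data, and the helper names `sem_covariance` and `ls_score` are invented here.

```python
import numpy as np

def sem_covariance(B, sigma2):
    """Covariance of X = B X + e with independent errors of variance sigma2."""
    p = B.shape[0]
    A = np.linalg.inv(np.eye(p) - B)
    return sigma2 * A @ A.T

def ls_score(Sigma, parents):
    """Sum over nodes of the minimum expected squared error when each node
    is predicted by the best linear combination of its assigned parents."""
    total = 0.0
    for j, pa in parents.items():
        pa = list(pa)
        if pa:
            S_pp = Sigma[np.ix_(pa, pa)]
            S_pj = Sigma[pa, j]
            total += Sigma[j, j] - S_pj @ np.linalg.solve(S_pp, S_pj)
        else:
            total += Sigma[j, j]
    return total

# Hypothetical true model: X1 -> X2 -> X3 (0-based indices 0, 1, 2).
B = np.array([[0.0, 0.0, 0.0],
              [0.8, 0.0, 0.0],
              [0.0, -0.5, 0.0]])
Sigma = sem_covariance(B, sigma2=1.0)

score_true = ls_score(Sigma, {0: [], 1: [0], 2: [1]})         # true DAG
score_super = ls_score(Sigma, {0: [], 1: [0], 2: [0, 1]})     # supergraph (adds X1 -> X3)
score_reversed = ls_score(Sigma, {0: [1], 1: [2], 2: []})     # reversed chain
score_empty = ls_score(Sigma, {0: [], 1: [], 2: []})          # no edges

print(score_true, score_super, score_reversed, score_empty)
```

In this toy setting the true DAG and its supergraph both attain the value p times the error variance, while the reversed chain and the empty graph score strictly higher, which is consistent with the "if and only if" statement in the abstract.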
-
Consistent DAG selection for Bayesian causal discovery under general error distributions
Authors:
Anamitra Chaudhuri,
Anirban Bhattacharya,
Yang Ni
Abstract:
We consider the problem of learning the underlying causal structure among a set of variables, which are assumed to follow a Bayesian network or, more specifically, a linear recursive structural equation model (SEM) with the associated errors being independent and allowed to be non-Gaussian. A Bayesian hierarchical model is proposed to identify the true data-generating directed acyclic graph (DAG) structure where the nodes and edges represent the variables and the direct causal effects, respectively. Moreover, incorporating the information of non-Gaussian errors, we characterize the distribution equivalence class of the true DAG, which specifies the best possible extent to which the DAG can be identified based on purely observational data. Furthermore, under the consideration that the errors are distributed as some scale mixture of Gaussian, where the mixing distribution is unspecified, and mild distributional assumptions, we establish that by employing a non-standard DAG prior, the posterior probability of the distribution equivalence class of the true DAG converges to unity as the sample size grows. This shows that the proposed method achieves the posterior DAG selection consistency, which is further illustrated with examples and simulation studies.
Submitted 1 August, 2025;
originally announced August 2025.
-
Graphs With Polarities
Authors:
John C. Baez,
Adittya Chaudhuri
Abstract:
In fields ranging from business to systems biology, directed graphs with edges labeled by signs are used to model systems in a simple way: the nodes represent entities of some sort, and an edge indicates that one entity directly affects another either positively or negatively. Multiplying the signs along a directed path of edges lets us determine indirect positive or negative effects, and if the path is a loop we call this a positive or negative feedback loop. Here we generalize this to graphs with edges labeled by a monoid, whose elements represent "polarities" possibly more general than simply "positive" or "negative". We study three notions of morphism between graphs with labeled edges, each with its own distinctive application: to refine a simple graph into a complicated one, to transform a complicated graph into a simple one, and to find recurring patterns called "motifs". We construct three corresponding symmetric monoidal double categories of "open" graphs. We study feedback loops using a generalization of the homology of a graph to homology with coefficients in a commutative monoid. In particular, we describe the emergence of new feedback loops when we compose open graphs using a variant of the Mayer-Vietoris exact sequence for homology with coefficients in a commutative monoid.
Submitted 29 June, 2025;
originally announced June 2025.
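The idea of multiplying edge labels along a path, described in this abstract, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the paper's construction: the graph `EDGES`, the node names, and the function `path_polarity` are hypothetical, and the monoid is passed in as a `multiply` function with an `identity` element (defaulting to the sign monoid {+1, -1} under multiplication).

```python
from functools import reduce

# Hypothetical signed graph: edge (u, v) is labeled +1 (promotes) or -1 (inhibits).
EDGES = {
    ("a", "b"): +1,
    ("b", "c"): -1,
    ("c", "a"): -1,
    ("c", "d"): +1,
}

def path_polarity(path, edges, multiply=lambda x, y: x * y, identity=1):
    """Combine the labels along a directed path using the monoid operation."""
    labels = [edges[(u, v)] for u, v in zip(path, path[1:])]
    return reduce(multiply, labels, identity)

# The loop a -> b -> c -> a has labels (+1, -1, -1): a positive feedback loop.
loop = path_polarity(["a", "b", "c", "a"], EDGES)

# The path a -> b -> c -> d has labels (+1, -1, +1): an indirect negative effect.
indirect = path_polarity(["a", "b", "c", "d"], EDGES)

print(loop, indirect)
```

Passing a different `multiply` and `identity` swaps in another monoid of polarities without changing the traversal code.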
-
On gauge theory and parallel transport in principal 2-bundles over Lie groupoids
Authors:
Adittya Chaudhuri
Abstract:
We investigate an interplay between some ideas in traditional gauge theory and certain concepts in fibered categories. We accomplish this by introducing a notion of a principal Lie 2-group bundle over a Lie groupoid and studying its connection structures, gauge transformations, and parallel transport.
We obtain a Lie 2-group torsor version of the one-one correspondence between fibered categories and pseudofunctors. This results in a classification of our principal 2-bundles based on their underlying fibration structures. This allows us to extend a class of our principal 2-bundles to be defined over differentiable stacks presented by the base Lie groupoids. We construct a short exact sequence of VB-groupoids, namely, the 'Atiyah sequence' associated to our principal 2-bundles. Splitting and splitting up to a natural isomorphism of our Atiyah sequence, respectively, gives us notions of 'strict connections' and 'semi-strict connections' on our principal 2-bundles. We describe such connections in terms of Lie 2-algebra valued 1-forms on the total Lie groupoids. The underlying fibration structure of our 2-bundle provides an existence criterion for strict and semi-strict connections. We study the action of the 2-group of gauge transformations on the groupoid of strict and semi-strict connections, and interestingly, we observe an extended symmetry of semi-strict connections. We demonstrate an interrelationship between the 'differential geometric connection-induced horizontal path lifting property in traditional principal bundles' and the 'category theoretic cartesian lifting of morphisms in fibered categories' by developing a theory of connection-induced parallel transport along a particular class of Haefliger paths in the base Lie groupoid of our principal 2-bundles. Finally, we employ our results to introduce a notion of parallel transport along Haefliger paths in the setup of VB-groupoids.
Submitted 26 October, 2024;
originally announced November 2024.
-
A mathematical framework to study organising principles in graphical representations of biochemical processes
Authors:
Adittya Chaudhuri,
Ralf Köhl,
Olaf Wolkenhauer
Abstract:
Systems Biology Graphical Notation (SBGN) is a standardised notational system that visualises biochemical processes as networks. These visualizations lack a formal framework, so that the analysis of such networks through modelling and simulation is an entirely separate task, determined by a chosen modelling framework (e.g. differential equations, Petri nets, stochastic processes, graphs). A second research gap is the lack of a mathematical framework to compose network representations. The complexity of molecular and cellular processes forces experimental studies to focus on subsystems. To study the functioning of biological systems across levels of structural and functional organisation, we require tools to compose and organise networks with different levels of detail and abstraction.
We address these challenges by introducing a category-theoretic formalism for biochemical processes visualised using SBGN Process Description (SBGN-PD) language. Using the theory of structured cospans, we construct a symmetric monoidal double category and demonstrate its horizontal 1-morphisms as SBGN Process Descriptions. We obtain organisational principles such as 'compositionality' (building a large SBGN-PD from smaller ones) and 'zooming-out' (abstracting away details in biochemical processes) defined in category-theoretic terms. We also formally investigate how a particular portion of a biochemical network influences the remaining portion of the network and vice versa. Throughout the paper, we illustrate our findings using standard SBGN-PD examples.
Submitted 23 October, 2024;
originally announced October 2024.
-
Bayesian learning with Gaussian processes for low-dimensional representations of time-dependent nonlinear systems
Authors:
Shane A. McQuarrie,
Anirban Chaudhuri,
Karen E. Willcox,
Mengwu Guo
Abstract:
This work presents a data-driven method for learning low-dimensional time-dependent physics-based surrogate models whose predictions are endowed with uncertainty estimates. We use the operator inference approach to model reduction that poses the problem of learning low-dimensional model terms as a regression of state space data and corresponding time derivatives by minimizing the residual of reduced system equations. Standard operator inference models perform well with accurate training data that are dense in time, but producing stable and accurate models when the state data are noisy and/or sparse in time remains a challenge. Another challenge is the lack of uncertainty estimation for the predictions from the operator inference models. Our approach addresses these challenges by incorporating Gaussian process surrogates into the operator inference framework to (1) probabilistically describe uncertainties in the state predictions and (2) procure analytical time derivative estimates with quantified uncertainties. The formulation leads to a generalized least-squares regression and, ultimately, reduced-order models that are described probabilistically with a closed-form expression for the posterior distribution of the operators. The resulting probabilistic surrogate model propagates uncertainties from the observed state data to reduced-order predictions. We demonstrate the method is effective for constructing low-dimensional models of two nonlinear partial differential equations representing a compressible flow and a nonlinear diffusion-reaction process, as well as for estimating the parameters of a low-dimensional system of nonlinear ordinary differential equations representing compartmental models in epidemiology.
Submitted 17 March, 2025; v1 submitted 6 August, 2024;
originally announced August 2024.
-
Round Robin Active Sequential Change Detection for Dependent Multi-Channel Data
Authors:
Anamitra Chaudhuri,
Georgios Fellouris,
Ali Tajer
Abstract:
This paper considers the problem of sequentially detecting a change in the joint distribution of multiple data sources under a sampling constraint. Specifically, the channels or sources generate observations that are independent over time, but not necessarily independent at any given time instant. The sources follow an initial joint distribution, and at an unknown time instant, the joint distribution of an unknown subset of sources changes. Importantly, there is a hard constraint that only a fixed number of sources are allowed to be sampled at each time instant. The goal is to sequentially observe the sources according to the constraint, and stop sampling as quickly as possible after the change while controlling the false alarm rate below a user-specified level. The sources can be selected dynamically based on the already collected data, and thus, a policy for this problem consists of a joint sampling and change-detection rule. A non-randomized policy is studied, and an upper bound is established on its worst-case conditional expected detection delay with respect to both the change point and the observations from the affected sources before the change.
Submitted 24 March, 2024;
originally announced March 2024.
-
Detecting out-of-distribution text using topological features of transformer-based language models
Authors:
Andres Pollano,
Anupam Chaudhuri,
Anj Simmons
Abstract:
To safeguard machine learning systems that operate on textual data against out-of-distribution (OOD) inputs that could cause unpredictable behaviour, we explore the use of topological features of self-attention maps from transformer-based language models to detect when input text is out of distribution. Self-attention forms the core of transformer-based language models, dynamically assigning vectors to words based on context; thus, in theory, our methodology is applicable to any transformer-based language model with multihead self-attention. We evaluate our approach on BERT and compare it to a traditional OOD approach using CLS embeddings. Our results show that our approach outperforms CLS embeddings in distinguishing in-distribution samples from far-out-of-domain samples, but struggles with near or same-domain datasets.
Submitted 18 July, 2024; v1 submitted 21 November, 2023;
originally announced November 2023.
-
Parallel transport on a Lie 2-group bundle over a Lie groupoid along Haefliger paths
Authors:
Saikat Chatterjee,
Adittya Chaudhuri
Abstract:
We prove a Lie 2-group torsor version of the well-known one-one correspondence between fibered categories and pseudofunctors. Consequently, we obtain a weak version of the principal Lie group bundle over a Lie groupoid. The correspondence also enables us to extend a particular class of principal 2-bundles to be defined over differentiable stacks. We show that the differential geometric connection structures introduced in the authors' previous work combine nicely with the underlying fibration structure of a principal 2-bundle over a Lie groupoid. This interrelation allows us to derive a notion of parallel transport in the framework of principal 2-bundles over Lie groupoids along a particular class of Haefliger paths. The corresponding parallel transport functor is shown to be smooth. We apply our results to examine the parallel transport on an associated VB-groupoid.
Submitted 11 September, 2023;
originally announced September 2023.
-
Predictive Digital Twin for Optimizing Patient-Specific Radiotherapy Regimens under Uncertainty in High-Grade Gliomas
Authors:
Anirban Chaudhuri,
Graham Pash,
David A. Hormuth II,
Guillermo Lorenzo,
Michael Kapteyn,
Chengyue Wu,
Ernesto A. B. F. Lima,
Thomas E. Yankeelov,
Karen Willcox
Abstract:
We develop a methodology to create data-driven predictive digital twins for optimal risk-aware clinical decision-making. We illustrate the methodology as an enabler for an anticipatory personalized treatment that accounts for uncertainties in the underlying tumor biology in high-grade gliomas, where heterogeneity in the response to standard-of-care (SOC) radiotherapy contributes to sub-optimal patient outcomes. The digital twin is initialized through prior distributions derived from population-level clinical data in the literature for a mechanistic model's parameters. Then the digital twin is personalized using Bayesian model calibration for assimilating patient-specific magnetic resonance imaging data and used to propose optimal radiotherapy treatment regimens by solving a multi-objective risk-based optimization under uncertainty problem. The solution leads to a suite of patient-specific optimal radiotherapy treatment regimens exhibiting varying levels of trade-off between the two competing clinical objectives: (i) maximizing tumor control (characterized by minimizing the risk of tumor volume growth) and (ii) minimizing the toxicity from radiotherapy. The proposed digital twin framework is illustrated by generating an in silico cohort of 100 patients with high-grade glioma growth and response properties typically observed in the literature. For the same total radiation dose as the SOC, the personalized treatment regimens lead to a median increase in tumor time to progression of around six days. Alternatively, for the same level of tumor control as the SOC, the digital twin provides optimal treatment options that lead to a median reduction in radiation dose of 16.7% (10 Gy) compared to the SOC total dose of 60 Gy. The range of optimal solutions also provides options with increased doses for patients with aggressive cancer, where SOC does not lead to sufficient tumor control.
Submitted 23 August, 2023;
originally announced August 2023.
-
Joint Sequential Detection and Isolation for Dependent Data Streams
Authors:
Anamitra Chaudhuri,
Georgios Fellouris
Abstract:
The problem of joint sequential detection and isolation is considered in the context of multiple, not necessarily independent, data streams. A multiple testing framework is proposed, where each hypothesis corresponds to a different subset of data streams, the sample size is a stopping time of the observations, and the probabilities of four kinds of error are controlled below distinct, user-specified levels. Two of these errors reflect the detection component of the formulation, whereas the other two the isolation component. The optimal expected sample size is characterized to a first-order asymptotic approximation as the error probabilities go to 0. Different asymptotic regimes, expressing different prioritizations of the detection and isolation tasks, are considered. A novel, versatile family of testing procedures is proposed, in which two distinct, in general, statistics are computed for each hypothesis, one addressing the detection task and the other the isolation task. Tests in this family, of various computational complexities, are shown to be asymptotically optimal under different setups. The general theory is applied to the detection and isolation of anomalous, not necessarily independent, data streams, as well as to the detection and isolation of an unknown dependence structure.
Submitted 30 June, 2022;
originally announced July 2022.
-
A Cross Validation Framework for Signal Denoising with Applications to Trend Filtering, Dyadic CART and Beyond
Authors:
Anamitra Chaudhuri,
Sabyasachi Chatterjee
Abstract:
This paper formulates a general cross validation framework for signal denoising. The general framework is then applied to nonparametric regression methods such as Trend Filtering and Dyadic CART. The resulting cross validated versions are then shown to attain nearly the same rates of convergence as are known for the optimally tuned analogues. There did not exist any previous theoretical analyses of cross validated versions of Trend Filtering or Dyadic CART. To illustrate the generality of the framework we also propose and study cross validated versions of two fundamental estimators: lasso for high dimensional linear regression and singular value thresholding for matrix estimation. Our general framework is inspired by the ideas in Chatterjee and Jafarov (2015) and is potentially applicable to a wide range of estimation methods which use tuning parameters.
Submitted 3 May, 2023; v1 submitted 7 January, 2022;
originally announced January 2022.
-
Learning High-Dimensional Parametric Maps via Reduced Basis Adaptive Residual Networks
Authors:
Thomas O'Leary-Roseberry,
Xiaosong Du,
Anirban Chaudhuri,
Joaquim R. R. A. Martins,
Karen Willcox,
Omar Ghattas
Abstract:
We propose a scalable framework for the learning of high-dimensional parametric maps via adaptively constructed residual network (ResNet) maps between reduced bases of the inputs and outputs. When only a few training data are available, it is beneficial to have a compact parametrization in order to ameliorate the ill-posedness of the neural network training problem. By linearly restricting high-dimensional maps to informed reduced bases of the inputs, one can compress high-dimensional maps in a constructive way that can be used to detect appropriate basis ranks, equipped with rigorous error estimates. A scalable neural network learning framework is thus to learn the nonlinear compressed reduced basis mapping. Unlike the reduced basis construction, however, neural network constructions are not guaranteed to reduce errors by adding representation power, making it difficult to achieve good practical performance. Inspired by recent approximation theory that connects ResNets to sequential minimizing flows, we present an adaptive ResNet construction algorithm. This algorithm allows for depth-wise enrichment of the neural network approximation, in a manner that can achieve good practical performance by first training a shallow network and then adapting. We prove universal approximation of the associated neural network class for $L^2_ν$ functions on compact sets. Our overall framework allows for constructive means to detect appropriate breadth and depth, and related compact parametrizations of neural networks, significantly reducing the need for architectural hyperparameter tuning. Numerical experiments for parametric PDE problems and a 3D CFD wing design optimization parametric map demonstrate that the proposed methodology can achieve remarkably high accuracy for limited training data, and outperforms the other neural network strategies we compared against.
Submitted 15 November, 2022; v1 submitted 13 December, 2021;
originally announced December 2021.
-
Atiyah sequence and Gauge transformations of a principal $2$-bundle over a Lie groupoid
Authors:
Saikat Chatterjee,
Adittya Chaudhuri,
Praphulla Koushik
Abstract:
In this paper, a notion of a principal $2$-bundle over a Lie groupoid has been introduced. For such principal $2$-bundles, we produced a short exact sequence of VB-groupoids, namely, the Atiyah sequence. Two notions of connection structures viz. strict connections and semi-strict connections on a principal $2$-bundle arising respectively, from a retraction of the Atiyah sequence and a retraction up to a natural isomorphism have been introduced. We constructed a class of principal $\mathbb{G}=[G_1\rightrightarrows G_0]$-bundles and connections from a given principal $G_0$-bundle $E_0\rightarrow X_0$ over $[X_1\rightrightarrows X_0]$ with connection. An existence criterion for the connections on a principal $2$-bundle over a proper, étale Lie groupoid is proposed. The action of the $2$-group of gauge transformations on the category of strict and semi-strict connections has been studied. Finally we noted an extended symmetry of the category of semi-strict connections.
Submitted 4 August, 2021; v1 submitted 29 July, 2021;
originally announced July 2021.
-
Certifiable Risk-Based Engineering Design Optimization
Authors:
Anirban Chaudhuri,
Boris Kramer,
Matthew Norton,
Johannes O. Royset,
Karen Willcox
Abstract:
Reliable, risk-averse design of complex engineering systems with optimized performance requires dealing with uncertainties. A conventional approach is to add safety margins to a design that was obtained from deterministic optimization. Safer engineering designs require appropriate cost and constraint function definitions that capture the risk associated with unwanted system behavior in the presence of uncertainties. The paper proposes two notions of certifiability. The first is based on accounting for the magnitude of failure to ensure data-informed conservativeness. The second is the ability to provide optimization convergence guarantees by preserving convexity. Satisfying these notions leads to certifiable risk-based design optimization (CRiBDO). In the context of CRiBDO, risk measures based on superquantile (a.k.a. conditional value-at-risk) and buffered probability of failure are analyzed. CRiBDO is contrasted with reliability-based design optimization (RBDO), where uncertainties are accounted for via the probability of failure, through a structural and a thermal design problem. A reformulation of the short column structural design problem leading to a convex CRiBDO problem is presented. The CRiBDO formulations capture more information about the problem to assign the appropriate conservativeness, exhibit superior optimization convergence by preserving properties of underlying functions, and alleviate the adverse effects of choosing hard failure thresholds required in RBDO.
Submitted 13 July, 2021; v1 submitted 13 January, 2021;
originally announced January 2021.
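To make the superquantile (conditional value-at-risk) mentioned in this abstract concrete, the sketch below estimates it from samples. It is a generic illustration and not the CRiBDO formulation from the paper: it assumes the standard Rockafellar-Uryasev minimisation formula, CVaR_alpha = min over c of c + E[max(X - c, 0)] / (1 - alpha), and the function name `superquantile` and the sample values are hypothetical. For an empirical distribution the objective is convex and piecewise linear in c, so the minimum is attained at one of the sample points.

```python
import numpy as np

def superquantile(samples, alpha):
    """Sample-based superquantile (CVaR) at level alpha in (0, 1).

    Evaluates the Rockafellar-Uryasev objective at every sample point and
    returns the smallest value. Memory use is O(n^2), fine for a sketch.
    """
    x = np.asarray(samples, dtype=float)
    # excess[i] = mean of max(x - x[i], 0), i.e. E[(X - c)+] at c = x[i]
    excess = np.maximum(x[None, :] - x[:, None], 0.0).mean(axis=1)
    return float(np.min(x + excess / (1.0 - alpha)))

# Hypothetical losses 1, 2, ..., 10: the worst 20% are {9, 10}, whose mean is 9.5.
losses = np.arange(1.0, 11.0)
cvar_80 = superquantile(losses, 0.8)
print(cvar_80)
```

Unlike a probability of failure, this quantity depends on how large the tail losses are, not only on how often a threshold is exceeded, which is the property the abstract refers to as accounting for the magnitude of failure.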
-
Local topological recursion governs the enumeration of lattice points in $\overline{\mathcal M}_{g,n}$
Authors:
Anupam Chaudhuri,
Norman Do,
Ellena Moskovsky
Abstract:
The second author and Norbury initiated the enumeration of lattice points in the Deligne-Mumford compactifications of moduli spaces of curves. They showed that the enumeration may be expressed in terms of polynomials, whose top and bottom degree coefficients store psi-class intersection numbers and orbifold Euler characteristics of $\overline{\mathcal M}_{g,n}$, respectively. Furthermore, they ask whether the enumeration is governed by the topological recursion and whether the intermediate coefficients also store algebro-geometric information. In this paper, we prove that the enumeration does indeed satisfy the topological recursion, although with a modification to the initial spectral curve data. Thus, one can consider this to be one of the first known instances of a natural enumerative problem governed by the so-called local topological recursion. Combining the present work with the known relation between local topological recursion and cohomological field theory should uncover the geometric meaning of the intermediate coefficients of the aforementioned polynomials.
Submitted 17 June, 2019;
originally announced June 2019.
-
Generalisations of the Harer-Zagier recursion for 1-point functions
Authors:
Anupam Chaudhuri,
Norman Do
Abstract:
Harer and Zagier proved a recursion to enumerate gluings of a $2d$-gon that result in an orientable genus $g$ surface, in their work on Euler characteristics of moduli spaces of curves. Analogous results have been discovered for other enumerative problems, so it is natural to pose the following question: how large is the family of problems for which these so-called 1-point recursions exist?
In this paper, we prove the existence of 1-point recursions for a class of enumerative problems that have Schur function expansions. In particular, we recover the Harer-Zagier recursion, but our methodology also applies to the enumeration of dessins d'enfant, to Bousquet-Mélou-Schaeffer numbers, to monotone Hurwitz numbers, and more. On the other hand, we prove that there is no 1-point recursion that governs simple Hurwitz numbers. Our results are effective in the sense that one can explicitly compute particular instances of 1-point recursions, and we provide several examples. We conclude the paper with a brief discussion and a conjecture relating 1-point recursions to the theory of topological recursion.
Submitted 31 December, 2018;
originally announced December 2018.
-
The Trace Criterion for Kernel Bandwidth Selection for Support Vector Data Description
Authors:
Arin Chaudhuri,
Carol Sadek,
Deovrat Kakde,
Wenhao Hu,
Hansi Jiang,
Seunghyun Kong,
Yuewei Liao,
Sergiy Peredriy,
Haoyu Wang
Abstract:
Support vector data description (SVDD) is a popular anomaly detection technique. The SVDD classifier partitions the whole data space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, for which the Gaussian kernel is a common choice. The Gaussian kernel has a bandwidth parameter, and it is important to set the value of this parameter correctly for good results. A small bandwidth leads to overfitting such that the resulting SVDD classifier overestimates the number of anomalies, whereas a large bandwidth leads to underfitting and an inability to detect many anomalies. In this paper, we present a new unsupervised method for selecting the Gaussian kernel bandwidth. Our method exploits a low-rank representation of the kernel matrix to suggest a kernel bandwidth value. Our new technique is competitive with the current state of the art for low-dimensional data and performs extremely well for many classes of high-dimensional data. Because the mathematical formulation of SVDD is identical with the mathematical formulation of one-class support vector machines (OCSVM) when the Gaussian kernel is used, our method is equally applicable to Gaussian kernel bandwidth tuning for OCSVM.
Submitted 5 February, 2020; v1 submitted 15 November, 2018;
originally announced November 2018.
-
Much ado about Zero
Authors:
Asis Kumar Chaudhuri
Abstract:
A brief historical introduction for the enigmatic number Zero is given. The discussions are for popular consumption.
Submitted 7 June, 2016; v1 submitted 22 January, 2016;
originally announced January 2016.
-
Ultrametric Cantor sets and Origin of Anomalous Diffusion
Authors:
Dhurjati Prasad Datta,
Santanu Raut,
Anuja Roy Chaudhuri
Abstract:
The anomalous mean square fluctuations are shown to arise naturally from the ordinary diffusion equation interpreted scale invariantly in a formalism endowing real numbers with a nonarchimedean multiplicative structure. A variable $t$ approaching 0 linearly in the ordinary analysis is shown to enjoy instead a sublinear $t\log t^{-1}$ flow in the presence of this scale invariant structure. Diffusion on an ultrametric Cantor set is also generically subdiffusive with the above sublinear mean square deviation. The present study seems to offer a new interpretation of a possible emergence of complex patterns from an apparently simple system.
Submitted 13 August, 2010;
originally announced August 2010.