Search | arXiv e-print repository

Consistent causal discovery with equal error variances: a least-squares perspective

Authors: Anamitra Chaudhuri, Yang Ni, Anirban Bhattacharya

Abstract: We consider the problem of recovering the true causal structure among a set of variables, generated by a linear acyclic structural equation model (SEM) with the error terms being independent and having equal variances. It is well-known that the true underlying directed acyclic graph (DAG) encoding the causal structure is uniquely identifiable under this assumption. In this work, we establish that… ▽ More We consider the problem of recovering the true causal structure among a set of variables, generated by a linear acyclic structural equation model (SEM) with the error terms being independent and having equal variances. It is well-known that the true underlying directed acyclic graph (DAG) encoding the causal structure is uniquely identifiable under this assumption. In this work, we establish that the sum of minimum expected squared errors for every variable, while predicted by the best linear combination of its parent variables, is minimised if and only if the causal structure is represented by any supergraph of the true DAG. This property is further utilised to design a Bayesian DAG selection method that recovers the true graph consistently. △ Less

Submitted 18 September, 2025; originally announced September 2025.

MSC Class: 62H22; 62F15 (Primary) 62C10; 62E10 (Secondary)

arXiv:2508.00993 [pdf, ps, other]

Consistent DAG selection for Bayesian causal discovery under general error distributions

Authors: Anamitra Chaudhuri, Anirban Bhattacharya, Yang Ni

Abstract: We consider the problem of learning the underlying causal structure among a set of variables, which are assumed to follow a Bayesian network or, more specifically, a linear recursive structural equation model (SEM) with the associated errors being independent and allowed to be non-Gaussian. A Bayesian hierarchical model is proposed to identify the true data-generating directed acyclic graph (DAG)… ▽ More We consider the problem of learning the underlying causal structure among a set of variables, which are assumed to follow a Bayesian network or, more specifically, a linear recursive structural equation model (SEM) with the associated errors being independent and allowed to be non-Gaussian. A Bayesian hierarchical model is proposed to identify the true data-generating directed acyclic graph (DAG) structure where the nodes and edges represent the variables and the direct causal effects, respectively. Moreover, incorporating the information of non-Gaussian errors, we characterize the distribution equivalence class of the true DAG, which specifies the best possible extent to which the DAG can be identified based on purely observational data. Furthermore, under the consideration that the errors are distributed as some scale mixture of Gaussian, where the mixing distribution is unspecified, and mild distributional assumptions, we establish that by employing a non-standard DAG prior, the posterior probability of the distribution equivalence class of the true DAG converges to unity as the sample size grows. This shows that the proposed method achieves the posterior DAG selection consistency, which is further illustrated with examples and simulation studies. △ Less

Submitted 1 August, 2025; originally announced August 2025.

MSC Class: 62H22; 62F15 (Primary) 62C10; 62E10 (Secondary)

arXiv:2506.23375 [pdf, ps, other]

Graphs With Polarities

Authors: John C. Baez, Adittya Chaudhuri

Abstract: In fields ranging from business to systems biology, directed graphs with edges labeled by signs are used to model systems in a simple way: the nodes represent entities of some sort, and an edge indicates that one entity directly affects another either positively or negatively. Multiplying the signs along a directed path of edges lets us determine indirect positive or negative effects, and if the p… ▽ More In fields ranging from business to systems biology, directed graphs with edges labeled by signs are used to model systems in a simple way: the nodes represent entities of some sort, and an edge indicates that one entity directly affects another either positively or negatively. Multiplying the signs along a directed path of edges lets us determine indirect positive or negative effects, and if the path is a loop we call this a positive or negative feedback loop. Here we generalize this to graphs with edges labeled by a monoid, whose elements represent `polarities' possibly more general than simply "positive" or "negative". We study three notions of morphism between graphs with labeled edges, each with its own distinctive application: to refine a simple graph into a complicated one, to transform a complicated graph into a simple one, and to find recurring patterns called "motifs". We construct three corresponding symmetric monoidal double categories of "open" graphs. We study feedback loops using a generalization of the homology of a graph to homology with coefficients in a commutative monoid. In particular, we describe the emergence of new feedback loops when we compose open graphs using a variant of the Mayer-Vietoris exact sequence for homology with coefficients in a commutative monoid. △ Less

Submitted 29 June, 2025; originally announced June 2025.

Comments: 37 pages LaTeX with TikZ figures

arXiv:2411.00814 [pdf, other]

On gauge theory and parallel transport in principal 2-bundles over Lie groupoids

Authors: Adittya Chaudhuri

Abstract: We investigate an interplay between some ideas in traditional gauge theory and certain concepts in fibered categories. We accomplish this by introducing a notion of a principal Lie 2-group bundle over a Lie groupoid and studying its connection structures, gauge transformations, and parallel transport. We obtain a Lie 2-group torsor version of the one-one correspondence between fibered categories… ▽ More We investigate an interplay between some ideas in traditional gauge theory and certain concepts in fibered categories. We accomplish this by introducing a notion of a principal Lie 2-group bundle over a Lie groupoid and studying its connection structures, gauge transformations, and parallel transport. We obtain a Lie 2-group torsor version of the one-one correspondence between fibered categories and pseudofunctors. This results in a classification of our principal 2-bundles based on their underlying fibration structures. This allows us to extend a class of our principal 2-bundles to be defined over differentiable stacks presented by the base Lie groupoids. We construct a short exact sequence of VB-groupoids, namely, the 'Atiyah sequence' associated to our principal 2-bundles. Splitting and splitting up to a natural isomorphism of our Atiyah sequence, respectively, gives us notions of 'strict connections' and 'semi-strict connections' on our principal 2-bundles. We describe such connections in terms of Lie 2-algebra valued 1-forms on the total Lie groupoids. The underlying fibration structure of our 2-bundle provides an existence criterion for strict and semi-strict connections. We study the action of the 2-group of gauge transformations on the groupoid of strict and semi-strict connections, and interestingly, we observe an extended symmetry of semi-strict connections. We demonstrate an interrelationship between `differential geometric connection-induced horizontal path lifting property in traditional principal bundles' and the `category theoretic cartesian lifting of morphisms in fibered categories' by developing a theory of connection-induced parallel transport along a particular class of Haefliger paths in the base Lie groupoid of our principle 2-bundles. Finally, we employ our results to introduce a notion of parallel transport along Haefliger paths in the setup of VB-groupoids. △ Less

Submitted 26 October, 2024; originally announced November 2024.

Comments: PhD Thesis, 210 pages, Thesis defended on 15th February, 2024

MSC Class: 53C08; 22A22; 58H05

arXiv:2410.18024 [pdf, other]

A mathematical framework to study organising principles in graphical representations of biochemical processes

Authors: Adittya Chaudhuri, Ralf Köhl, Olaf Wolkenhauer

Abstract: Systems Biology Graphical Notation (SBGN) is a standardised notational system that visualises biochemical processes as networks. These visualizations lack a formal framework, so that the analysis of such networks through modelling and simulation is an entirely separate task, determined by a chosen modelling framework (e.g. differential equations, Petri nets, stochastic processes, graphs). A second… ▽ More Systems Biology Graphical Notation (SBGN) is a standardised notational system that visualises biochemical processes as networks. These visualizations lack a formal framework, so that the analysis of such networks through modelling and simulation is an entirely separate task, determined by a chosen modelling framework (e.g. differential equations, Petri nets, stochastic processes, graphs). A second research gap is the lack of a mathematical framework to compose network representations. The complexity of molecular and cellular processes forces experimental studies to focus on subsystems. To study the functioning of biological systems across levels of structural and functional organisation, we require tools to compose and organise networks with different levels of detail and abstraction. We address these challenges by introducing a category-theoretic formalism for biochemical processes visualised using SBGN Process Description (SBGN-PD) language. Using the theory of structured cospans, we construct a symmetric monoidal double category and demonstrate its horizontal 1-morphisms as SBGN Process Descriptions. We obtain organisational principles such as 'compositionality' (building a large SBGN-PD from smaller ones) and 'zooming-out' (abstracting away details in biochemical processes) defined in category-theoretic terms. We also formally investigate how a particular portion of a biochemical network influences the remaining portion of the network and vice versa. Throughout the paper, we illustrate our findings using standard SBGN-PD examples. △ Less

Submitted 23 October, 2024; originally announced October 2024.

MSC Class: 18B10; 92-10

arXiv:2408.03455 [pdf, other]

doi 10.1016/j.physd.2025.134572

Bayesian learning with Gaussian processes for low-dimensional representations of time-dependent nonlinear systems

Authors: Shane A. McQuarrie, Anirban Chaudhuri, Karen E. Willcox, Mengwu Guo

Abstract: This work presents a data-driven method for learning low-dimensional time-dependent physics-based surrogate models whose predictions are endowed with uncertainty estimates. We use the operator inference approach to model reduction that poses the problem of learning low-dimensional model terms as a regression of state space data and corresponding time derivatives by minimizing the residual of reduc… ▽ More This work presents a data-driven method for learning low-dimensional time-dependent physics-based surrogate models whose predictions are endowed with uncertainty estimates. We use the operator inference approach to model reduction that poses the problem of learning low-dimensional model terms as a regression of state space data and corresponding time derivatives by minimizing the residual of reduced system equations. Standard operator inference models perform well with accurate training data that are dense in time, but producing stable and accurate models when the state data are noisy and/or sparse in time remains a challenge. Another challenge is the lack of uncertainty estimation for the predictions from the operator inference models. Our approach addresses these challenges by incorporating Gaussian process surrogates into the operator inference framework to (1) probabilistically describe uncertainties in the state predictions and (2) procure analytical time derivative estimates with quantified uncertainties. The formulation leads to a generalized least-squares regression and, ultimately, reduced-order models that are described probabilistically with a closed-form expression for the posterior distribution of the operators. The resulting probabilistic surrogate model propagates uncertainties from the observed state data to reduced-order predictions. We demonstrate the method is effective for constructing low-dimensional models of two nonlinear partial differential equations representing a compressible flow and a nonlinear diffusion-reaction process, as well as for estimating the parameters of a low-dimensional system of nonlinear ordinary differential equations representing compartmental models in epidemiology. △ Less

Submitted 17 March, 2025; v1 submitted 6 August, 2024; originally announced August 2024.

Comments: https://github.com/Sandialabs/GP-BayesOpInf

MSC Class: 62F15; 60G15; 65C05; 35B30 ACM Class: G.3

Journal ref: Physica D: Nonlinear Phenomena, 475 (2025), 134572

arXiv:2403.16297 [pdf, other]

Round Robin Active Sequential Change Detection for Dependent Multi-Channel Data

Authors: Anamitra Chaudhuri, Georgios Fellouris, Ali Tajer

Abstract: This paper considers the problem of sequentially detecting a change in the joint distribution of multiple data sources under a sampling constraint. Specifically, the channels or sources generate observations that are independent over time, but not necessarily independent at any given time instant. The sources follow an initial joint distribution, and at an unknown time instant, the joint distribut… ▽ More This paper considers the problem of sequentially detecting a change in the joint distribution of multiple data sources under a sampling constraint. Specifically, the channels or sources generate observations that are independent over time, but not necessarily independent at any given time instant. The sources follow an initial joint distribution, and at an unknown time instant, the joint distribution of an unknown subset of sources changes. Importantly, there is a hard constraint that only a fixed number of sources are allowed to be sampled at each time instant. The goal is to sequentially observe the sources according to the constraint, and stop sampling as quickly as possible after the change while controlling the false alarm rate below a user-specified level. The sources can be selected dynamically based on the already collected data, and thus, a policy for this problem consists of a joint sampling and change-detection rule. A non-randomized policy is studied, and an upper bound is established on its worst-case conditional expected detection delay with respect to both the change point and the observations from the affected sources before the change. △ Less

Submitted 24 March, 2024; originally announced March 2024.

MSC Class: 62L10; 62L05 (Primary) 62P30 (Secondary)

arXiv:2311.13102 [pdf, other]

Detecting out-of-distribution text using topological features of transformer-based language models

Authors: Andres Pollano, Anupam Chaudhuri, Anj Simmons

Abstract: To safeguard machine learning systems that operate on textual data against out-of-distribution (OOD) inputs that could cause unpredictable behaviour, we explore the use of topological features of self-attention maps from transformer-based language models to detect when input text is out of distribution. Self-attention forms the core of transformer-based language models, dynamically assigning vecto… ▽ More To safeguard machine learning systems that operate on textual data against out-of-distribution (OOD) inputs that could cause unpredictable behaviour, we explore the use of topological features of self-attention maps from transformer-based language models to detect when input text is out of distribution. Self-attention forms the core of transformer-based language models, dynamically assigning vectors to words based on context, thus in theory our methodology is applicable to any transformer-based language model with multihead self-attention. We evaluate our approach on BERT and compare it to a traditional OOD approach using CLS embeddings. Our results show that our approach outperforms CLS embeddings in distinguishing in-distribution samples from far-out-of-domain samples, but struggles with near or same-domain datasets. △ Less

Submitted 18 July, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

Comments: 8 pages, 6 figures, 3 tables, to be published in proceedings of the IJCAI-2024 AISafety Workshop

arXiv:2309.05355 [pdf, ps, other]

Parallel transport on a Lie 2-group bundle over a Lie groupoid along Haefliger paths

Authors: Saikat Chatterjee, Adittya Chaudhuri

Abstract: We prove a Lie 2-group torsor version of the well-known one-one correspondence between fibered categories and pseudofunctors. Consequently, we obtain a weak version of the principal Lie group bundle over a Lie groupoid. The correspondence also enables us to extend a particular class of principal 2-bundles to be defined over differentiable stacks. We show that the differential geometric connection… ▽ More We prove a Lie 2-group torsor version of the well-known one-one correspondence between fibered categories and pseudofunctors. Consequently, we obtain a weak version of the principal Lie group bundle over a Lie groupoid. The correspondence also enables us to extend a particular class of principal 2-bundles to be defined over differentiable stacks. We show that the differential geometric connection structures introduced in the authors' previous work, combine nicely with the underlying fibration structure of a principal 2-bundle over a Lie groupoid. This interrelation allows us to derive a notion of parallel transport in the framework of principal 2-bundles over Lie groupoids along a particular class of Haefliger paths. The corresponding parallel transport functor is shown to be smooth. We apply our results to examine the parallel transport on an associated VB-groupoid. △ Less

Submitted 11 September, 2023; originally announced September 2023.

MSC Class: Primary 53C08; Secondary 22A22; 58H05

arXiv:2308.12429 [pdf, other]

doi 10.3389/frai.2023.1222612

Predictive Digital Twin for Optimizing Patient-Specific Radiotherapy Regimens under Uncertainty in High-Grade Gliomas

Authors: Anirban Chaudhuri, Graham Pash, David A. Hormuth II, Guillermo Lorenzo, Michael Kapteyn, Chengyue Wu, Ernesto A. B. F. Lima, Thomas E. Yankeelov, Karen Willcox

Abstract: We develop a methodology to create data-driven predictive digital twins for optimal risk-aware clinical decision-making. We illustrate the methodology as an enabler for an anticipatory personalized treatment that accounts for uncertainties in the underlying tumor biology in high-grade gliomas, where heterogeneity in the response to standard-of-care (SOC) radiotherapy contributes to sub-optimal pat… ▽ More We develop a methodology to create data-driven predictive digital twins for optimal risk-aware clinical decision-making. We illustrate the methodology as an enabler for an anticipatory personalized treatment that accounts for uncertainties in the underlying tumor biology in high-grade gliomas, where heterogeneity in the response to standard-of-care (SOC) radiotherapy contributes to sub-optimal patient outcomes. The digital twin is initialized through prior distributions derived from population-level clinical data in the literature for a mechanistic model's parameters. Then the digital twin is personalized using Bayesian model calibration for assimilating patient-specific magnetic resonance imaging data and used to propose optimal radiotherapy treatment regimens by solving a multi-objective risk-based optimization under uncertainty problem. The solution leads to a suite of patient-specific optimal radiotherapy treatment regimens exhibiting varying levels of trade-off between the two competing clinical objectives: (i) maximizing tumor control (characterized by minimizing the risk of tumor volume growth) and (ii) minimizing the toxicity from radiotherapy. The proposed digital twin framework is illustrated by generating an in silico cohort of 100 patients with high-grade glioma growth and response properties typically observed in the literature. For the same total radiation dose as the SOC, the personalized treatment regimens lead to median increase in tumor time to progression of around six days. Alternatively, for the same level of tumor control as the SOC, the digital twin provides optimal treatment options that lead to a median reduction in radiation dose by 16.7% (10 Gy) compared to SOC total dose of 60 Gy. The range of optimal solutions also provide options with increased doses for patients with aggressive cancer, where SOC does not lead to sufficient tumor control. △ Less

Submitted 23 August, 2023; originally announced August 2023.

Journal ref: Frontiers in Artificial Intelligence, 6, 2023

arXiv:2207.00120 [pdf, other]

Joint Sequential Detection and Isolation for Dependent Data Streams

Authors: Anamitra Chaudhuri, Georgios Fellouris

Abstract: The problem of joint sequential detection and isolation is considered in the context of multiple, not necessarily independent, data streams. A multiple testing framework is proposed, where each hypothesis corresponds to a different subset of data streams, the sample size is a stopping time of the observations, and the probabilities of four kinds of error are controlled below distinct, user-specifi… ▽ More The problem of joint sequential detection and isolation is considered in the context of multiple, not necessarily independent, data streams. A multiple testing framework is proposed, where each hypothesis corresponds to a different subset of data streams, the sample size is a stopping time of the observations, and the probabilities of four kinds of error are controlled below distinct, user-specified levels. Two of these errors reflect the detection component of the formulation, whereas the other two the isolation component. The optimal expected sample size is characterized to a first-order asymptotic approximation as the error probabilities go to 0. Different asymptotic regimes, expressing different prioritizations of the detection and isolation tasks, are considered. A novel, versatile family of testing procedures is proposed, in which two distinct, in general, statistics are computed for each hypothesis, one addressing the detection task and the other the isolation task. Tests in this family, of various computational complexities, are shown to be asymptotically optimal under different setups. The general theory is applied to the detection and isolation of anomalous, not necessarily independent, data streams, as well as to the detection and isolation of an unknown dependence structure. △ Less

Submitted 30 June, 2022; originally announced July 2022.

MSC Class: Primary 62L10; 62L05; secondary 62J15

arXiv:2201.02654 [pdf, other]

A Cross Validation Framework for Signal Denoising with Applications to Trend Filtering, Dyadic CART and Beyond

Authors: Anamitra Chaudhuri, Sabyasachi Chatterjee

Abstract: This paper formulates a general cross validation framework for signal denoising. The general framework is then applied to nonparametric regression methods such as Trend Filtering and Dyadic CART. The resulting cross validated versions are then shown to attain nearly the same rates of convergence as are known for the optimally tuned analogues. There did not exist any previous theoretical analyses o… ▽ More This paper formulates a general cross validation framework for signal denoising. The general framework is then applied to nonparametric regression methods such as Trend Filtering and Dyadic CART. The resulting cross validated versions are then shown to attain nearly the same rates of convergence as are known for the optimally tuned analogues. There did not exist any previous theoretical analyses of cross validated versions of Trend Filtering or Dyadic CART. To illustrate the generality of the framework we also propose and study cross validated versions of two fundamental estimators; lasso for high dimensional linear regression and singular value thresholding for matrix estimation. Our general framework is inspired by the ideas in Chatterjee and Jafarov (2015) and is potentially applicable to a wide range of estimation methods which use tuning parameters. △ Less

Submitted 3 May, 2023; v1 submitted 7 January, 2022; originally announced January 2022.

MSC Class: Primary 62G05; 62G08

arXiv:2112.07096 [pdf, other]

doi 10.1016/j.cma.2022.115730

Learning High-Dimensional Parametric Maps via Reduced Basis Adaptive Residual Networks

Authors: Thomas O'Leary-Roseberry, Xiaosong Du, Anirban Chaudhuri, Joaquim R. R. A. Martins, Karen Willcox, Omar Ghattas

Abstract: We propose a scalable framework for the learning of high-dimensional parametric maps via adaptively constructed residual network (ResNet) maps between reduced bases of the inputs and outputs. When just few training data are available, it is beneficial to have a compact parametrization in order to ameliorate the ill-posedness of the neural network training problem. By linearly restricting high-dime… ▽ More We propose a scalable framework for the learning of high-dimensional parametric maps via adaptively constructed residual network (ResNet) maps between reduced bases of the inputs and outputs. When just few training data are available, it is beneficial to have a compact parametrization in order to ameliorate the ill-posedness of the neural network training problem. By linearly restricting high-dimensional maps to informed reduced bases of the inputs, one can compress high-dimensional maps in a constructive way that can be used to detect appropriate basis ranks, equipped with rigorous error estimates. A scalable neural network learning framework is thus to learn the nonlinear compressed reduced basis mapping. Unlike the reduced basis construction, however, neural network constructions are not guaranteed to reduce errors by adding representation power, making it difficult to achieve good practical performance. Inspired by recent approximation theory that connects ResNets to sequential minimizing flows, we present an adaptive ResNet construction algorithm. This algorithm allows for depth-wise enrichment of the neural network approximation, in a manner that can achieve good practical performance by first training a shallow network and then adapting. We prove universal approximation of the associated neural network class for $L^2_ν$ functions on compact sets. Our overall framework allows for constructive means to detect appropriate breadth and depth, and related compact parametrizations of neural networks, significantly reducing the need for architectural hyperparameter tuning. Numerical experiments for parametric PDE problems and a 3D CFD wing design optimization parametric map demonstrate that the proposed methodology can achieve remarkably high accuracy for limited training data, and outperformed other neural network strategies we compared against. △ Less

Submitted 15 November, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

arXiv:2107.13747 [pdf, ps, other]

doi 10.1016/j.geomphys.2022.104509

Atiyah sequence and Gauge transformations of a principal $2$-bundle over a Lie groupoid

Authors: Saikat Chatterjee, Adittya Chaudhuri, Praphulla Koushik

Abstract: In this paper, a notion of a principal $2$-bundle over a Lie groupoid has been introduced. For such principal $2$-bundles, we produced a short exact sequence of VB-groupoids, namely, the Atiyah sequence. Two notions of connection structures viz. strict connections and semi-strict connections on a principal $2$-bundle arising respectively, from a retraction of the Atiyah sequence and a retraction u… ▽ More In this paper, a notion of a principal $2$-bundle over a Lie groupoid has been introduced. For such principal $2$-bundles, we produced a short exact sequence of VB-groupoids, namely, the Atiyah sequence. Two notions of connection structures viz. strict connections and semi-strict connections on a principal $2$-bundle arising respectively, from a retraction of the Atiyah sequence and a retraction up to a natural isomorphism have been introduced. We constructed a class of principal $\mathbb{G}=[G_1\rightrightarrows G_0]$-bundles and connections from a given principal $G_0$-bundle $E_0\rightarrow X_0$ over $[X_1\rightrightarrows X_0]$ with connection. An existence criterion for the connections on a principal $2$-bundle over a proper, étale Lie groupoid is proposed. The action of the $2$-group of gauge transformations on the category of strict and semi-strict connections has been studied. Finally we noted an extended symmetry of the category of semi-strict connections. △ Less

Submitted 4 August, 2021; v1 submitted 29 July, 2021; originally announced July 2021.

MSC Class: Primary 53C08; Secondary 22A22; 58H05

Journal ref: Journal of Geometry and Physics, Volume 176, June 2022, 104509

arXiv:2101.05129 [pdf, other]

doi 10.2514/1.J060539

Certifiable Risk-Based Engineering Design Optimization

Authors: Anirban Chaudhuri, Boris Kramer, Matthew Norton, Johannes O. Royset, Karen Willcox

Abstract: Reliable, risk-averse design of complex engineering systems with optimized performance requires dealing with uncertainties. A conventional approach is to add safety margins to a design that was obtained from deterministic optimization. Safer engineering designs require appropriate cost and constraint function definitions that capture the \textit{risk} associated with unwanted system behavior in th… ▽ More Reliable, risk-averse design of complex engineering systems with optimized performance requires dealing with uncertainties. A conventional approach is to add safety margins to a design that was obtained from deterministic optimization. Safer engineering designs require appropriate cost and constraint function definitions that capture the \textit{risk} associated with unwanted system behavior in the presence of uncertainties. The paper proposes two notions of certifiability. The first is based on accounting for the magnitude of failure to ensure data-informed conservativeness. The second is the ability to provide optimization convergence guarantees by preserving convexity. Satisfying these notions leads to \textit{certifiable} risk-based design optimization (CRiBDO). In the context of CRiBDO, risk measures based on superquantile (a.k.a.\ conditional value-at-risk) and buffered probability of failure are analyzed. CRiBDO is contrasted with reliability-based design optimization (RBDO), where uncertainties are accounted for via the probability of failure, through a structural and a thermal design problem. A reformulation of the short column structural design problem leading to a convex CRiBDO problem is presented. The CRiBDO formulations capture more information about the problem to assign the appropriate conservativeness, exhibit superior optimization convergence by preserving properties of underlying functions, and alleviate the adverse effects of choosing hard failure thresholds required in RBDO. △ Less

Submitted 13 July, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

Journal ref: AIAA Journal, 60(2), pp.551-565, 2022

arXiv:1906.06964 [pdf, ps, other]

Local topological recursion governs the enumeration of lattice points in $\overline{\mathcal M}_{g,n}$

Authors: Anupam Chaudhuri, Norman Do, Ellena Moskovsky

Abstract: The second author and Norbury initiated the enumeration of lattice points in the Deligne-Mumford compactifications of moduli spaces of curves. They showed that the enumeration may be expressed in terms of polynomials, whose top and bottom degree coefficients store psi-class intersection numbers and orbifold Euler characteristics of $\overline{\mathcal M}_{g,n}$, respectively. Furthermore, they ask… ▽ More The second author and Norbury initiated the enumeration of lattice points in the Deligne-Mumford compactifications of moduli spaces of curves. They showed that the enumeration may be expressed in terms of polynomials, whose top and bottom degree coefficients store psi-class intersection numbers and orbifold Euler characteristics of $\overline{\mathcal M}_{g,n}$, respectively. Furthermore, they ask whether the enumeration is governed by the topological recursion and whether the intermediate coefficients also store algebro-geometric information. In this paper, we prove that the enumeration does indeed satisfy the topological recursion, although with a modification to the initial spectral curve data. Thus, one can consider this to be one of the first known instances of a natural enumerative problem governed by the so-called local topological recursion. Combining the present work with the known relation between local topological recursion and cohomological field theory should uncover the geometric meaning of the intermediate coefficients of the aforementioned polynomials. △ Less

Submitted 17 June, 2019; originally announced June 2019.

Comments: 22 pages

MSC Class: 14N10 (primary); 05A15; 14N35; 30F30

arXiv:1812.11885 [pdf, ps, other]

Generalisations of the Harer-Zagier recursion for 1-point functions

Authors: Anupam Chaudhuri, Norman Do

Abstract: Harer and Zagier proved a recursion to enumerate gluings of a $2d$-gon that result in an orientable genus $g$ surface, in their work on Euler characteristics of moduli spaces of curves. Analogous results have been discovered for other enumerative problems, so it is natural to pose the following question: how large is the family of problems for which these so-called 1-point recursions exist? In t… ▽ More Harer and Zagier proved a recursion to enumerate gluings of a $2d$-gon that result in an orientable genus $g$ surface, in their work on Euler characteristics of moduli spaces of curves. Analogous results have been discovered for other enumerative problems, so it is natural to pose the following question: how large is the family of problems for which these so-called 1-point recursions exist? In this paper, we prove the existence of 1-point recursions for a class of enumerative problems that have Schur function expansions. In particular, we recover the Harer-Zagier recursion, but our methodology also applies to the enumeration of dessins d'enfant, to Bousquet-Mélou-Schaeffer numbers, to monotone Hurwitz numbers, and more. On the other hand, we prove that there is no 1-point recursion that governs simple Hurwitz numbers. Our results are effective in the sense that one can explicitly compute particular instances of 1-point recursions, and we provide several examples. We conclude the paper with a brief discussion and a conjecture relating 1-point recursions to the theory of topological recursion. △ Less

Submitted 31 December, 2018; originally announced December 2018.

Comments: 26 pages

MSC Class: 05A15; 05E10; 14N10

arXiv:1811.06838 [pdf, other]

The Trace Criterion for Kernel Bandwidth Selection for Support Vector Data Description

Authors: Arin Chaudhuri, Carol Sadek, Deovrat Kakde, Wenhao Hu, Hansi Jiang, Seunghyun Kong, Yuewei Liao, Sergiy Peredriy, Haoyu Wang

Abstract: Support vector data description (SVDD) is a popular anomaly detection technique. The SVDD classifier partitions the whole data space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, for which the Gaussian kernel is a common choic… ▽ More Support vector data description (SVDD) is a popular anomaly detection technique. The SVDD classifier partitions the whole data space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, for which the Gaussian kernel is a common choice. The Gaussian kernel has a bandwidth parameter, and it is important to set the value of this parameter correctly for good results. A small bandwidth leads to overfitting such that the resulting SVDD classifier overestimates the number of anomalies, whereas a large bandwidth leads to underfitting and an inability to detect many anomalies. In this paper, we present a new unsupervised method for selecting the Gaussian kernel bandwidth. Our method exploits a low-rank representation of the kernel matrix to suggest a kernel bandwidth value. Our new technique is competitive with the current state of the art for low-dimensional data and performs extremely well for many classes of high-dimensional data. Because the mathematical formulation of SVDD is identical with the mathematical formulation of one-class support vector machines (OCSVM) when the Gaussian kernel is used, our method is equally applicable to Gaussian kernel bandwidth tuning for OCSVM. △ Less

Submitted 5 February, 2020; v1 submitted 15 November, 2018; originally announced November 2018.

Comments: note: some text overlap with arXiv:1708.05106 because common background material is covered in both papers

arXiv:1601.05983 [pdf]

Much ado about Zero

Authors: Asis Kumar Chaudhuri

Abstract: A brief historical introduction for the enigmatic number Zero is given. The discussions are for popular consumption. A brief historical introduction for the enigmatic number Zero is given. The discussions are for popular consumption. △ Less

Submitted 7 June, 2016; v1 submitted 22 January, 2016; originally announced January 2016.

Comments: minor corrections are made. 13 pages, 5 figures

arXiv:1008.2366 [pdf, ps, other]

Ultrametric Cantor sets and Origin of Anomalous Diffusion

Authors: Dhurjati Prasad Datta, Santanu Raut, Anuja Roy Chaudhuri

Abstract: The anomalous mean square fluctuations are shown to arise naturally from the ordinary diffusion equation interpreted scale invariantly in a formalism endowing real numbers with a nonarchimedean multiplicative structure. A variable $t$ approaching 0 linearly in the ordinary analysis is shown to enjoy instead a sublinear $t\log t^{-1}$ flow in the presence of this scale invariant structure. Diffusio… ▽ More The anomalous mean square fluctuations are shown to arise naturally from the ordinary diffusion equation interpreted scale invariantly in a formalism endowing real numbers with a nonarchimedean multiplicative structure. A variable $t$ approaching 0 linearly in the ordinary analysis is shown to enjoy instead a sublinear $t\log t^{-1}$ flow in the presence of this scale invariant structure. Diffusion on an ultrametric Cantor set is also generically subdiffusive with the above sublinear mean square deviation. The present study seems to offer a new interpretation of a possible emergence of complex patterns from an apparently simple system. △ Less

Submitted 13 August, 2010; originally announced August 2010.

Comments: Latex 2e, 12 pages

Showing 1–20 of 20 results for author: Chaudhuri, A