-
Uncovering the topology of an infinite-server queueing network from population data
Authors:
Hritika Gupta,
Michel Mandjes,
Liron Ravner,
Jiesen Wang
Abstract:
This paper studies statistical inference in a network of infinite-server queues, with the aim of estimating the underlying parameters (routing matrix, arrival rates, parameters pertaining to the service times) using observations of the network population vector at Poisson time points. We propose a method-of-moments estimator and establish its consistency. The method relies on deriving the covarian…
▽ More
This paper studies statistical inference in a network of infinite-server queues, with the aim of estimating the underlying parameters (routing matrix, arrival rates, parameters pertaining to the service times) using observations of the network population vector at Poisson time points. We propose a method-of-moments estimator and establish its consistency. The method relies on deriving the covariance structure of different nodes at different sampling epochs. Numerical experiments demonstrate that the method yields accurate estimates, even in settings with a large number of parameters. Two model variants are considered: one that assumes a known parametric form for the service-time distributions, and a model-free version that does not require such assumptions.
△ Less
Submitted 8 June, 2025;
originally announced June 2025.
-
Reconfigurable Inspection in Manufacturing: State of the Art and Taxonomy
Authors:
Harshit Gupta,
Ashok Kumar Madan
Abstract:
This article provides an overview of the evolution of the product quality and measurement inspection procedure with emphasis on the Reconfigurable Inspection System and Machine. The major components of a reconfigurable manufacturing system have been examined, and the evolution of manufacturing processes has been briefly discussed. Different Reconfigurable Inspection Machines (RIMs) and their arran…
▽ More
This article provides an overview of the evolution of the product quality and measurement inspection procedure with emphasis on the Reconfigurable Inspection System and Machine. The major components of a reconfigurable manufacturing system have been examined, and the evolution of manufacturing processes has been briefly discussed. Different Reconfigurable Inspection Machines (RIMs) and their arrangement in an assembly line as an inspection system have been carefully studied and the modern inspection system equipped in RMS has been compared to the traditional techniques commonly used in inspection of product quality. A survey of evolving inspection techniques is offered from the standpoint of technological challenges and advancement affecting manufacturing over time. As per authors' knowledge, the review on Reconfigurable Inspection in Manufacturing and taxonomy of reconfigurable inspection systems is rare. Considering the studies done in this domain, there is still resourceful taxonomy for this paradigm. Therefore, different types of inspection procedures have been discussed, their features and applications have been compared to arrive at the taxonomy of the RIS based on the understanding of the nature of a RIS after a critical review.
△ Less
Submitted 11 November, 2023;
originally announced November 2023.
-
Assessment of DeepONet for reliability analysis of stochastic nonlinear dynamical systems
Authors:
Shailesh Garg,
Harshit Gupta,
Souvik Chakraborty
Abstract:
Time dependent reliability analysis and uncertainty quantification of structural system subjected to stochastic forcing function is a challenging endeavour as it necessitates considerable computational time. We investigate the efficacy of recently proposed DeepONet in solving time dependent reliability analysis and uncertainty quantification of systems subjected to stochastic loading. Unlike conve…
▽ More
Time dependent reliability analysis and uncertainty quantification of structural system subjected to stochastic forcing function is a challenging endeavour as it necessitates considerable computational time. We investigate the efficacy of recently proposed DeepONet in solving time dependent reliability analysis and uncertainty quantification of systems subjected to stochastic loading. Unlike conventional machine learning and deep learning algorithms, DeepONet learns is a operator network and learns a function to function mapping and hence, is ideally suited to propagate the uncertainty from the stochastic forcing function to the output responses. We use DeepONet to build a surrogate model for the dynamical system under consideration. Multiple case studies, involving both toy and benchmark problems, have been conducted to examine the efficacy of DeepONet in time dependent reliability analysis and uncertainty quantification of linear and nonlinear dynamical systems. Results obtained indicate that the DeepONet architecture is accurate as well as efficient. Moreover, DeepONet posses zero shot learning capabilities and hence, a trained model easily generalizes to unseen and new environment with no further training.
△ Less
Submitted 31 January, 2022;
originally announced January 2022.
-
The Dark Machines Anomaly Score Challenge: Benchmark Data and Model Independent Event Classification for the Large Hadron Collider
Authors:
T. Aarrestad,
M. van Beekveld,
M. Bona,
A. Boveia,
S. Caron,
J. Davies,
A. De Simone,
C. Doglioni,
J. M. Duarte,
A. Farbin,
H. Gupta,
L. Hendriks,
L. Heinrich,
J. Howarth,
P. Jawahar,
A. Jueid,
J. Lastow,
A. Leinweber,
J. Mamuzic,
E. Merényi,
A. Morandini,
P. Moskvitina,
C. Nellist,
J. Ngadiuba,
B. Ostdiek
, et al. (14 additional authors not shown)
Abstract:
We describe the outcome of a data challenge conducted as part of the Dark Machines Initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims at detecting signals of new physics at the LHC using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We defin…
▽ More
We describe the outcome of a data challenge conducted as part of the Dark Machines Initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims at detecting signals of new physics at the LHC using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of >1 Billion simulated LHC events corresponding to $10~\rm{fb}^{-1}$ of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics searches during the third run of the LHC, and provide our benchmark dataset for future studies at https://www.phenoMLdata.org. Code to reproduce the analysis is provided at https://github.com/bostdiek/DarkMachines-UnsupervisedChallenge.
△ Less
Submitted 9 December, 2021; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Computing Accurate Probabilistic Estimates of One-D Entropy from Equiprobable Random Samples
Authors:
Hoshin V Gupta,
Mohammed Reza Ehsani,
Tirthankar Roy,
Maria A Sans-Fuentes,
Uwe Ehret,
Ali Behrangi
Abstract:
We develop a simple Quantile Spacing (QS) method for accurate probabilistic estimation of one-dimensional entropy from equiprobable random samples, and compare it with the popular Bin-Counting (BC) method. In contrast to BC, which uses equal-width bins with varying probability mass, the QS method uses estimates of the quantiles that divide the support of the data generating probability density fun…
▽ More
We develop a simple Quantile Spacing (QS) method for accurate probabilistic estimation of one-dimensional entropy from equiprobable random samples, and compare it with the popular Bin-Counting (BC) method. In contrast to BC, which uses equal-width bins with varying probability mass, the QS method uses estimates of the quantiles that divide the support of the data generating probability density function (pdf) into equal-probability-mass intervals. Whereas BC requires optimal tuning of a bin-width hyper-parameter whose value varies with sample size and shape of the pdf, QS requires specification of the number of quantiles to be used. Results indicate, for the class of distributions tested, that the optimal number of quantile-spacings is a fixed fraction of the sample size (empirically determined to be ~0.25-0.35), and that this value is relatively insensitive to distributional form or sample size, providing a clear advantage over BC since hyperparameter tuning is not required. Bootstrapping is used to approximate the sampling variability distribution of the resulting entropy estimate, and is shown to accurately reflect the true uncertainty. For the four distributional forms studied (Gaussian, Log-Normal, Exponential and Bimodal Gaussian Mixture), expected estimation bias is less than 1% and uncertainty is relatively low even for very small sample sizes. We speculate that estimating quantile locations, rather than bin-probabilities, results in more efficient use of the information in the data to approximate the underlying shape of an unknown data generating pdf.
△ Less
Submitted 15 June, 2021; v1 submitted 24 February, 2021;
originally announced February 2021.
-
The Mean-Squared Error of Double Q-Learning
Authors:
Wentao Weng,
Harsh Gupta,
Niao He,
Lei Ying,
R. Srikant
Abstract:
In this paper, we establish a theoretical comparison between the asymptotic mean-squared error of Double Q-learning and Q-learning. Our result builds upon an analysis for linear stochastic approximation based on Lyapunov equations and applies to both tabular setting and with linear function approximation, provided that the optimal policy is unique and the algorithms converge. We show that the asym…
▽ More
In this paper, we establish a theoretical comparison between the asymptotic mean-squared error of Double Q-learning and Q-learning. Our result builds upon an analysis for linear stochastic approximation based on Lyapunov equations and applies to both tabular setting and with linear function approximation, provided that the optimal policy is unique and the algorithms converge. We show that the asymptotic mean-squared error of Double Q-learning is exactly equal to that of Q-learning if Double Q-learning uses twice the learning rate of Q-learning and outputs the average of its two estimators. We also present some practical implications of this theoretical observation using simulations.
△ Less
Submitted 14 June, 2022; v1 submitted 9 July, 2020;
originally announced July 2020.
-
Mixed Logit Models and Network Formation
Authors:
Harsh Gupta,
Mason A. Porter
Abstract:
The study of network formation is pervasive in economics, sociology, and many other fields. In this paper, we model network formation as a `choice' that is made by nodes in a network to connect to other nodes. We study these `choices' using discrete-choice models, in which an agent chooses between two or more discrete alternatives. We employ the `repeated-choice' (RC) model to study network format…
▽ More
The study of network formation is pervasive in economics, sociology, and many other fields. In this paper, we model network formation as a `choice' that is made by nodes in a network to connect to other nodes. We study these `choices' using discrete-choice models, in which an agent chooses between two or more discrete alternatives. We employ the `repeated-choice' (RC) model to study network formation. We argue that the RC model overcomes important limitations of the multinomial logit (MNL) model, which gives one framework for studying network formation, and that it is well-suited to study network formation. We also illustrate how to use the RC model to accurately study network formation using both synthetic and real-world networks. Using edge-independent synthetic networks, we also compare the performance of the MNL model and the RC model. We find that the RC model estimates the data-generation process of our synthetic networks more accurately than the MNL model. In a patent citation network, which forms sequentially, we present a case study of a qualitatively interesting scenario -- the fact that new patents are more likely to cite older, more cited, and similar patents -- for which employing the RC model yields interesting insights.
△ Less
Submitted 26 August, 2022; v1 submitted 30 June, 2020;
originally announced June 2020.
-
Approaches and Applications of Early Classification of Time Series: A Review
Authors:
Ashish Gupta,
Hari Prabhat Gupta,
Bhaskar Biswas,
Tanima Dutta
Abstract:
Early classification of time series has been extensively studied for minimizing class prediction delay in time-sensitive applications such as healthcare and finance. A primary task of an early classification approach is to classify an incomplete time series as soon as possible with some desired level of accuracy. Recent years have witnessed several approaches for early classification of time serie…
▽ More
Early classification of time series has been extensively studied for minimizing class prediction delay in time-sensitive applications such as healthcare and finance. A primary task of an early classification approach is to classify an incomplete time series as soon as possible with some desired level of accuracy. Recent years have witnessed several approaches for early classification of time series. As most of the approaches have solved the early classification problem with different aspects, it becomes very important to make a thorough review of the existing solutions to know the current status of the area. These solutions have demonstrated reasonable performance in a wide range of applications including human activity recognition, gene expression based health diagnostic, industrial monitoring, and so on. In this paper, we present a systematic review of current literature on early classification approaches for both univariate and multivariate time series. We divide various existing approaches into four exclusive categories based on their proposed solution strategies. The four categories include prefix based, shapelet based, model based, and miscellaneous approaches. The authors also discuss the applications of early classification in many areas including industrial monitoring, intelligent transportation, and medical. Finally, we provide a quick summary of the current literature with future research directions.
△ Less
Submitted 15 October, 2020; v1 submitted 6 May, 2020;
originally announced May 2020.
-
Deep Neural Networks with Trainable Activations and Controlled Lipschitz Constant
Authors:
Shayan Aziznejad,
Harshit Gupta,
Joaquim Campos,
Michael Unser
Abstract:
We introduce a variational framework to learn the activation functions of deep neural networks. Our aim is to increase the capacity of the network while controlling an upper-bound of the actual Lipschitz constant of the input-output relation. To that end, we first establish a global bound for the Lipschitz constant of neural networks. Based on the obtained bound, we then formulate a variational pr…
▽ More
We introduce a variational framework to learn the activation functions of deep neural networks. Our aim is to increase the capacity of the network while controlling an upper-bound of the actual Lipschitz constant of the input-output relation. To that end, we first establish a global bound for the Lipschitz constant of neural networks. Based on the obtained bound, we then formulate a variational problem for learning activation functions. Our variational problem is infinite-dimensional and is not computationally tractable. However, we prove that there always exists a solution that has continuous and piecewise-linear (linear-spline) activations. This reduces the original problem to a finite-dimensional minimization where an l1 penalty on the parameters of the activations favors the learning of sparse nonlinearities. We numerically compare our scheme with standard ReLU network and its variations, PReLU and LeakyReLU and we empirically demonstrate the practical aspects of our framework.
△ Less
Submitted 7 August, 2020; v1 submitted 17 January, 2020;
originally announced January 2020.
-
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning
Authors:
Harsh Gupta,
R. Srikant,
Lei Ying
Abstract:
We study two time-scale linear stochastic approximation algorithms, which can be used to model well-known reinforcement learning algorithms such as GTD, GTD2, and TDC. We present finite-time performance bounds for the case where the learning rate is fixed. The key idea in obtaining these bounds is to use a Lyapunov function motivated by singular perturbation theory for linear differential equation…
▽ More
We study two time-scale linear stochastic approximation algorithms, which can be used to model well-known reinforcement learning algorithms such as GTD, GTD2, and TDC. We present finite-time performance bounds for the case where the learning rate is fixed. The key idea in obtaining these bounds is to use a Lyapunov function motivated by singular perturbation theory for linear differential equations. We use the bound to design an adaptive learning rate scheme which significantly improves the convergence rate over the known optimal polynomial decay rule in our experiments, and can be used to potentially improve the performance of any other schedule where the learning rate is changed at pre-determined time instants.
△ Less
Submitted 14 July, 2019;
originally announced July 2019.
-
Almost Boltzmann Exploration
Authors:
Harsh Gupta,
Seo Taek Kong,
R. Srikant,
Weina Wang
Abstract:
Boltzmann exploration is widely used in reinforcement learning to provide a trade-off between exploration and exploitation. Recently, in (Cesa-Bianchi et al., 2017) it has been shown that pure Boltzmann exploration does not perform well from a regret perspective, even in the simplest setting of stochastic multi-armed bandit (MAB) problems. In this paper, we show that a simple modification to Boltz…
▽ More
Boltzmann exploration is widely used in reinforcement learning to provide a trade-off between exploration and exploitation. Recently, in (Cesa-Bianchi et al., 2017) it has been shown that pure Boltzmann exploration does not perform well from a regret perspective, even in the simplest setting of stochastic multi-armed bandit (MAB) problems. In this paper, we show that a simple modification to Boltzmann exploration, motivated by a variation of the standard doubling trick, achieves $O(K\log^{1+α} T)$ regret for a stochastic MAB problem with $K$ arms, where $α>0$ is a parameter of the algorithm. This improves on the result in (Cesa-Bianchi et al., 2017), where an algorithm inspired by the Gumbel-softmax trick achieves $O(K\log^2 T)$ regret. We also show that our algorithm achieves $O(β(G) \log^{1+α} T)$ regret in stochastic MAB problems with graph-structured feedback, without knowledge of the graph structure, where $β(G)$ is the independence number of the feedback graph. Additionally, we present extensive experimental results on real datasets and applications for multi-armed bandits with both traditional bandit feedback and graph-structured feedback. In all cases, our algorithm performs as well or better than the state-of-the-art.
△ Less
Submitted 19 April, 2019; v1 submitted 24 January, 2019;
originally announced January 2019.
-
Information vs. Uncertainty as the Foundation for a Science of Environmental Modeling
Authors:
Grey Nearing,
Hoshin Gupta
Abstract:
Information accounting provides a better foundation for hypothesis testing than does uncertainty quantification. A quantitative account of science is derived under this perspective that alleviates the need for epistemic bridge principles, solves the problem of ad hoc falsification criteria, and deals with verisimilitude by facilitating a general approach to process-level diagnostics. Our argument…
▽ More
Information accounting provides a better foundation for hypothesis testing than does uncertainty quantification. A quantitative account of science is derived under this perspective that alleviates the need for epistemic bridge principles, solves the problem of ad hoc falsification criteria, and deals with verisimilitude by facilitating a general approach to process-level diagnostics. Our argument is that the well-known inconsistencies of both Bayesian and classical statistical hypothesis tests are due to the fact that probability theory is an insufficient logic of science. Information theory, as an extension of probability theory, is required to provide a complete logic on which to base quantitative theories of empirical learning. The organizing question in this case becomes not whether our theories or models are more or less true, or about how much uncertainty is associated with a particular model, but instead whether there is any information available from experimental data that might allow us to improve the model. This becomes a formal hypothesis test, provides a theory of model diagnostics, and suggests a new approach to building dynamical systems models.
△ Less
Submitted 24 April, 2017;
originally announced April 2017.