Showing 1–31 of 31 results for author: Gupta, N

  1. arXiv:2506.01473  [pdf, ps, other]

    stat.ME math.ST stat.AP

    Characterization based Goodness-of-Fit for Generalized Pareto Distribution: A Blend of Stein's Identity and Dynamic Survival Extropy

    Authors: Gaurav Kandpal, Nitin Gupta

    Abstract: This paper proposes a goodness-of-fit test for the generalized Pareto distribution (GPD). First, we provide two characterizations of the GPD based on Stein's identity and dynamic survival extropy. These characterizations are used to test for the GPD separately for the positive and negative shape parameter cases. A Monte Carlo simulation is conducted to provide the critical values and power of the proposed t…

    Submitted 2 June, 2025; originally announced June 2025.

    MSC Class: 62G10; 62G20; 62B10; 94A17

  2. arXiv:2505.19824  [pdf, ps, other]

    math.ST stat.AP

    Weighted Tail Random Variable: A Novel Framework with Stochastic Properties and Applications

    Authors: Sarikul Islam, Nitin Gupta

    Abstract: This paper introduces a novel framework to construct the probability density function (PDF) of non-negative continuous random variables. The proposed framework uses two functions: one is the survival function (SF) of a non-negative continuous random variable, and the other is a weight function, which is an increasing and differentiable function satisfying some properties. The resulting random vari…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 28 pages, 4 figures, Original work

    MSC Class: 2020: Primary 62N05; 60E15; Secondary 62N02

  3. arXiv:2502.01255  [pdf, other]

    stat.ME math.ST

    Inference of Half Logistic Geometric Distribution Based on Generalized Order Statistics

    Authors: Neetu Gupta, S. K. Neogy, Qazi J. Azhad, Bhagwati Devi

    Abstract: As a unification of various models of ordered quantities, generalized order statistics, introduced by Kamps (1995), offer a simple common framework. In the present study, expressions for the marginal and joint moment generating functions of the half logistic geometric distribution are presented based on the generalized order statistics framework. We also consider the estima…

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: 20 pages, 3 figures, 7 tables, preprint

    ACM Class: G.3

  4. arXiv:2501.14797  [pdf, ps, other]

    math.ST stat.AP stat.CO

    A characterization of uniform distribution using varextropy with application in testing uniformity

    Authors: Santosh Kumar Chaudhary, Nitin Gupta

    Abstract: In statistical analysis, quantifying uncertainties through measures such as entropy, extropy, varentropy, and varextropy is of fundamental importance for understanding distribution functions. This paper investigates several properties of varextropy and gives a new characterization of the uniform distribution using varextropy. Previously proposed estimators are used as test statistics. Building on the…

    Submitted 12 January, 2025; originally announced January 2025.

  5. arXiv:2407.11634  [pdf, ps, other]

    stat.ME math.ST

    A goodness-of-fit test for testing exponentiality based on normalized dynamic survival extropy

    Authors: Gaurav Kandpal, Nitin Gupta

    Abstract: The cumulative residual extropy (CRJ) is a measure of uncertainty that serves as an alternative to extropy. It replaces the probability density function with the survival function in the expression of extropy. This work introduces a new concept called normalized dynamic survival extropy (NDSE), a dynamic variation of CRJ. We observe that NDSE is equivalent to CRJ of the random variable of interest…

    Submitted 16 July, 2024; originally announced July 2024.

  6. arXiv:2406.05834  [pdf, ps, other]

    math.ST math.PR stat.AP

    Stochastic comparison of series and parallel systems lifetime in Archimedean copula under random shock

    Authors: Sarikul Islam, Nitin Gupta

    Abstract: In this paper, we study the stochastic ordering behavior of the lifetimes of series and parallel systems comprising dependent and heterogeneous components, experiencing random shocks, and exhibiting distinct dependency structures. We establish certain conditions on the lifetime of individual components where the dependency among components is defined by Archimedean copulas, and the impact of rando…

    Submitted 28 May, 2025; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 19 pages, Original work

    MSC Class: Primary: 60E15; 90B25; Secondary: 62G30

  7. arXiv:2307.02764  [pdf, other]

    cs.LG stat.ML

    When Does Confidence-Based Cascade Deferral Suffice?

    Authors: Wittawat Jitkrittum, Neha Gupta, Aditya Krishna Menon, Harikrishna Narasimhan, Ankit Singh Rawat, Sanjiv Kumar

    Abstract: Cascades are a classical strategy to enable inference cost to vary adaptively across samples, wherein a sequence of classifiers are invoked in turn. A deferral rule determines whether to invoke the next classifier in the sequence, or to terminate prediction. One simple deferral rule employs the confidence of the current classifier, e.g., based on the maximum predicted softmax probability. Despite…

    Submitted 23 January, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  8. arXiv:2305.01655  [pdf, other]

    cs.LG stat.ME

    Predicting blood pressure under circumstances of missing data: An analysis of missing data patterns and imputation methods using NHANES

    Authors: Harish Chauhan, Nikunj Gupta, Zoe Haskell-Craig

    Abstract: The World Health Organization defines cardio-vascular disease (CVD) as "a group of disorders of the heart and blood vessels," including coronary heart disease and stroke (WHO 21). CVD is affected by "intermediate risk factors" such as raised blood pressure, raised blood glucose, raised blood lipids, and obesity. These are predominantly influenced by lifestyle and behaviour, including physical inac…

    Submitted 1 May, 2023; originally announced May 2023.

  9. arXiv:2301.02698  [pdf, ps, other]

    stat.AP

    Testing exponentiality using extropy of upper record values

    Authors: Santosh Kumar Chaudhary, Nitin Gupta

    Abstract: We give a characterization of the exponential distribution using the extropy of the nth upper k-record value. We introduce test statistics based on the proposed characterization that will be used to test exponentiality. The critical value and power of the test have been calculated using Monte Carlo simulation. The test is applied to seven real-life data sets to verify its applicability in…

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: 23 pages

  10. arXiv:2209.06703  [pdf, other]

    stat.ME

    Testing of symmetry based on cumulative past and residual extropy of record values

    Authors: Santosh Kumar Chaudhary, Nitin Gupta

    Abstract: In this paper, we test for symmetry in the distribution of data observed on a random variable. We propose test statistics using cumulative past and residual extropy of record values based on the characterization developed by Gupta and Chaudhary (2022) [5]. It is shown that the obtained estimator is consistent. Our proposed test has the advantage that we do not need to estimate the centre of…

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: 27 pages, 1 figure

    MSC Class: 62G30; 62E10; 62G10; 62B10

  11. arXiv:2207.02003  [pdf, ps, other]

    math.ST stat.OT

    On General Weighted Extropy of Ranked Set Sampling

    Authors: Nitin Gupta, Santosh Kumar Chaudhary

    Abstract: In the past six years, considerable attention has been given to the extropy measure proposed by Lad et al. (2015). Weighted extropy of ranked set sampling was studied and compared with simple random sampling by Qiu et al. (2022). The general weighted extropy and some results related to it are introduced in this paper. We provide the general weighted extropy of ranked set sampling. We also studied ch…

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: 17 pages

    MSC Class: 62B10; 62D05

  12. arXiv:2206.10566  [pdf, other]

    stat.ML cs.LG

    Ensembling over Classifiers: a Bias-Variance Perspective

    Authors: Neha Gupta, Jamie Smith, Ben Adlam, Zelda Mariet

    Abstract: Ensembles are a straightforward, remarkably effective method for improving the accuracy, calibration, and robustness of models on classification tasks; yet, the reasons that underlie their success remain an active area of research. We build upon the extension to the bias-variance decomposition by Pfau (2013) in order to gain crucial insights into the behavior of ensembles of classifiers. Introducin…

    Submitted 21 June, 2022; originally announced June 2022.

  13. arXiv:2202.04167  [pdf, other]

    stat.ML cs.LG math.PR

    Understanding the bias-variance tradeoff of Bregman divergences

    Authors: Ben Adlam, Neha Gupta, Zelda Mariet, Jamie Smith

    Abstract: This paper builds upon the work of Pfau (2013), which generalized the bias-variance tradeoff to any Bregman divergence loss function. Pfau (2013) showed that for Bregman divergences, the bias and variances are defined with respect to a central label, defined as the mean of the label variable, and a central prediction, of a more complex form. We show that, similarly to the label, the central predic…

    Submitted 9 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  14. arXiv:2108.08670  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    On Accelerating Distributed Convex Optimizations

    Authors: Kushal Chakrabarti, Nirupam Gupta, Nikhil Chopra

    Abstract: This paper studies a distributed multi-agent convex optimization problem. The system comprises multiple agents in this problem, each with a set of local data points and an associated local cost function. The agents are connected to a server, and there is no inter-agent communication. The agents' goal is to learn a parameter vector that optimizes the aggregate of their local costs without revealing…

    Submitted 19 August, 2021; originally announced August 2021.

  15. arXiv:2101.10967  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    Robustness of Iteratively Pre-Conditioned Gradient-Descent Method: The Case of Distributed Linear Regression Problem

    Authors: Kushal Chakrabarti, Nirupam Gupta, Nikhil Chopra

    Abstract: This paper considers the problem of multi-agent distributed linear regression in the presence of system noises. In this problem, the system comprises multiple agents wherein each agent locally observes a set of data points, and the agents' goal is to compute a linear model that best fits the collective data points observed by all the agents. We consider a server-based distributed architecture wher…

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: in IEEE Control Systems Letters. Related articles: arXiv:2003.07180v2 [math.OC], arXiv:2008.02856v1 [math.OC], and arXiv:2011.07595v2 [math.OC]

    Journal ref: IEEE Control Systems Letters, vol. 5, no. 6, pp. 2180-2185, Dec. 2021

  16. arXiv:2011.07595  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    Accelerating Distributed SGD for Linear Regression using Iterative Pre-Conditioning

    Authors: Kushal Chakrabarti, Nirupam Gupta, Nikhil Chopra

    Abstract: This paper considers the multi-agent distributed linear least-squares problem. The system comprises multiple agents, each agent with a locally observed set of data points, and a common server with whom the agents can interact. The agents' goal is to compute a linear model that best fits the collective data points observed by all the agents. In the server-based distributed settings, the server cann…

    Submitted 28 November, 2020; v1 submitted 15 November, 2020; originally announced November 2020.

    Comments: Changes in the replacement: Application to distributed state estimation problem has been added in Appendix B. Related articles: arXiv:2003.07180v2 [math.OC] and arXiv:2008.02856v1 [math.OC]

  17. arXiv:2010.08633  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Universal guarantees for decision tree induction via a higher-order splitting criterion

    Authors: Guy Blanc, Neha Gupta, Jane Lange, Li-Yang Tan

    Abstract: We propose a simple extension of top-down decision tree learning heuristics such as ID3, C4.5, and CART. Our algorithm achieves provable guarantees for all target functions $f: \{-1,1\}^n \to \{-1,1\}$ with respect to the uniform distribution, circumventing impossibility results showing that existing heuristics fare poorly even for simple target functions. The crux of our extension is a new splitt…

    Submitted 16 October, 2020; originally announced October 2020.

  18. arXiv:2008.13374  [pdf, ps, other]

    cs.LG stat.ML

    Active Local Learning

    Authors: Arturs Backurs, Avrim Blum, Neha Gupta

    Abstract: In this work we consider active local learning: given a query point $x$, and active access to an unlabeled training set $S$, output the prediction $h(x)$ of a near-optimal $h \in H$ using significantly fewer labels than would be needed to actually learn $h$ fully. In particular, the number of label queries should be independent of the complexity of $H$, and the function $h$ should be well-defined,…

    Submitted 3 September, 2020; v1 submitted 31 August, 2020; originally announced August 2020.

    Comments: Published at COLT 2020

  19. arXiv:2008.04699  [pdf, other]

    cs.LG cs.DC stat.ML

    Byzantine Fault-Tolerant Distributed Machine Learning Using Stochastic Gradient Descent (SGD) and Norm-Based Comparative Gradient Elimination (CGE)

    Authors: Nirupam Gupta, Shuo Liu, Nitin H. Vaidya

    Abstract: This paper considers the Byzantine fault-tolerance problem in distributed stochastic gradient descent (D-SGD) method - a popular algorithm for distributed multi-agent machine learning. In this problem, each agent samples data points independently from a certain data-generating distribution. In the fault-free case, the D-SGD method allows all the agents to learn a mathematical model best fitting th…

    Submitted 17 April, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

    Comments: The report includes 52 pages and 16 figures. Extension of our prior work on Byzantine fault-tolerant distributed optimization (arXiv:1903.08752 and doi:10.1145/3382734.3405748) to Byzantine fault-tolerant distributed machine learning; Updated to the full version of workshop paper in DSN-DSML '21

  20. arXiv:2008.02856  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    Iterative Pre-Conditioning for Expediting the Gradient-Descent Method: The Distributed Linear Least-Squares Problem

    Authors: Kushal Chakrabarti, Nirupam Gupta, Nikhil Chopra

    Abstract: This paper considers the multi-agent linear least-squares problem in a server-agent network. In this problem, the system comprises multiple agents, each having a set of local data points, that are connected to a server. The goal for the agents is to compute a linear mathematical model that optimally fits the collective data points held by all the agents, without sharing their individual local data…

    Submitted 6 August, 2021; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: Update: figures for the rest of the datasets have been added

    Journal ref: Automatica, Volume 137, 2022, p110095

  21. arXiv:2005.00123  [pdf, other]

    cs.LG cs.CL stat.ML

    Unsupervised Learning of KB Queries in Task-Oriented Dialogs

    Authors: Dinesh Raghu, Nikhil Gupta, Mausam

    Abstract: Task-oriented dialog (TOD) systems often need to formulate knowledge base (KB) queries corresponding to the user intent and use the query results to generate system responses. Existing approaches require dialog datasets to explicitly annotate these KB queries -- these annotations can be time-consuming and expensive. In response, we define the novel problems of predicting the KB query and training…

    Submitted 3 June, 2021; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: Presented at ACL 2021

    Journal ref: Transactions of the Association for Computational Linguistics (2021) 9: 374-390

  22. arXiv:2003.07180  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    Iterative Pre-Conditioning to Expedite the Gradient-Descent Method

    Authors: Kushal Chakrabarti, Nirupam Gupta, Nikhil Chopra

    Abstract: This paper considers the problem of multi-agent distributed optimization. In this problem, there are multiple agents in the system, and each agent only knows its local cost function. The objective for the agents is to collectively compute a common minimum of the aggregate of all their local cost functions. In principle, this problem is solvable using a distributed variant of the traditional gradie…

    Submitted 29 March, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

    Comments: Accepted for the proceedings of the 2020 American Control Conference

  23. Extreme Regression for Dynamic Search Advertising

    Authors: Yashoteja Prabhu, Aditya Kusupati, Nilesh Gupta, Manik Varma

    Abstract: This paper introduces a new learning paradigm called eXtreme Regression (XR) whose objective is to accurately predict the numerical degrees of relevance of an extremely large number of labels to a data point. XR can provide elegant solutions to many large-scale ranking and recommendation applications including Dynamic Search Advertising (DSA). XR can learn more accurate models than the recently po…

    Submitted 20 January, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: 15 pages, 4 figures, published at WSDM 2020 as a Long Oral

  24. arXiv:1910.08108  [pdf, other]

    cs.LG cs.CV stat.ML

    Enforcing Linearity in DNN succours Robustness and Adversarial Image Generation

    Authors: Anindya Sarkar, Nikhil Kumar Gupta, Raghu Iyengar

    Abstract: Recent studies on the adversarial vulnerability of neural networks have shown that models trained with the objective of minimizing an upper bound on the worst-case loss over all possible adversarial perturbations improve robustness against adversarial attacks. Besides exploiting the adversarial training framework, we show that by enforcing a Deep Neural Network (DNN) to be linear in transformed input a…

    Submitted 21 October, 2019; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: Adversarial Machine Learning

  25. arXiv:1904.09080  [pdf, other]

    cs.LG stat.ML

    Implicit regularization for deep neural networks driven by an Ornstein-Uhlenbeck like process

    Authors: Guy Blanc, Neha Gupta, Gregory Valiant, Paul Valiant

    Abstract: We consider networks, trained via stochastic gradient descent to minimize $\ell_2$ loss, with the training labels perturbed by independent noise at each iteration. We characterize the behavior of the training dynamics near any parameter vector that achieves zero training error, in terms of an implicit regularization term corresponding to the sum over the data points, of the squared $\ell_2$ norm o…

    Submitted 22 July, 2020; v1 submitted 19 April, 2019; originally announced April 2019.

  26. arXiv:1904.08730  [pdf, other]

    math.ST stat.OT

    Some ordering properties of highest and lowest order statistics with exponentiated Gumble type-II distributed components

    Authors: Surojit Biswas, Nitin Gupta

    Abstract: In this paper, we study stochastic comparisons of the highest and lowest order statistics of the exponentiated Gumble type-II distribution with three parameters. We compare both statistics using three different stochastic orderings. First, we consider a system with different scale and outer shape parameters and then we study the usual stochastic ordering of the lowest and highes…

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: 16 pages, 1 figure

  27. arXiv:1903.08752  [pdf, other]

    cs.LG cs.DC stat.ML

    Byzantine Fault Tolerant Distributed Linear Regression

    Authors: Nirupam Gupta, Nitin H. Vaidya

    Abstract: This paper considers the problem of Byzantine fault tolerance in distributed linear regression in a multi-agent system. However, the proposed algorithms are given for a more general class of distributed optimization problems, of which distributed linear regression is a special case. The system comprises a server and multiple agents, where each agent is holding a certain number of data points an…

    Submitted 4 April, 2019; v1 submitted 20 March, 2019; originally announced March 2019.

    Comments: Manuscript revised by adding a new improved filtering technique and convergence analysis with noise

  28. arXiv:1805.01216  [pdf, other]

    cs.LG cs.CL stat.ML

    Disentangling Language and Knowledge in Task-Oriented Dialogs

    Authors: Dinesh Raghu, Nikhil Gupta, Mausam

    Abstract: The Knowledge Base (KB) used for real-world applications, such as booking a movie or restaurant reservation, keeps changing over time. End-to-end neural networks trained for these task-oriented dialogs are expected to be immune to any changes in the KB. However, existing approaches break down when asked to handle such changes. We propose an encoder-decoder architecture (BoSsNet) with a novel Bag-of…

    Submitted 5 April, 2019; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: Published in NAACL-HLT 2019

  29. arXiv:1804.04780  [pdf, other]

    stat.ML cs.LG

    A Grid Based Adversarial Clustering Algorithm

    Authors: Wutao Wei, Nikhil Gupta, Bowei Xi

    Abstract: Nowadays more and more data are gathered for detecting and preventing cyber attacks. In cyber security applications, data analytics techniques have to deal with active adversaries that try to deceive the data analytics models and avoid being detected. The existence of such adversarial behavior motivates the development of robust and resilient adversarial learning techniques for various tasks. Most…

    Submitted 21 November, 2024; v1 submitted 13 April, 2018; originally announced April 2018.

  30. arXiv:1011.2929  [pdf]

    stat.AP math-ph physics.data-an

    Intrinsic Geometric Analysis of the Network Reliability and Voltage Stability

    Authors: N. Gupta, B. N. Tiwari, S. Bellucci

    Abstract: This paper presents the intrinsic geometric model for the solution of power system planning and its operation. This problem is large-scale and nonlinear, in general. Thus, we have developed the intrinsic geometric model for the network reliability and voltage stability, and examined it for the IEEE 5 bus system. The robustness of the proposed model is illustrated by introducing variations of the n…

    Submitted 12 November, 2010; originally announced November 2010.

    Comments: 8 pages, 4 figures, 2 tables, Index Terms -- Circuit modeling, geometric modeling, parameter space method, power system reliability, power system stability, transmission planning, nonlinear methods, geometric controls, components optimization

  31. arXiv:1011.2924  [pdf, ps, other]

    stat.AP math-ph physics.data-an

    Geometric Design and Stability of Power Networks

    Authors: Neeraj Gupta, Bhupendra Nath Tiwari, Stefano Bellucci

    Abstract: From the perspective of the network theory, the present work illustrates how the parametric intrinsic geometric description exhibits an exact set of pair correction functions and global correlation volume with and without the inclusion of the imaginary power flow. The Gaussian fluctuations about the equilibrium basis accomplish a well-defined, non-degenerate, curved regular intrinsic Riemannian su…

    Submitted 12 November, 2010; originally announced November 2010.

    Comments: 23 pages, 11 figures, Keywords: Correlation; Geometry; Power Flow; Network; Stability