-
Collaborative Algorithms for Online Personalized Mean Estimation
Authors:
Mahsa Asadi,
Aurélien Bellet,
Odalric-Ambrym Maillard,
Marc Tommasi
Abstract:
We consider an online estimation problem involving a set of agents. Each agent has access to a (personal) process that generates samples from a real-valued distribution and seeks to estimate its mean. We study the case where some of the distributions have the same mean, and the agents are allowed to actively query information from other agents. The goal is to design an algorithm that enables each agent to improve its mean estimate thanks to communication with other agents. The means as well as the number of distributions with the same mean are unknown, which makes the task nontrivial. We introduce a novel collaborative strategy to solve this online personalized mean estimation problem. We analyze its time complexity and introduce variants that enjoy good performance in numerical experiments. We also extend our approach to the setting where clusters of agents with similar means seek to estimate the mean of their cluster.
Submitted 19 December, 2022; v1 submitted 24 August, 2022;
originally announced August 2022.
-
Optimal Redundancy Allocation in Coherent Systems with Heterogeneous Dependent Components
Authors:
Maryam Kelkinnama,
Majid Asadi
Abstract:
This paper is concerned with the optimal number of redundant components to allocate to $n$-component coherent systems consisting of heterogeneous dependent components. We assume that the system is built of $L$ groups of different components, $L\geq 1$, where there are $n_i$ components in group $i$, and $\sum_{i=1}^{L}n_i=n$. The problem of interest is to allocate $v_i$ active redundant components to each component of type $i$, $i=1,\dots, L$. To get the optimal values of $v_i$, we propose two cost-based criteria. The first is based on the costs of renewing the failed components and of refreshing the alive ones at the system failure time. The second is based on the costs of replacing the system at its failure time or at a predetermined time $\tau$, whichever occurs first. The expressions for the proposed functions are derived using the mixture representation of the system reliability function based on the notion of survival signature. We assume that a given copula function models the dependency structure between the components. In the particular case that the system has a series-parallel structure, we provide the formulas for the proposed cost-based functions. The results are discussed numerically for some specific coherent systems.
Submitted 12 February, 2022;
originally announced February 2022.
-
Statistical Inference from Partially Nominated Sets: An Application to Estimating the Prevalence of Osteoporosis
Authors:
Zeinab Akbari Ghamsari,
Ehsan Zamanzade,
Majid Asadi
Abstract:
This paper focuses on drawing statistical inference based on a novel variant of maxima or minima nomination sampling (NS) designs. These sampling designs are useful for obtaining more representative sample units from the tails of the population distribution using the available auxiliary ranking information. However, one common difficulty in performing NS in practice is that the researcher cannot obtain a nominated sample unless he/she uniquely determines the sample unit with the highest or the lowest rank in each set. To overcome this problem, a variant of NS called partial nomination sampling is proposed, in which the researcher is allowed to declare that two or more units are tied in the ranks whenever he/she cannot find with high confidence the sample unit with the highest or the lowest rank. Based on this sampling design, two asymptotically unbiased estimators are developed for the cumulative distribution function, which are obtained using maximum likelihood and moment-based approaches, and their asymptotic normality is proved. Several numerical studies have shown that the developed estimators have higher relative efficiencies than their counterparts in simple random sampling (SRS) in analyzing either the upper or the lower tail of the parent distribution. The procedures that we have developed are then implemented on a real dataset from the Third National Health and Nutrition Examination Survey (NHANES III) to estimate the prevalence of osteoporosis among adult women aged 50 and over. It is shown that in certain circumstances, the techniques that we have developed require only one-third of the sample size needed in SRS to achieve the desired precision. This results in a considerable reduction of time and cost compared to the standard SRS method.
Submitted 30 May, 2021;
originally announced May 2021.
-
A ranked-based estimator of the mean past lifetime with its application
Authors:
Elham Zamanzade,
Majid Asadi,
Afshin Parvardeh,
Ehsan Zamanzade
Abstract:
The mean past lifetime (MPL) is an important tool in reliability and survival analysis for measuring the average time elapsed since the occurrence of an event, under the condition that the event has occurred before a specific time $t>0$. This article develops a nonparametric estimator for MPL based on observations collected according to a ranked set sampling (RSS) design. It is shown that the estimator that we have developed is strongly uniformly consistent. It is also proved that the introduced estimator tends to a Gaussian process under some mild conditions. A Monte Carlo simulation study is employed to evaluate the performance of the proposed estimator against its competitor in simple random sampling (SRS). Our findings show that the introduced estimator is more efficient than its counterpart in SRS as long as the quality of ranking is better than random. Finally, an illustrative example is provided to describe the potential application of the developed estimator in assessing the average time between infection and diagnosis in HIV patients.
Submitted 20 May, 2021;
originally announced May 2021.
-
Probability Link Models with Symmetric Information Divergence
Authors:
Majid Asadi,
Karthik Devarajan,
Nader Ebrahimi,
Ehsan Soofi,
Lauren Spirko-Burns
Abstract:
This paper introduces link functions for transforming one probability distribution to another such that the Kullback-Leibler and Rényi divergences between the two distributions are symmetric. Two general classes of link models are proposed. The first model links two survival functions and is applicable to models such as the proportional odds and change point, which are used in survival analysis and reliability modeling. A prototype application involving the proportional odds model demonstrates advantages of symmetric divergence measures over asymmetric measures for assessing the efficacy of features and for model averaging purposes. The advantages include providing unique ranks for models and unique information weights for model averaging with one-half the computation required by asymmetric divergences. The second model links two cumulative probability distribution functions. This model produces a generalized location model, which is a continuous counterpart of binary probability models such as the probit and logit models. Examples include the generalized probit and logit models which have appeared in the survival analysis literature, and a generalized Laplace model and a generalized Student-$t$ model, which are survival time models corresponding to the respective binary probability models. Lastly, extensions to symmetric divergence between survival functions and conditions for copula dependence information are presented.
Submitted 10 August, 2020;
originally announced August 2020.
-
Model-Based Reinforcement Learning Exploiting State-Action Equivalence
Authors:
Mahsa Asadi,
Mohammad Sadegh Talebi,
Hippolyte Bourel,
Odalric-Ambrym Maillard
Abstract:
Leveraging an equivalence property in the state-space of a Markov Decision Process (MDP) has been investigated in several studies. This paper studies equivalence structure in the reinforcement learning (RL) setup, where transition distributions are no longer assumed to be known. We present a notion of similarity between transition probabilities of various state-action pairs of an MDP, which naturally defines an equivalence structure in the state-action space. We present equivalence-aware confidence sets for the case where the learner knows the underlying structure in advance. These sets are provably smaller than their corresponding equivalence-oblivious counterparts. In the more challenging case of an unknown equivalence structure, we present an algorithm called ApproxEquivalence that seeks to find an (approximate) equivalence structure, and define confidence sets using the approximate equivalence. To illustrate the efficacy of the presented confidence sets, we present C-UCRL, as a natural modification of UCRL2 for RL in undiscounted MDPs. In the case of a known equivalence structure, we show that C-UCRL improves over UCRL2 in terms of regret by a factor of $\sqrt{SA/C}$, in any communicating MDP with $S$ states, $A$ actions, and $C$ classes, which corresponds to a massive improvement when $C \ll SA$. To the best of our knowledge, this is the first work providing regret bounds for RL when an equivalence structure in the MDP is efficiently exploited. In the case of an unknown equivalence structure, we show through numerical experiments that C-UCRL combined with ApproxEquivalence outperforms UCRL2 in ergodic MDPs.
Submitted 9 October, 2019;
originally announced October 2019.
-
A Unified Approach to Construct Correlation Coefficient Between Random Variables
Authors:
Majid Asadi,
Somayeh Zarezadeh
Abstract:
Measuring the correlation (association) between two random variables is one of the important goals in statistical applications. In the literature, the covariance between two random variables is a widely used criterion for measuring the linear association between them. In this paper, we first propose a covariance-based unified measure of variability for a continuous random variable $X$ and show that several measures of variability and uncertainty, such as variance, Gini mean difference, cumulative residual entropy, etc., can be considered as special cases. Then, we propose a unified measure of correlation between two continuous random variables $X$ and $Y$, with distribution functions (DFs) $F$ and $G$, based on the covariance between $X$ and $H^{-1}G(Y)$ (known as the Q-transformation of $H$ on $G$), where $H$ is a continuous DF. We show that our proposed measure of association subsumes some of the existing measures of correlation. Under a mild condition on $H$, it is shown that the suggested index takes values in $[-1,1]$, where the extremes of the range, i.e., $-1$ and $1$, are attained by the Fréchet bivariate minimal and maximal DFs, respectively. A special case of the proposed correlation measure leads to a variant of the Pearson correlation coefficient which, as a measure of strength and direction of the linear relationship between $X$ and $Y$, has absolute values greater than or equal to the Pearson correlation. The results are examined numerically for some well-known bivariate DFs.
Submitted 28 October, 2018; v1 submitted 28 September, 2018;
originally announced September 2018.
-
A Shock Model Based Approach to Network Reliability
Authors:
S. Zarezadeh,
S. Ashrafi,
M. Asadi
Abstract:
We consider a network consisting of $n$ components (links or nodes) and assume that the network has two states, up and down. We further suppose that the network is subject to shocks that appear according to a counting process and that each shock may lead to component failures. Under some assumptions on the shock occurrences, we present a new variant of the notion of signature, which we call the t-signature. Then, t-signature-based mixture representations for the reliability function of the network are obtained. Several stochastic properties of the network lifetime are investigated. In particular, under the assumption that the number of failures at each shock follows a binomial distribution and the process of shocks is a non-homogeneous Poisson process, an explicit form of the network reliability is derived and its aging properties are explored. Several examples are also provided.
Submitted 15 July, 2015;
originally announced July 2015.