-
Differential Privacy and Survey Sampling
Authors:
Daniel Bernard Bonnéry,
Julien Jamme
Abstract:
The Horvitz-Thompson estimate of a total can be seen as as differentially private mechanism applied to this population total. We provide forumlae to compute the $ε$ and $δ$ parameter for this specific mecanism, coupled or not coupled with the addition of a Laplace or a Gaussian noise. This allows to determine the scale of the Laplace privacy mechanism to be added to reach a specified level of priv…
▽ More
The Horvitz-Thompson estimate of a total can be seen as as differentially private mechanism applied to this population total. We provide forumlae to compute the $ε$ and $δ$ parameter for this specific mecanism, coupled or not coupled with the addition of a Laplace or a Gaussian noise. This allows to determine the scale of the Laplace privacy mechanism to be added to reach a specified level of privacy, expressed in terms of $ε,δ$ differential privacy. In particular, we provide simple formulae for the special case of simple random sampling on binary data.
△ Less
Submitted 17 June, 2025;
originally announced June 2025.
-
The effect of Informative Selection on the estimation of parameters related to Spatial Processes
Authors:
Daniel Bonnery,
Francesco Pantalone,
M. Giovanna Ranalli
Abstract:
This paper extends the concept of informative selection, population distribution and sample distribution to a spatial process context. These notions were first defined in a context where the output of the random process of interest consists of independent and identically distributed realisations for each individual of a population. It has been showed that informative selection was inducing a stoch…
▽ More
This paper extends the concept of informative selection, population distribution and sample distribution to a spatial process context. These notions were first defined in a context where the output of the random process of interest consists of independent and identically distributed realisations for each individual of a population. It has been showed that informative selection was inducing a stochastic dependence among realisations on the selected units. In the context of spatial processes, the "population" is a continuous space and realisations for two different elements of the population are not independent. We show how informative selection may induce a different dependence among selected units and how the sample distribution differs from the population distribution.
△ Less
Submitted 18 March, 2021;
originally announced March 2021.
-
On the definition of informative vs. ignorable nuisance process
Authors:
Daniel Bonnery,
Joseph Sedransk
Abstract:
This paper is an early version.
We propose to generalise the notion of "ignoring" a random process as well as the notions of informative and ignorable random processes in a very general setup and for different types of inference (Bayesian or frequentist), and for different purposes (estimation, prediction or testing). We then confront the definitions we propose to mentions or definitions of info…
▽ More
This paper is an early version.
We propose to generalise the notion of "ignoring" a random process as well as the notions of informative and ignorable random processes in a very general setup and for different types of inference (Bayesian or frequentist), and for different purposes (estimation, prediction or testing). We then confront the definitions we propose to mentions or definitions of informative and ignorable processes found in the litterature. To that purpose, we provide a very general statistical framework for survey sampling in order to define precisely the notions of design and selection, and to serve to illustrate and discuss the notions proposed.
△ Less
Submitted 6 June, 2019;
originally announced June 2019.
-
Uniform convergence of the empirical cumulative distribution function under informative selection from a finite population
Authors:
Daniel Bonnéry,
F. Jay Breidt,
François Coquet
Abstract:
Consider informative selection of a sample from a finite population. Responses are realized as independent and identically distributed (i.i.d.) random variables with a probability density function (p.d.f.) f, referred to as the superpopulation model. The selection is informative in the sense that the sample responses, given that they were selected, are not i.i.d. f. In general, the informative sel…
▽ More
Consider informative selection of a sample from a finite population. Responses are realized as independent and identically distributed (i.i.d.) random variables with a probability density function (p.d.f.) f, referred to as the superpopulation model. The selection is informative in the sense that the sample responses, given that they were selected, are not i.i.d. f. In general, the informative selection mechanism may induce dependence among the selected observations. The impact of such dependence on the empirical cumulative distribution function (c.d.f.) is studied. An asymptotic framework and weak conditions on the informative selection mechanism are developed under which the (unweighted) empirical c.d.f. converges uniformly, in $L_2$ and almost surely, to a weighted version of the superpopulation c.d.f. This yields an analogue of the Glivenko-Cantelli theorem. A series of examples, motivated by real problems in surveys and other observational studies, shows that the conditions are verifiable for specified designs.
△ Less
Submitted 23 November, 2012;
originally announced November 2012.