-
arXiv:1609.04558 [pdf, ps, other]
Statistical Inference in a Directed Network Model with Covariates
Abstract: Networks are often characterized by node heterogeneity for which nodes exhibit different degrees of interaction and link homophily for which nodes sharing common features tend to associate with each other. In this paper, we propose a new directed network model to capture the former via node-specific parametrization and the latter by incorporating covariates. In particular, this model quantifies th… ▽ More
Submitted 10 March, 2018; v1 submitted 15 September, 2016; originally announced September 2016.
Comments: 29 pages. minor revision
-
$β$ models for random hypergraphs with a given degree sequence
Abstract: We introduce the beta model for random hypergraphs in order to represent the occurrence of multi-way interactions among agents in a social network. This model builds upon and generalizes the well-studied beta model for random graphs, which instead only considers pairwise interactions. We provide two algorithms for fitting the model parameters, IPS (iterative proportional scaling) and fixed point a… ▽ More
Submitted 3 July, 2014; originally announced July 2014.
Comments: 9 pages, 2 figures, Proceedings of 21st International Conference on Computational Statistics (2014), to appear
-
arXiv:1311.7513 [pdf, ps, other]
From Statistical Evidence to Evidence of Causality
Abstract: While statisticians and quantitative social scientists typically study the "effects of causes" (EoC), Lawyers and the Courts are more concerned with understanding the "causes of effects" (CoE). EoC can be addressed using experimental design and statistical analysis, but it is less clear how to incorporate statistical or epidemiological evidence into CoE reasoning, as might be required for a case a… ▽ More
Submitted 25 October, 2014; v1 submitted 29 November, 2013; originally announced November 2013.
Comments: 27 pages, 1 table, 9 figures. This is a fairly substantial revision of version 1
MSC Class: 62
Journal ref: Bayesian Analysis, Volume 11, Number 3 (2016), 725-752
-
arXiv:1105.6145 [pdf, ps, other]
Maximum lilkelihood estimation in the $β$-model
Abstract: We study maximum likelihood estimation for the statistical model for undirected random graphs, known as the $β$-model, in which the degree sequences are minimal sufficient statistics. We derive necessary and sufficient conditions, based on the polytope of degree sequences, for the existence of the maximum likelihood estimator (MLE) of the model parameters. We characterize in a combinatorial fashio… ▽ More
Submitted 18 June, 2013; v1 submitted 30 May, 2011; originally announced May 2011.
Comments: Published in at http://dx.doi.org/10.1214/12-AOS1078 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS1078
Journal ref: Annals of Statistics 2013, Vol. 41, No. 3, 1085-1110
-
arXiv:1104.3618 [pdf, ps, other]
Maximum likelihood estimation in log-linear models
Abstract: We study maximum likelihood estimation in log-linear models under conditional Poisson sampling schemes. We derive necessary and sufficient conditions for existence of the maximum likelihood estimator (MLE) of the model parameters and investigate estimability of the natural and mean-value parameters under a nonexistent MLE. Our conditions focus on the role of sampling zeros in the observed table. W… ▽ More
Submitted 23 July, 2012; v1 submitted 18 April, 2011; originally announced April 2011.
Comments: Published in at http://dx.doi.org/10.1214/12-AOS986 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS986
Journal ref: Annals of Statistics 2012, Vol. 40, No. 2, 996-1023
-
arXiv:1010.0745 [pdf, ps, other]
On the Existence of the MLE for a Directed Random Graph Network Model with Reciprocation
Abstract: Holland and Leinhardt (1981) proposed a directed random graph model, the p1 model, to describe dyadic interactions in a social network. In previous work (Petrovic et al., 2010), we studied the algebraic properties of the p1 model and showed that it is a toric model specified by a multi-homogeneous ideal. We conducted an extensive study of the Markov bases for p1 that incorporate explicitly the con… ▽ More
Submitted 4 October, 2010; originally announced October 2010.
-
Algebraic statistics for a directed random graph model with reciprocation
Abstract: The p_1 model is a directed random graph model used to describe dyadic interactions in a social network in terms of effects due to differential attraction (popularity) and expansiveness, as well as an additional effect due to reciprocation. In this article we carry out an algebraic statistics analysis of this model. We show that the p_1 model is a toric model specified by a multi-homogeneous ide… ▽ More
Submitted 16 March, 2010; v1 submitted 31 August, 2009; originally announced September 2009.
Comments: 22 pages. 4 figures depicting relevant Markov moves. One section removed from previous version.
-
Maximum Likelihood Estimation in Latent Class Models For Contingency Table Data
Abstract: Statistical models with latent structure have a history going back to the 1950s and have seen widespread use in the social sciences and, more recently, in computational biology and in machine learning. Here we study the basic latent class model proposed originally by the sociologist Paul F. Lazarfeld for categorical variables, and we explain its geometric structure. We draw parallels between the… ▽ More
Submitted 21 September, 2007; originally announced September 2007.
-
Mixed membership stochastic blockmodels
Abstract: Observations consisting of measurements on relationships for pairs of objects arise in many settings, such as protein interaction and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing such data with probabilisic models can be delicate because the simple exchangeability assumptions underlying many boilerplate models no longer hold. In this paper, we d… ▽ More
Submitted 30 May, 2007; originally announced May 2007.
Comments: 46 pages, 14 figures, 3 tables
Journal ref: Journal of Machine Learning Research, 9, 1981-2014.
-
arXiv:math/0612788 [pdf, ps, other]
Comment: Complex Causal Questions Require Careful Model Formulation: Discussion of Rubin on Experiments with "Censoring" Due to Death
Abstract: Comment on Complex Causal Questions Require Careful Model Formulation: Discussion of Rubin on Experiments with ``Censoring'' Due to Death [math.ST/0612783]
Submitted 27 December, 2006; originally announced December 2006.
Comments: Published at http://dx.doi.org/10.1214/088342306000000295 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-STS-STS160C
Journal ref: Statistical Science 2006, Vol. 21, No. 3, 317-318
-
arXiv:math/0609288 [pdf, ps, other]
Privacy and Confidentiality in an e-Commerce World: Data Mining, Data Warehousing, Matching and Disclosure Limitation
Abstract: The growing expanse of e-commerce and the widespread availability of online databases raise many fears regarding loss of privacy and many statistical challenges. Even with encryption and other nominal forms of protection for individual databases, we still need to protect against the violation of privacy through linkages across multiple databases. These issues parallel those that have arisen and… ▽ More
Submitted 11 September, 2006; originally announced September 2006.
Comments: Published at http://dx.doi.org/10.1214/088342306000000240 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-STS-STS172
Journal ref: Statistical Science 2006, Vol. 21, No. 2, 143-154
-
arXiv:math/0405044 [pdf, ps, other]
Polyhedral conditions for the nonexistence of the MLE for hierarchical log-linear models
Abstract: We provide a polyhedral description of the conditions for the existence of the maximum likelihood estimate (MLE) for a hierarchical log-linear model. The MLE exists if and only if the observed margins lie in the relative interior of the marginal cone. Using this description, we give an algorithm for determining if the MLE exists. If the tree width is bounded, the algorithm runs in polynomial tim… ▽ More
Submitted 3 May, 2004; originally announced May 2004.
Comments: 15 pages