Skip to main content

Showing 1–17 of 17 results for author: Petrović, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.04062  [pdf, other

    stat.CO math.NA

    Multilevel Sampling in Algebraic Statistics

    Authors: Nathan Kirk, Ivan Gvozdanović, Sonja Petrović

    Abstract: This paper proposes a multilevel sampling algorithm for fiber sampling problems in algebraic statistics, inspired by Henry Wynn's suggestion to adapt multilevel Monte Carlo (MLMC) ideas to discrete models. Focusing on log-linear models, we sample from high-dimensional lattice fibers defined by algebraic constraints. Building on Markov basis methods and results from Diaconis and Sturmfels, our algo… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 21 pages, 7 figures

    MSC Class: 62R01 (Primary) 62-08; 52B20 (Secondary)

  2. arXiv:2405.13950  [pdf, other

    stat.ML cs.LG

    Learning to sample fibers for goodness-of-fit testing

    Authors: Ivan Gvozdanović, Sonja Petrović

    Abstract: We consider the problem of constructing exact goodness-of-fit tests for discrete exponential family models. This classical problem remains practically unsolved for many types of structured or sparse data, as it rests on a computationally difficult core task: to produce a reliable sample from lattice points in a high-dimensional polytope. We translate the problem into a Markov decision process and… ▽ More

    Submitted 15 April, 2025; v1 submitted 22 May, 2024; originally announced May 2024.

    MSC Class: 62R01

  3. arXiv:2307.02428  [pdf, other

    stat.CO math.AC math.ST

    Sampling lattice points in a polytope: a Bayesian biased algorithm with random updates

    Authors: Miles Bakenhus, Sonja Petrović

    Abstract: The set of nonnegative integer lattice points in a polytope, also known as the fiber of a linear map, makes an appearance in several applications including optimization and statistics. We address the problem of sampling from this set using three ingredients: an easy-to-compute lattice basis of the constraint matrix, a biased sampling algorithm with a Bayesian framework, and a step-wise selection m… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 22 pages, 12 figures

    MSC Class: 62R01 (Primary) 62-08; 52B20 (Secondary)

    Journal ref: Alg. Stat. 15 (2024) 61-83

  4. arXiv:2306.06270  [pdf, other

    stat.ME math.AC math.CO

    Markov bases: a 25 year update

    Authors: Félix Almendra-Hernández, Jesús A. De Loera, Sonja Petrović

    Abstract: In this paper, we evaluate the challenges and best practices associated with the Markov bases approach to sampling from conditional distributions. We provide insights and clarifications after 25 years of the publication of the fundamental theorem for Markov bases by Diaconis and Sturmfels. In addition to a literature review we prove three new results on the complexity of Markov bases in hierarchic… ▽ More

    Submitted 9 January, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 24 pages, 3 figures

    MSC Class: 62R01; 62-08; 62P10; 62H17

  5. arXiv:2108.05555  [pdf, other

    stat.ME math.PR math.ST

    Longitudinal Network Models and Permutation-Uniform Markov Chains

    Authors: William K. Schwartz, Sonja Petrović, Hemanshu Kaul

    Abstract: Consider longitudinal networks whose edges turn on and off according to a discrete-time Markov chain with exponential-family transition probabilities. We characterize when their joint distributions are also exponential families with the same parameter, improving data reduction. Further we show that the permutation-uniform subclass of these chains permit interpretation as an independent, identicall… ▽ More

    Submitted 10 March, 2024; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: 22 pages plus references and appendices. This is the accepted version of the final published article

    MSC Class: 60J10 (Primary); 05C80; 60B20; 62M05; 62M02; 62B05; 62F10; 60F99; 60G50; 62R01 (Secondary)

    Journal ref: Scandinavian Journal of Statistics 50.3 (September 2023) 1201-1231

  6. arXiv:2106.03676  [pdf, other

    math.AC cs.LG cs.SC math.AG stat.ML

    Learning a performance metric of Buchberger's algorithm

    Authors: Jelena Mojsilović, Dylan Peifer, Sonja Petrović

    Abstract: What can be (machine) learned about the complexity of Buchberger's algorithm? Given a system of polynomials, Buchberger's algorithm computes a Gröbner basis of the ideal these polynomials generate using an iterative procedure based on multivariate long division. The runtime of each step of the algorithm is typically dominated by a series of polynomial additions, and the total number of these add… ▽ More

    Submitted 31 May, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Journal ref: Involve 16 (2023) 227-248

  7. arXiv:2104.03167  [pdf, other

    stat.ME math.CO stat.AP

    Goodness of fit for log-linear ERGMs

    Authors: Elizabeth Gross, Sonja Petrović, Despina Stasi

    Abstract: Many popular models from the networks literature can be viewed through a common lens of contingency tables on network dyads, resulting in \emph{log-linear ERGMs}: exponential family models for random graphs whose sufficient statistics are linear on the dyads. We propose a new model in this family, the \emph{$p_1$-SBM}, which combines node and group effects common in network formation mechanisms. I… ▽ More

    Submitted 3 March, 2024; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: Link to supplementary code provided

    MSC Class: 62R01; 62P10; 62-08; 62H17

  8. arXiv:1910.01692  [pdf, other

    stat.ME math.ST

    Algebraic statistics, tables, and networks: The Fienberg advantage

    Authors: Elizabeth Gross, Vishesh Karwa, Sonja Petrović

    Abstract: Stephen Fienberg's affinity for contingency table problems and reinterpreting models with a fresh look gave rise to a new approach for hypothesis testing of network models that are linear exponential families. We outline his vision and influence in this fundamental problem, as well as generalizations to multigraphs and hypergraphs.

    Submitted 3 October, 2019; originally announced October 2019.

  9. arXiv:1907.07320  [pdf, other

    math.ST math.AC stat.ME

    What is... a Markov basis?

    Authors: Sonja Petrović

    Abstract: This short piece defines a Markov basis. The aim is to introduce the statistical concept to mathematicians.

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: AMS Notices piece

  10. arXiv:1612.06040  [pdf, other

    stat.ME math.ST

    Monte Carlo goodness-of-fit tests for degree corrected and related stochastic blockmodels

    Authors: Vishesh Karwa, Debdeep Pati, Sonja Petrović, Liam Solus, Nikita Alexeev, Mateja Raič, Dane Wilburne, Robert Williams, Bowei Yan

    Abstract: We construct Bayesian and frequentist finite-sample goodness-of-fit tests for three different variants of the stochastic blockmodel for network data. Since all of the stochastic blockmodel variants are log-linear in form when block assignments are known, the tests for the \emph{latent} block model versions combine a block membership estimator with the algebraic statistics machinery for testing goo… ▽ More

    Submitted 6 March, 2024; v1 submitted 18 December, 2016; originally announced December 2016.

    Comments: substantial revision from v3, updated simulations and theoretical discussions

    MSC Class: 62R01; 05C82

    Journal ref: Journal of the Royal Statistical Society Series B: Statistical Methodology, Volume 86, Issue 1, February 2024, Pages 90-121

  11. arXiv:1612.03054  [pdf, other

    stat.ME cs.DM cs.SI stat.CO

    DERGMs: Degeneracy-restricted exponential random graph models

    Authors: Vishesh Karwa, Sonja Petrović, Denis Bajić

    Abstract: Exponential random graph models, or ERGMs, are a flexible and general class of models for modeling dependent data. While the early literature has shown them to be powerful in capturing many network features of interest, recent work highlights difficulties related to the models' ill behavior, such as most of the probability mass being concentrated on a very small subset of the parameter space. This… ▽ More

    Submitted 7 January, 2022; v1 submitted 9 December, 2016; originally announced December 2016.

    Comments: Version 3

  12. arXiv:1608.06667  [pdf, other

    stat.AP cs.DL cs.SI physics.soc-ph

    Coauthorship and citation networks for statisticians: Comment

    Authors: Vishesh Karwa, Sonja Petrović

    Abstract: This is a comment on the paper arXiv:1410.2840 by Ji and Jin, to appear in the AOAS.

    Submitted 23 August, 2016; originally announced August 2016.

  13. arXiv:1510.02838  [pdf, other

    cs.DM math.CO stat.ME

    A survey of discrete methods in (algebraic) statistics for networks

    Authors: Sonja Petrović

    Abstract: Sampling algorithms, hypergraph degree sequences, and polytopes play a crucial role in statistical analysis of network data. This article offers a brief overview of open problems in this area of discrete mathematics from the point of view of a particular family of statistical models for networks called exponential random graph models. The problems and underlying constructions are also related to w… ▽ More

    Submitted 8 January, 2016; v1 submitted 9 October, 2015; originally announced October 2015.

    Comments: Revised for clarity, minor updates, added example, upon suggestions of people mentioned in the acknowledgements section

  14. arXiv:1410.7357  [pdf, other

    math.ST cs.SI physics.soc-ph stat.CO

    Statistical models for cores decomposition of an undirected random graph

    Authors: Vishesh Karwa, Michael J. Pelsmajer, Sonja Petrović, Despina Stasi, Dane Wilburne

    Abstract: The $k$-core decomposition is a widely studied summary statistic that describes a graph's global connectivity structure. In this paper, we move beyond using $k$-core decomposition as a tool to summarize a graph and propose using $k$-core decomposition as a tool to model random graphs. We propose using the shell distribution vector, a way of summarizing the decomposition, as a sufficient statistic… ▽ More

    Submitted 28 November, 2016; v1 submitted 27 October, 2014; originally announced October 2014.

    Comments: Subsection 3.1 is new: `Sample space restriction and degeneracy of real-world networks'. Several clarifying comments have been added. Discussion now mentions 2 additional specific open problems. Bibliography updated. 25 pages (including appendix), ~10 figures

  15. arXiv:1401.4896  [pdf, other

    stat.ME math.CO stat.CO

    Goodness-of-fit for log-linear network models: Dynamic Markov bases using hypergraphs

    Authors: Elizabeth Gross, Sonja Petrović, Despina Stasi

    Abstract: Social networks and other large sparse data sets pose significant challenges for statistical inference, as many standard statistical methods for testing model fit are not applicable in such settings. Algebraic statistics offers a theoretically justified approach to goodness-of-fit testing that relies on the theory of Markov bases and is intimately connected with the geometry of the model as descri… ▽ More

    Submitted 20 January, 2014; originally announced January 2014.

  16. arXiv:1208.6550  [pdf, ps, other

    math.AC stat.CO

    Graphical models in Macaulay2

    Authors: Luis David García-Puente, Sonja Petrović, Seth Sullivant

    Abstract: The Macaulay2 package GraphicalModels contains algorithms for the algebraic study of graphical models associated to undirected, directed and mixed graphs, and associated collections of conditional independence statements. Among the algorithms implemented are procedures for computing the vanishing ideal of graphical models, for generating conditional independence ideals of families of independence… ▽ More

    Submitted 8 January, 2013; v1 submitted 31 August, 2012; originally announced August 2012.

    Comments: Several changes to address referee comments and suggestions. We will eventually include this package in the standard distribution of Macaulay2. But until then, the associated Macaulay2 file can be found at http://www.shsu.edu/~ldg005/papers.html

    MSC Class: 13P25 (Primary) 62-04; 14Q15; 68W30 (Secondary)

  17. arXiv:1105.6145  [pdf, ps, other

    stat.OT cs.DM math.ST

    Maximum lilkelihood estimation in the $β$-model

    Authors: Alessandro Rinaldo, Sonja Petrović, Stephen E. Fienberg

    Abstract: We study maximum likelihood estimation for the statistical model for undirected random graphs, known as the $β$-model, in which the degree sequences are minimal sufficient statistics. We derive necessary and sufficient conditions, based on the polytope of degree sequences, for the existence of the maximum likelihood estimator (MLE) of the model parameters. We characterize in a combinatorial fashio… ▽ More

    Submitted 18 June, 2013; v1 submitted 30 May, 2011; originally announced May 2011.

    Comments: Published in at http://dx.doi.org/10.1214/12-AOS1078 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1078

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 3, 1085-1110