Showing 1–21 of 21 results for author: Darling, R W R

  1. arXiv:2506.15796  [pdf, ps, other]

    math.CO

    Prüfer codes on vertex-colored rooted trees

    Authors: R. W. R. Darling, Grant Fickes

    Abstract: Prüfer codes provide an encoding scheme for representing a vertex-labeled tree on $n$ vertices with a string of length $n-2$. Indeed, two labeled trees are isomorphic if and only if their Prüfer codes are identical, and this supplies a proof of Cayley's Theorem. Motivated by a graph decomposition of freight networks into a corpus of vertex-colored rooted trees, we extend the notion of Prüfer codes…

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 25 pages, 13 figures

    MSC Class: 05C05 ACM Class: E.1

  2. arXiv:2302.02200  [pdf, other]

    math.CO math.ST

    Rank-based linkage I: triplet comparisons and oriented simplicial complexes

    Authors: R. W. R. Darling, Will Grilliette, Adam Logan

    Abstract: Rank-based linkage is a new tool for summarizing a collection $S$ of objects according to their relationships. These objects are not mapped to vectors, and "similarity" between objects need be neither numerical nor symmetrical. All an object needs to do is rank nearby objects by similarity to itself, using a Comparator which is transitive, but need not be consistent with any metric on the whole…

    Submitted 20 April, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: 37 pages, 12 figures

    MSC Class: 62H30 (Primary) 05C20; 05E45; 05C76 (Secondary) ACM Class: G.4

  3. arXiv:2204.01142   

    math.AT cs.CG cs.LG math.CO

    Proceedings of TDA: Applications of Topological Data Analysis to Data Science, Artificial Intelligence, and Machine Learning Workshop at SDM 2022

    Authors: R. W. R. Darling, John A. Emanuello, Emilie Purvine, Ahmad Ridley

    Abstract: Topological Data Analysis (TDA) is a rigorous framework that borrows techniques from geometric and algebraic topology, category theory, and combinatorics in order to study the "shape" of such complex high-dimensional data. Research in this area has grown significantly over the last several years bringing a deeply rooted theory to bear on practical applications in areas such as genomics, natural la…

    Submitted 14 April, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

  4. arXiv:2108.08864  [pdf, other]

    cs.DS math.CO math.PR

    Partitioned K-nearest neighbor local depth for scalable comparison-based learning

    Authors: Jacob D. Baron, R. W. R. Darling, J. Laylon Davis, R. Pettit

    Abstract: A triplet comparison oracle on a set $S$ takes an object $x \in S$ and for any pair $\{y, z\} \subset S \setminus \{x\}$ declares which of $y$ and $z$ is more similar to $x$. Partitioned Local Depth (PaLD) supplies a principled non-parametric partitioning of $S$ under such triplet comparisons but needs $O(n^2 \log{n})$ oracle calls and $O(n^3)$ post-processing steps. We introduce Partitioned Nea…

    Submitted 2 December, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: 27 pages, 2 figures

    MSC Class: 90C35 ACM Class: F.2.2

  5. arXiv:2102.09581  [pdf, other]

    math.PR

    Hidden Ancestor Graphs: Models for Detagging Property Graphs

    Authors: R. W. R. Darling, Gregory S. Clark, J. D. Tucker

    Abstract: Consider a graph $G$ where each vertex is visibly labelled as a member of a distinct class, but also has a hidden binary state: wild or tame. Edges with end points in the same class are called agreement edges. Premise: an edge connecting vertices in different classes -- a conflict edge -- is allowed only when at least one end point is wild. Interpret wild status as readiness to form connections wi…

    Submitted 13 December, 2023; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: 35 pages, 12 figures

    MSC Class: 05C80

  6. arXiv:1908.07645  [pdf, other]

    math.CO math.ST

    K-Nearest Neighbor Approximation Via the Friend-of-a-Friend Principle

    Authors: Jacob D. Baron, R. W. R. Darling

    Abstract: Suppose $V$ is an $n$-element set where for each $x \in V$, the elements of $V \setminus \{x\}$ are ranked by their similarity to $x$. The $K$-nearest neighbor graph is a directed graph including an arc from each $x$ to the $K$ points of $V \setminus \{x\}$ most similar to $x$. Constructive approximation to this graph using far fewer than $n^2$ comparisons is important for the analysis of large hi…

    Submitted 28 December, 2020; v1 submitted 20 August, 2019; originally announced August 2019.

    Comments: 31 pages, 5 figures

    MSC Class: 90C35; 06A07

  7. arXiv:1811.04483  [pdf, ps, other]

    math.CO math.PR

    Anomaly Detection and Correction in Large Labeled Bipartite Graphs

    Authors: R. W. R. Darling, Mark L. Velednitsky

    Abstract: Binary classification problems can be naturally modeled as bipartite graphs, where we attempt to classify right nodes based on their left adjacencies. We consider the case of labeled bipartite graphs in which some labels and edges are not trustworthy. Our goal is to reduce noise by identifying and fixing these labels and edges. We first propose a geometric technique for generating random graph i…

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: 36 pages, 4 figures

    Report number: SUMMER 2016 INTERN PROJECT MSC Class: 05C78

  8. arXiv:1810.02016  [pdf, other]

    math.CO math.ST

    The Four Point Permutation Test for Latent Block Structure in Incidence Matrices

    Authors: R W R Darling, Cheyne Homberger

    Abstract: Transactional data may be represented as a bipartite graph $G:=(L \cup R, E)$, where $L$ denotes agents, $R$ denotes objects visible to many agents, and an edge in $E$ denotes an interaction between an agent and an object. Unsupervised learning seeks to detect block structures in the adjacency matrix $Z$ between $L$ and $R$, thus grouping together sets of agents with similar object interactions. N…

    Submitted 19 July, 2019; v1 submitted 3 October, 2018; originally announced October 2018.

    Comments: 41 pages, 14 figures

    MSC Class: 62H20

  9. arXiv:1809.08723  [pdf, other]

    math.CO math.OC

    The Combinatorial Data Fusion Problem in Conflicted-supervised Learning

    Authors: R. W. R. Darling, David G. Harris, Dev R. Phulara, John A. Proos

    Abstract: The best merge problem in industrial data science generates instances where disparate data sources place incompatible relational structures on the same set $V$ of objects. Graph vertex labelling data may include (1) missing or erroneous labels, (2) assertions that two vertices carry the same (unspecified) label, and (3) denying some subset of vertices from carrying the same label. Conflicted-superv…

    Submitted 23 September, 2018; originally announced September 2018.

    Comments: 48 pages, 10 figures

    MSC Class: 05C85

  10. arXiv:1805.09443  [pdf, other]

    math.PR

    Euclidean Embedding of the Poisson Weighted Infinite Tree and Application to Mobility Models

    Authors: R. W. R. Darling, Robin Pemantle

    Abstract: Continuous time branching models are used to create random fractals in a Euclidean space, whose Hausdorff dimension is controlled by an input parameter. Finite realizations are applied in modelling the set of sites visited in models of human and animal mobility.

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: 23 pages, 2 figures

    MSC Class: 60J80; 60J85

  11. Rank deficiency in sparse random GF[2] matrices

    Authors: R. W. R. Darling, Mathew D. Penrose, Andrew R. Wade, Sandy L. Zabell

    Abstract: Let $M$ be a random $m \times n$ matrix with binary entries and i.i.d. rows. The weight (i.e., number of ones) of a row has a specified probability distribution, with the row chosen uniformly at random given its weight. Let $N(n,m)$ denote the number of left null vectors in $\{0,1\}^m$ for $M$ (including the zero vector), where addition is mod 2. We take $n, m \to \infty$, with $m/n \to \alpha > 0$, while…

    Submitted 23 November, 2012; originally announced November 2012.

    Comments: 49 pages, 4 figures

    MSC Class: 60C05 (Primary) 05C65; 05C80; 15B52; 60B20; 60F10 (Secondary)

    Journal ref: Electronic Journal of Probability, Vol. 19 (2014), article 83

  12. arXiv:0911.2660  [pdf, ps, other]

    math.NT math.PR

    Maximum GCD Among Pairs of Random Integers

    Authors: R. W. R. Darling, E. E. Pyle

    Abstract: Fix $\alpha > 0$, and sample $N$ integers uniformly at random from $\{1,2,\ldots ,\lfloor e^{\alpha N}\rfloor \}$. Given $\eta > 0$, the probability that the maximum of the pairwise GCDs lies between $N^{2-\eta}$ and $N^{2+\eta}$ converges to 1 as $N\to \infty $. More precise estimates are obtained. This is a Birthday Problem: two of the random integers are likely to share some prime factor of order $N^2/\log [N]$. The…

    Submitted 13 November, 2009; originally announced November 2009.

    Comments: 11 pages

    MSC Class: 11K99

  13. Differential equation approximations for Markov chains

    Authors: R. W. R. Darling, J. R. Norris

    Abstract: We formulate some simple conditions under which a Markov chain may be approximated by the solution to a differential equation, with quantifiable error probabilities. The role of a choice of coordinate functions for the Markov chain is emphasised. The general theory is illustrated in three examples: the classical stochastic epidemic, a population process model with fast and slow variables, and co…

    Submitted 23 April, 2008; v1 submitted 17 October, 2007; originally announced October 2007.

    Comments: Published at http://dx.doi.org/10.1214/07-PS121 in Probability Surveys (http://www.i-journals.org/ps/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-PS-PS_2007_121 MSC Class: 05C65 (Primary) 60J75; 05C80 (Secondary)

    Journal ref: Probability Surveys 2008, Vol. 5, 37-79

  14. Structure of large random hypergraphs

    Authors: R. W. R. Darling, J. R. Norris

    Abstract: The theme of this paper is the derivation of analytic formulae for certain large combinatorial structures. The formulae are obtained via fluid limits of pure jump-type Markov processes, established under simple conditions on the Laplace transforms of their Levy kernels. Furthermore, a related Gaussian approximation allows us to describe the randomness which may persist in the limit when certain…

    Submitted 22 March, 2005; originally announced March 2005.

    Comments: Published at http://dx.doi.org/10.1214/105051604000000567 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP043 MSC Class: 05C65 (Primary) 60J75; 05C80 (Secondary)

    Journal ref: Annals of Applied Probability 2005, Vol. 15, No. 1A, 125-152

  15. arXiv:math/0312451  [pdf, ps, other]

    math.PR math.CO

    Continuous and discontinuous phase transitions in hypergraph processes

    Authors: R. W. R. Darling, D. A. Levin, J. R. Norris

    Abstract: Let V denote a set of N vertices. To construct a "hypergraph process", create a new hyperedge at each event time of a Poisson process; the cardinality K of this hyperedge is random, with arbitrary probability generating function r(x), except that we assume P(K=1) +P(K=2) > 0. Given K=k, the k vertices appearing in the new hyperedge are selected uniformly at random from V. Hyperedges of cardinali…

    Submitted 2 March, 2004; v1 submitted 24 December, 2003; originally announced December 2003.

    Comments: 25 pages, 2 figures. Revised version. To appear in Random Structures & Algorithms

    MSC Class: 05C80;60F17;05C85

  16. arXiv:math/0210109  [pdf]

    math.PR math.CO

    Fluid Limits of Pure Jump Markov Processes: a Practical Guide

    Authors: R. W. R. Darling

    Abstract: A rescaled Markov chain converges uniformly in probability to the solution of an ordinary differential equation, under carefully specified assumptions. The presentation is much simpler than those in the outside literature. The result may be used to build parsimonious models of large random or pseudo-random systems.

    Submitted 23 December, 2002; v1 submitted 8 October, 2002; originally announced October 2002.

    Comments: 16 pages, 1 figure

    MSC Class: 60F17; 05C80; 60J75

  17. arXiv:math/0109020  [pdf, ps, other]

    math.PR math.CO

    Structure of large random hypergraphs

    Authors: R. W. R. Darling, J. R. Norris

    Abstract: The theme of this paper is the derivation of analytic formulae for certain large combinatorial structures. The formulae are obtained via fluid limits of pure jump type Markov processes, established under simple conditions on the Laplace transforms of their Levy kernels. Furthermore, a related Gaussian approximation allows us to describe the randomness which may persist in the limit when certain…

    Submitted 16 January, 2004; v1 submitted 4 September, 2001; originally announced September 2001.

    Comments: Revised version with minor conceptual improvements and additional discussion. 32 pages, 5 figures

    MSC Class: 05C65; 60J75; 05C80

  18. arXiv:math/9809029  [pdf]

    math.PR math.DG

    Geometrically Intrinsic Nonlinear Recursive Filters II: Foundations

    Authors: R. W. R. Darling

    Abstract: This paper contains the technical foundations from stochastic differential geometry for the construction of geometrically intrinsic nonlinear recursive filters. A diffusion X on a manifold N is run for a time interval T, with a random initial condition. There is a single observation consisting of a nonlinear function of X(T), corrupted by noise, and with values in another manifold M. The noise c…

    Submitted 6 September, 1998; originally announced September 1998.

    Comments: 25 pages

    Report number: UC Berkeley Dept of Stats, Tech. Report 512 MSC Class: 60G35; 58G32; 53B20

  19. arXiv:math/9809028  [pdf]

    math.OC math.PR

    Geometrically Intrinsic Nonlinear Recursive Filters I: Algorithms

    Authors: R. W. R. Darling

    Abstract: The Geometrically Intrinsic Nonlinear Recursive Filter, or GI Filter, is designed to estimate an arbitrary continuous-time Markov diffusion process X subject to nonlinear discrete-time observations. The GI Filter is fundamentally different from the much-used Extended Kalman Filter (EKF), and its second-order variants, even in the simplest nonlinear case, in that: (i) It uses a quadratic function…

    Submitted 6 September, 1998; originally announced September 1998.

    Comments: 22 pages, 4 figures

    Report number: UC Berkeley, Dept. of Stats, Tech. Report 494 MSC Class: 93E11; 60G35; 58G32

  20. arXiv:math/9809027  [pdf]

    math.PR math.OC

    Intrinsic Location Parameter of a Diffusion Process

    Authors: R. W. R. Darling

    Abstract: For nonlinear functions f of a random vector Y, E[f(Y)] and f(E[Y]) usually differ. Consequently the mathematical expectation of Y is not intrinsic: when we change coordinate systems, it is not invariant. This article is about a fundamental and hitherto neglected property of random vectors of the form Y = f(X(t)), where X(t) is the value at time t of a diffusion process X: namely that there exist…

    Submitted 6 September, 1998; originally announced September 1998.

    Comments: 25 pages, 1 figure

    Report number: UC Berkeley, Dept of Stats, Tech Report 493 MSC Class: 60H30; 58G32

  21. arXiv:math/9808049  [pdf]

    math.PR

    The Repeated Solicitation Model

    Authors: R. W. R. Darling

    Abstract: This paper presents a probabilistic analysis of what we call the "repeated solicitation model". To give a specific context, suppose B is a direct marketing company with a list of S sales prospects. At epoch 1, B sends a solicitation to every prospect on the list, and elicits X(1) replies. The company deletes the respondents from the list, and at epoch 2 sends a solicitation to the other prospect…

    Submitted 15 September, 1998; v1 submitted 11 August, 1998; originally announced August 1998.

    Comments: 18 pages, 2 figures

    Report number: NSA Unclassified RWRD01 MSC Class: 60J20; 90A60