-
Performance Bounds for Graphical Record Linkage
Authors:
Rebecca C. Steorts,
Matt Barnes,
Willie Neiswanger
Abstract:
Record linkage involves merging records in large, noisy databases to remove duplicate entities. It has become an important area because of its widespread occurrence in bibliometrics, public health, official statistics production, political science, and beyond. Traditional linkage methods directly linking records to one another are computationally infeasible as the number of records grows. As a result, it is increasingly common for researchers to treat record linkage as a clustering task, in which each latent entity is associated with one or more noisy database records. We critically assess performance bounds using the Kullback-Leibler (KL) divergence under a Bayesian record linkage framework, making connections to Kolchin partition models. We provide an upper bound using the KL divergence and a lower bound on the minimum probability of misclassifying a latent entity. We give insights for when our bounds hold using simulated data and provide practical user guidance.
Submitted 7 March, 2017;
originally announced March 2017.
-
A Note On Immersion Intertwines Of Infinite Graphs
Authors:
Matthew Barnes,
Bogdan Oporowski
Abstract:
We present a construction of two infinite graphs $G_1$ and $G_2$, and of an infinite set $\mathscr{F}$ of graphs such that $\mathscr{F}$ is an antichain with respect to the immersion relation and, for each graph $G$ in $\mathscr{F}$, both $G_1$ and $G_2$ are subgraphs of $G$, but no graph properly immersed in $G$ admits an immersion of $G_1$ and of $G_2$. This shows that the class of infinite graphs ordered by the immersion relation does not have the finite intertwine property.
Submitted 21 August, 2015;
originally announced August 2015.
-
McKay Centralizer Algebras
Authors:
Jeffrey M. Barnes,
Georgia Benkart,
Tom Halverson
Abstract:
For a finite subgroup $G$ of the special unitary group $SU_2$, we study the centralizer algebra $Z_k(G) = End_G(V^{\otimes k})$ of $G$ acting on the $k$-fold tensor product of its defining representation $V= \mathbb{C}^2$. These subgroups are in bijection with the simply-laced affine Dynkin diagrams. The McKay correspondence relates the representation theory of these groups to the associated Dynkin diagram, and we use this connection to show that the structure and representation theory of $Z_k(G)$ as a semisimple algebra is controlled by the combinatorics of the corresponding Dynkin diagram.
Submitted 15 December, 2015; v1 submitted 18 December, 2013;
originally announced December 2013.