-
Modeling Social Systems: Transparency, Reproducibility, and Responsibility
Authors:
Maximino Aldana,
Roni Barak Ventura,
Heather Z. Brooks,
Philip S. Chodrow,
Filipe Georgiou,
Joseph Johnson,
Krešimir Josić,
Zachary P. Kilpatrick,
Kath Landgren,
Andrew Nugent,
Maurizio Porfiri,
Nancy Rodriguez,
Pablo Suárez-Serrato,
David White,
Alexander Wiedemann,
Sam Zhang
Abstract:
Mathematical models of complex social systems can enrich social scientific theory, inform interventions, and shape policy. From voting behavior to economic inequality and urban development, such models influence decisions that affect millions of lives. Thus, it is especially important to formulate and present them with transparency, reproducibility, and humility. Modeling in social domains, howeve…
▽ More
Mathematical models of complex social systems can enrich social scientific theory, inform interventions, and shape policy. From voting behavior to economic inequality and urban development, such models influence decisions that affect millions of lives. Thus, it is especially important to formulate and present them with transparency, reproducibility, and humility. Modeling in social domains, however, is often uniquely challenging. Unlike in physics or engineering, researchers often lack controlled experiments or abundant, clean data. Observational data is sparse, noisy, partial, and missing in systematic ways. In such an environment, how can we build models that can inform science and decision-making in transparent and responsible ways?
△ Less
Submitted 25 August, 2025;
originally announced August 2025.
-
Characteristic Imsets for Cyclic Linear Causal Models and the Chickering Ideal
Authors:
Joseph Johnson,
Pardis Semnani
Abstract:
Two directed graphs are called covariance equivalent if they induce the same set of covariance matrices, up to a Lebesgue measure zero set, on the random variables of their associated linear structural equation models. For acyclic graphs, covariance equivalence is characterized both structurally, via essential graphs and characteristic imsets, and transformationally, through sequences of covered e…
▽ More
Two directed graphs are called covariance equivalent if they induce the same set of covariance matrices, up to a Lebesgue measure zero set, on the random variables of their associated linear structural equation models. For acyclic graphs, covariance equivalence is characterized both structurally, via essential graphs and characteristic imsets, and transformationally, through sequences of covered edge flips. However, when cycles are allowed, only a transformational characterization of covariance equivalence has been discovered. We consider a linear map whose fibers correspond to the sets of graphs with identical characteristic imset vectors, and study the toric ideal associated to its integer matrix. Using properties of this ideal we show that directed graphs with the same characteristic imset vectors are covariance equivalent. In applications, imsets form a smaller search space for solving causal discovery via greedy search.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
Characterizing $ (\mathcal{F}, \mathcal{G}) $-syndetic, $ (\mathcal{F}, \mathcal{G}) $-thick, and related notions of size using derived sets along ultrafilters
Authors:
Shea D. Burns,
Dennis Davenport,
Shakuan Frankson,
Conner Griffin,
John H. Johnson Jr.,
Malick Kebe
Abstract:
We characterize relative notions of syndetic and thick sets using, what we call, "derived" sets along ultrafilters. Manipulations of derived sets is a characteristic feature of algebra in the Stone-Čech compactification and its applications. Combined with the existence of idempotents and structure of the smallest ideal in closed subsemigroups of the Stone-Čch compactification, our particular use o…
▽ More
We characterize relative notions of syndetic and thick sets using, what we call, "derived" sets along ultrafilters. Manipulations of derived sets is a characteristic feature of algebra in the Stone-Čech compactification and its applications. Combined with the existence of idempotents and structure of the smallest ideal in closed subsemigroups of the Stone-Čch compactification, our particular use of derived sets adapts and generalizes methods recently used by Griffin arXiv:2311.09436 to characterize relative piecewise syndetic sets. As an application, we define an algebraically interesting subset of the Stone-Čech compactification and show, in some ways, it shares structural properties analogous to the smallest ideal.
△ Less
Submitted 7 March, 2025;
originally announced March 2025.
-
Order-Preserving outer automorphisms of free and surface groups
Authors:
Jonathan Johnson,
Khanh Le
Abstract:
We give a complete classification to when a finite group of outer automorphisms preserves a bi-order on a non-abelian free group and bi-orderable surface groups. We also give another new criterion for an outer automorphism of $F_n$ induced by action of an $n$-strand braid to preserve a bi-order on $F_n.$ Using the new criterion, we produce examples of order-preserving whose underlying permutation…
▽ More
We give a complete classification to when a finite group of outer automorphisms preserves a bi-order on a non-abelian free group and bi-orderable surface groups. We also give another new criterion for an outer automorphism of $F_n$ induced by action of an $n$-strand braid to preserve a bi-order on $F_n.$ Using the new criterion, we produce examples of order-preserving whose underlying permutation is a full cycle which answers in affirmative a question of Kin and Rolfsen.
△ Less
Submitted 30 January, 2025;
originally announced January 2025.
-
Reverse Faber-Krahn inequalities for the Logarithmic potential operator
Authors:
T. V. Anoop,
Jiya Rose Johnson
Abstract:
For a bounded open set $Ω\subset \mathbb{R}^2,$ we consider the largest eigenvalue $τ_1(Ω)$ of the Logarithmic potential operator $\mathcal{L}$. If $diam(Ω)\le 1$, we prove reverse Faber-Krahn type inequalities for $τ_1(Ω)$ under polarization and Schwarz symmetrization. Further, we establish the monotonicity of $τ_1(Ω\setminus\mathcal{O})$ with respect to certain translations and rotations of the…
▽ More
For a bounded open set $Ω\subset \mathbb{R}^2,$ we consider the largest eigenvalue $τ_1(Ω)$ of the Logarithmic potential operator $\mathcal{L}$. If $diam(Ω)\le 1$, we prove reverse Faber-Krahn type inequalities for $τ_1(Ω)$ under polarization and Schwarz symmetrization. Further, we establish the monotonicity of $τ_1(Ω\setminus\mathcal{O})$ with respect to certain translations and rotations of the obstacle $\mathcal{O}$ within $Ω$. The analogous results are also stated for the largest eigenvalue of the Riesz potential operator. Furthermore, we investigate properties of the smallest eigenvalue $\tildeτ_1(Ω)$ for a domain whose transfinite diameter is greater than 1. Finally, we characterize the eigenvalues of $\mathcal{L}$ on $B_R$, including the $\tildeτ_1(B_R)$ when $R>1$.
△ Less
Submitted 23 January, 2025;
originally announced January 2025.
-
Duality between prime factors and the Prime Number Theorem for Arithmetic Progressions -- II
Authors:
Krishnaswami Alladi,
Jason Johnson
Abstract:
In the first paper under this title (1977), the first author utilized a duality identity between the largest and smallest prime factors involving the Moebius function, to establish the following result as a consequence of the Prime Number Theorem for Arithmetic Progressions: If $k$ and $\ell$ are positive integers, with $1\le\ell\le k$ and $(\ell, k)=1$, then…
▽ More
In the first paper under this title (1977), the first author utilized a duality identity between the largest and smallest prime factors involving the Moebius function, to establish the following result as a consequence of the Prime Number Theorem for Arithmetic Progressions: If $k$ and $\ell$ are positive integers, with $1\le\ell\le k$ and $(\ell, k)=1$, then $$ \sum_{n\ge 2,\, p(n)\equiv\ell(mod\,k)}\frac{μ(n)}{n}=\frac{-1}{φ(k)}, $$ where $μ(n)$ is the Moebius function, $p(n)$ is the smallest prime factor of $n$, and $φ(k)$ is the Euler function. Here we utilize the next level Duality identity between the second largest prime factor and the smallest prime factor, involving the Moebius function and $ω(n)$, the number of distinct prime factors of $n$, to establish the following result as a consequence of the Prime Number Theorem for Arithmetic Progressions: For all $\ell$ and $k$ as above, $$ \sum_{n\ge 2, \, p(n)\equiv\ell(mod\,k)}\frac{μ(n)ω(n)}{n}=0. $$ A quantitative version of this result is proved.
△ Less
Submitted 23 October, 2024;
originally announced October 2024.
-
Searching for non-order-preserving braids algorithmically
Authors:
Jonathan Johnson,
Nancy Scherich,
Hannah Turner
Abstract:
An $n$-strand braid is order-preserving if its action on the free group $F_n$ preserves some bi-order of $F_n$. A braid $β$ is order-preserving if and only if the link $L$ obtained as the union of the closure of $β$ and its axis has bi-orderable complement. We describe and implement an algorithm which, given a non-order-preserving braid $β$, confirms this property and returns a proof that $β$ is i…
▽ More
An $n$-strand braid is order-preserving if its action on the free group $F_n$ preserves some bi-order of $F_n$. A braid $β$ is order-preserving if and only if the link $L$ obtained as the union of the closure of $β$ and its axis has bi-orderable complement. We describe and implement an algorithm which, given a non-order-preserving braid $β$, confirms this property and returns a proof that $β$ is indeed not order-preserving. Guided by the algorithm, we prove that the infinite family of simple 3-braids $σ_1σ_2^{2m+1}$ are not order-preserving for any integer $m$.
△ Less
Submitted 14 October, 2024;
originally announced October 2024.
-
Equilibria in a Hypercube Spatial Voting Model
Authors:
A. Nicholas Day,
J. Robert Johnson
Abstract:
We give conditions for equilibria in the following Voronoi game on the discrete hypercube. Two players position themselves in $\{0,1\}^d$ and each receives payoff equal to the measure (under some probability distribution) of their Voronoi cell (the set of all points which are closer to them than to the other player). This game can be thought of as a discrete analogue of the Hotelling--Downs spatia…
▽ More
We give conditions for equilibria in the following Voronoi game on the discrete hypercube. Two players position themselves in $\{0,1\}^d$ and each receives payoff equal to the measure (under some probability distribution) of their Voronoi cell (the set of all points which are closer to them than to the other player). This game can be thought of as a discrete analogue of the Hotelling--Downs spatial voting model in which the political spectrum is determined by $d$ binary issues rather than a continuous interval.
We observe that if an equilibrium does exist then it must involve the two players co-locating at the majority point (ie the point representing majority opinion on each separate issue). Our main result is that a sufficient condition for an equilibrium is that on each issue the majority option is held by at least $\frac{3}{4}$ of voters. The value $\frac{3}{4}$ can be improved slightly in a way that depends on $d$ and with this improvement the result is best possible. We give similar sufficient conditions for the existence of a local equilibrium.
We also analyse the situation where the distribution is a mix of two product measures. We show that either there is an equilibrium or the best response to the majority point is its antipode.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Modeling the distribution of insulin in pancreas
Authors:
Changbing Hu,
Junyuan Yang,
James D. Johnson,
Jiaxu Li
Abstract:
Maintenance of adequate physical and functional pancreatic $β$-cell mass is critical for the prevention or delay of diabetes mellitus. It is well established that insulin potently activates mitogenic and anti-apoptotic signaling cascades in cultured $β$-cells. Loss of $β$-cell insulin receptors is sufficient to induce type 2 diabetes in mice. However, it remains unclear whether the {\em in vitro}…
▽ More
Maintenance of adequate physical and functional pancreatic $β$-cell mass is critical for the prevention or delay of diabetes mellitus. It is well established that insulin potently activates mitogenic and anti-apoptotic signaling cascades in cultured $β$-cells. Loss of $β$-cell insulin receptors is sufficient to induce type 2 diabetes in mice. However, it remains unclear whether the {\em in vitro} effect in human islets and the {\em in vivo} effects in mice can be applied to human physiology. The major obstacle to a complete understanding of the effects of insulin's feedback in human pancreas is the absence of technology to measure the concentrations of insulin inside of pancreas. To contextualize recent {\em in vitro} data, it is essential to know the local concentration and distribution of insulin in pancreas. To this end, we continue to estimate the local insulin concentration within pancreas. In this paper, we investigate the distribution of insulin concentration along the pancreatic vein through a novel mathematical modeling approach using existing physiological data and islet imaging data, in contrast to our previous work focusing on the insulin level within an islet. Our studies suggest that, in response to an increase in glucose, the insulin concentration along the pancreatic vein increases nearly linearly in the fashion of increasing quicker in tail area but slower in head area depending of the initial distribution.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
Hyperplane Representations of Interventional Characteristic Imset Polytopes
Authors:
Benjamin Hollering,
Joseph Johnson,
Liam Solus
Abstract:
Characteristic imsets are 0/1-vectors representing directed acyclic graphs whose edges represent direct cause-effect relations between jointly distributed random variables. A characteristic imset (CIM) polytope is the convex hull of a collection of characteristic imsets. CIM polytopes arise as feasible regions of a linear programming approach to the problem of causal disovery, which aims to infer…
▽ More
Characteristic imsets are 0/1-vectors representing directed acyclic graphs whose edges represent direct cause-effect relations between jointly distributed random variables. A characteristic imset (CIM) polytope is the convex hull of a collection of characteristic imsets. CIM polytopes arise as feasible regions of a linear programming approach to the problem of causal disovery, which aims to infer a cause-effect structure from data. Linear optimization methods typically require a hyperplane representation of the feasible region, which has proven difficult to compute for CIM polytopes despite continued efforts. We solve this problem for CIM polytopes that are the convex hull of imsets associated to DAGs whose underlying graph of adjacencies is a tree. Our methods use the theory of toric fiber products as well as the novel notion of interventional CIM polytopes. Our solution is obtained as a corollary of a more general result for interventional CIM polytopes. The identified hyperplanes are applied to yield a linear optimization-based causal discovery algorithm for learning polytree causal networks from a combination of observational and interventional data.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
New directions in algebraic statistics: Three challenges from 2023
Authors:
Yulia Alexandr,
Miles Bakenhus,
Mark Curiel,
Sameer K. Deshpande,
Elizabeth Gross,
Yuqi Gu,
Max Hill,
Joseph Johnson,
Bryson Kagy,
Vishesh Karwa,
Jiayi Li,
Hanbaek Lyu,
Sonja Petrović,
Jose Israel Rodriguez
Abstract:
In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences.
Naturally…
▽ More
In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences.
Naturally, new connections continue to be made with other areas of mathematics and statistics. This paper outlines three such connections: to statistical models used in educational testing, to a classification problem for a family of nonparametric regression models, and to phase transition phenomena under uniform sampling of contingency tables. We illustrate the motivating problems, each of which is for algebraic statistics a new direction, and demonstrate an enhancement of related methodologies.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Interval and $\ell$-interval Rational Parking Functions
Authors:
Tomás Aguilar-Fraga,
Jennifer Elder,
Rebecca E. Garcia,
Kimberly P. Hadaway,
Pamela E. Harris,
Kimberly J. Harry,
Imhotep B. Hogan,
Jakeyl Johnson,
Jan Kretschmann,
Kobe Lawson-Chavanu,
J. Carlos Martínez Mori,
Casandra D. Monroe,
Daniel Quiñonez,
Dirk Tolson III,
Dwight Anderson Williams II
Abstract:
Interval parking functions are a generalization of parking functions in which cars have an interval preference for their parking. We generalize this definition to parking functions with $n$ cars and $m\geq n$ parking spots, which we call interval rational parking functions and provide a formula for their enumeration. By specifying an integer parameter $\ell\geq 0$, we then consider the subset of i…
▽ More
Interval parking functions are a generalization of parking functions in which cars have an interval preference for their parking. We generalize this definition to parking functions with $n$ cars and $m\geq n$ parking spots, which we call interval rational parking functions and provide a formula for their enumeration. By specifying an integer parameter $\ell\geq 0$, we then consider the subset of interval rational parking functions in which each car parks at most $\ell$ spots away from their initial preference. We call these $\ell$-interval rational parking functions and provide recursive formulas to enumerate this set for all positive integers $m\geq n$ and $\ell$. We also establish formulas for the number of nondecreasing $\ell$-interval rational parking functions via the outcome map on rational parking functions. We also consider the intersection between $\ell$-interval parking functions and Fubini rankings and show the enumeration of these sets is given by generalized Fibonacci numbers. We conclude by specializing $\ell=1$, and establish that the set of $1$-interval rational parking functions with $n$ cars and $m$ spots are in bijection with the set of barred preferential arrangements of $[n]$ with $m-n$ bars. This readily implies enumerative formulas. Further, in the case where $\ell=1$, we recover the results of Hadaway and Harris that unit interval parking functions are in bijection with the set of Fubini rankings, which are enumerated by the Fubini numbers.
△ Less
Submitted 16 September, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
Plane partitions and rowmotion on rectangular and trapezoidal posets
Authors:
Joseph Johnson,
Ricky Ini Liu
Abstract:
We define a birational map between labelings of a rectangular poset and its associated trapezoidal poset. This map tropicalizes to a bijection between the plane partitions of these posets of fixed height, giving a new bijective proof of a result by Proctor. We also show that this map is equivariant with respect to birational rowmotion, resolving a conjecture of Williams and implying that birationa…
▽ More
We define a birational map between labelings of a rectangular poset and its associated trapezoidal poset. This map tropicalizes to a bijection between the plane partitions of these posets of fixed height, giving a new bijective proof of a result by Proctor. We also show that this map is equivariant with respect to birational rowmotion, resolving a conjecture of Williams and implying that birational rowmotion on trapezoidal posets has finite order.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Non-standard bi-orders on punctured torus bundles
Authors:
Jonathan Johnson,
Henry Segerman
Abstract:
Results of Perron and Rolfsen imply that untwisted hyperbolic once-punctured torus bundles over the circle have bi-orderable fundamental groups. They do this by showing that the action of the monodromy preserves a "standard" bi-ordering formed using the lower central series of the free group. Here we investigate other bi-orderings that punctured torus bundle groups can have. We show that for every…
▽ More
Results of Perron and Rolfsen imply that untwisted hyperbolic once-punctured torus bundles over the circle have bi-orderable fundamental groups. They do this by showing that the action of the monodromy preserves a "standard" bi-ordering formed using the lower central series of the free group. Here we investigate other bi-orderings that punctured torus bundle groups can have. We show that for every such bi-ordering, the largest and second largest proper convex subgroups match the corresponding convex subgroups for a standard bi-ordering. Moreover, if there exists a third largest convex subgroup, it must also match the third largest convex subgroup for a standard bi-ordering. However, we also show that these groups admit non-standard bi-orderings.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
The Codegree, Weak Maximum Likelihood Threshold, and the Gorenstein Property of Hierarchical Models
Authors:
Joseph Johnson,
Seth Sullivant
Abstract:
The codegree of a lattice polytope is the smallest integer dilate that contains a lattice point in the relative interior. The weak maximum likelihood threshold of a statistical model is the smallest number of data points for which there is a non-zero probability that the maximum likelihood estimate exists. The codegree of a marginal polytope is a lower bound on the maximum likelihood threshold of…
▽ More
The codegree of a lattice polytope is the smallest integer dilate that contains a lattice point in the relative interior. The weak maximum likelihood threshold of a statistical model is the smallest number of data points for which there is a non-zero probability that the maximum likelihood estimate exists. The codegree of a marginal polytope is a lower bound on the maximum likelihood threshold of the associated log-linear model, and they are equal when the marginal polytope is normal. We prove a lower bound on the codegree in the case of hierarchical log-linear models and provide a conjectural formula for the codegree in general. As an application, we study when the marginal polytopes of hierarchical models are Gorenstein, including a classification of Gorenstein decomposable models, and a conjectural classification of Gorenstein binary hierarchical models.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
Asymmetry of 2-step Transit Probabilities in 2-Coloured Regular Graphs
Authors:
Ron Gray,
J. Robert Johnson
Abstract:
Suppose that the vertices of a regular graph are coloured red and blue with an equal number of each (we call this a balanced colouring). Since the graph is undirected, the number of edges from a red vertex to a blue vertex is clearly the same as the number of edges from a blue vertex to a red vertex. However, if instead of edges we count walks of length 2 which do not stay within their starting co…
▽ More
Suppose that the vertices of a regular graph are coloured red and blue with an equal number of each (we call this a balanced colouring). Since the graph is undirected, the number of edges from a red vertex to a blue vertex is clearly the same as the number of edges from a blue vertex to a red vertex. However, if instead of edges we count walks of length 2 which do not stay within their starting colour class, then this symmetry disappears. Our aim in this paper is to investigate how extreme this asymmetry can be.
Our main question is: Given a $d$-regular graph, for which pairs $(x,y)\in[0,1]^2$ is there a balanced colouring for which the probability that a random walk starting from a red vertex stays within the red class for at least $2$ steps is $x$, and the corresponding probability for blue is $y$?
Our most general result is that for any $d$-regular graph, these pairs lie within the convex hull of the $2d$ points $\left\{\left(\frac{l}{d},\frac{l^2}{d^2}\right),\left(\frac{l^2}{d^2},\frac{l}{d}\right) :0\leq l\leq d\right\}$.
Our main focus is the torus for which we prove both sharper bounds and existence results via constructions. In particular, for the $2$-dimensional torus, we show that asymptotically, the region in which these pairs of probabilities can lie is exactly the convex hull of: \[ \left\{\left(0,0\right),\left(\frac{1}{2},\frac{1}{4}\right),\left(\frac{3}{4},\frac{9}{16}\right),\left(\frac{1}{4},\frac{1}{2}\right),\left(\frac{9}{16},\frac{3}{4}\right),\left(1,1\right)\right\} \]
△ Less
Submitted 9 June, 2025; v1 submitted 12 July, 2023;
originally announced July 2023.
-
Partial shuffles by lazy swaps
Authors:
Barnabás Janzer,
J. Robert Johnson,
Imre Leader
Abstract:
What is the smallest number of random transpositions (meaning that we swap given pairs of elements with given probabilities) that we can make on an $n$-point set to ensure that each element is uniformly distributed -- in the sense that the probability that $i$ is mapped to $j$ is $1/n$ for all $i$ and $j$? And what if we insist that each pair is uniformly distributed?
In this paper we show that…
▽ More
What is the smallest number of random transpositions (meaning that we swap given pairs of elements with given probabilities) that we can make on an $n$-point set to ensure that each element is uniformly distributed -- in the sense that the probability that $i$ is mapped to $j$ is $1/n$ for all $i$ and $j$? And what if we insist that each pair is uniformly distributed?
In this paper we show that the minimum for the first problem is about $\frac{1}{2} n \log_2 n$, with this being exact when $n$ is a power of $2$. For the second problem, we show that, rather surprisingly, the answer is not quadratic: $O(n \log^2 n)$ random transpositions suffice. We also show that if we ask only that the pair $1,2$ is uniformly distributed then the answer is $2n-3$. This proves a conjecture of Groenland, Johnston, Radcliffe and Scott.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Piecewise-linear promotion and RSK in rectangles and moon polyominoes
Authors:
Joseph Johnson,
Ricky Ini Liu
Abstract:
We study piecewise-linear and birational lifts of Schützenberger promotion, evacuation, and the RSK correspondence defined in terms of toggles. Using this perspective, we prove that certain chain statistics in rectangles shift predictably under the action of these maps. We then use this to construct piecewise-linear and birational versions of Rubey's bijections between fillings of equivalent moon…
▽ More
We study piecewise-linear and birational lifts of Schützenberger promotion, evacuation, and the RSK correspondence defined in terms of toggles. Using this perspective, we prove that certain chain statistics in rectangles shift predictably under the action of these maps. We then use this to construct piecewise-linear and birational versions of Rubey's bijections between fillings of equivalent moon polyominoes that preserve these chain statistics, and we show that these maps form a commuting diagram. We also discuss how these results imply Ehrhart equivalence and Ehrhart quasi-polynomial period collapse of certain analogues of chain polytopes for moon polyominoes.
△ Less
Submitted 2 May, 2023; v1 submitted 9 October, 2022;
originally announced October 2022.
-
Toric Ideals of Characteristic Imsets via Quasi-Independence Gluing
Authors:
Benjamin Hollering,
Joseph Johnson,
Irem Portakal,
Liam Solus
Abstract:
Characteristic imsets are 0-1 vectors which correspond to Markov equivalence classes of directed acyclic graphs. The study of their convex hull, named the characteristic imset polytope, has led to new and interesting geometric perspectives on the important problem of causal discovery. In this paper we begin the study of the associated toric ideal. We develop a new generalization of the toric fiber…
▽ More
Characteristic imsets are 0-1 vectors which correspond to Markov equivalence classes of directed acyclic graphs. The study of their convex hull, named the characteristic imset polytope, has led to new and interesting geometric perspectives on the important problem of causal discovery. In this paper we begin the study of the associated toric ideal. We develop a new generalization of the toric fiber product, which we call a quasi-independence gluing, and show that under certain combinatorial homogeneity conditions, one can iteratively compute a Gröbner basis via lifting. For faces of the characteristic imset polytope associated to trees, we apply this technique to compute a Gröbner basis for the associated toric ideal. We end with a study of the characteristic ideal of the cycle and propose directions for future work.
△ Less
Submitted 19 September, 2022; v1 submitted 5 September, 2022;
originally announced September 2022.
-
Optimal Resistor Networks
Authors:
J. Robert Johnson,
Mark Walters
Abstract:
Given a graph on n vertices with m edges, each of unit resistance, how small can the average resistance between pairs of vertices be? There are two very plausible extremal constructions -- graphs like a star, and graphs which are close to regular -- with the transition between them occuring when the average degree is 3. However, one of our main aims in this paper is to show that there are signific…
▽ More
Given a graph on n vertices with m edges, each of unit resistance, how small can the average resistance between pairs of vertices be? There are two very plausible extremal constructions -- graphs like a star, and graphs which are close to regular -- with the transition between them occuring when the average degree is 3. However, one of our main aims in this paper is to show that there are significantly better constructions for a range of average degree including average degree near 3.
A key idea is to link this question to a analogous question about rooted graphs -- namely `which rooted graph minimises the average resistance to the root?'. The rooted case is much simpler to analyse than the unrooted, and one of the main results of this paper is that the two cases are asymptotically equivalent.
△ Less
Submitted 16 June, 2022;
originally announced June 2022.
-
The Kernelized Taylor Diagram
Authors:
Kristoffer Wickstrøm,
J. Emmanuel Johnson,
Sigurd Løkse,
Gustau Camps-Valls,
Karl Øyvind Mikalsen,
Michael Kampffmeyer,
Robert Jenssen
Abstract:
This paper presents the kernelized Taylor diagram, a graphical framework for visualizing similarities between data populations. The kernelized Taylor diagram builds on the widely used Taylor diagram, which is used to visualize similarities between populations. However, the Taylor diagram has several limitations such as not capturing non-linear relationships and sensitivity to outliers. To address…
▽ More
This paper presents the kernelized Taylor diagram, a graphical framework for visualizing similarities between data populations. The kernelized Taylor diagram builds on the widely used Taylor diagram, which is used to visualize similarities between populations. However, the Taylor diagram has several limitations such as not capturing non-linear relationships and sensitivity to outliers. To address such limitations, we propose the kernelized Taylor diagram. Our proposed kernelized Taylor diagram is capable of visualizing similarities between populations with minimal assumptions of the data distributions. The kernelized Taylor diagram relates the maximum mean discrepancy and the kernel mean embedding in a single diagram, a construction that, to the best of our knowledge, have not been devised prior to this work. We believe that the kernelized Taylor diagram can be a valuable tool in data visualization.
△ Less
Submitted 18 May, 2022;
originally announced May 2022.
-
A non-associative incidence near-ring with a generalized Möbius function
Authors:
John Johnson,
Max Wakefield
Abstract:
There is a convolution product on 3-variable partial flag functions of a locally finite poset that produces a generalized Möbius function. Under the product this generalized Möbius function is a one sided inverse of the zeta function and satisfies many generalizations of classical results. In particular we prove analogues of Phillip Hall's Theorem on the Möbius function as an alternating sum of ch…
▽ More
There is a convolution product on 3-variable partial flag functions of a locally finite poset that produces a generalized Möbius function. Under the product this generalized Möbius function is a one sided inverse of the zeta function and satisfies many generalizations of classical results. In particular we prove analogues of Phillip Hall's Theorem on the Möbius function as an alternating sum of chain counts, Weisner's theorem, and Rota's Crosscut Theorem. A key ingredient to these results is that this function is an overlapping product of classical Möbius functions. Using this generalized Möbius function we define analogues of the characteristic polynomial and Möbius polynomials for ranked lattices. We compute these polynomials for certain families of matroids and prove that this generalized Möbius polynomial has -1 as root if the matroid is modular. Using results from Ardila and Sanchez we prove that this generalized characteristic polynomial is a matroid valuation.
△ Less
Submitted 14 April, 2022;
originally announced April 2022.
-
Birational Rowmotion and the Octahedron Recurrence
Authors:
Joseph Johnson,
Ricky Ini Liu
Abstract:
We use the octahedron recurrence to give a simplified statement and proof of a formula for iterated birational rowmotion on a product of two chains, first described by Musiker and Roby. Using this, we show that weights of certain chains in rectangles shift in a predictable way under the action of rowmotion. We then define generalized Stanley-Thomas words whose cyclic rotation uniquely determines b…
▽ More
We use the octahedron recurrence to give a simplified statement and proof of a formula for iterated birational rowmotion on a product of two chains, first described by Musiker and Roby. Using this, we show that weights of certain chains in rectangles shift in a predictable way under the action of rowmotion. We then define generalized Stanley-Thomas words whose cyclic rotation uniquely determines birational rowmotion on the product of two chains. We also discuss the relationship between rowmotion and birational RSK and give a birational analogue of Greene's theorem in this setting.
△ Less
Submitted 8 April, 2022;
originally announced April 2022.
-
A Dynamical Model for the Origin of Anisogamy
Authors:
Joseph D. Johnson,
Nathan L. White,
Alain Kangabire,
Daniel M. Abrams
Abstract:
The vast majority of multi-cellular organisms are anisogamous, meaning that male and female sex cells differ in size. It remains an open question how this asymmetric state evolved, presumably from the symmetric isogamous state where all gametes are roughly the same size (drawn from the same distribution). Here, we use tools from the study of nonlinear dynamical systems to develop a simple mathemat…
▽ More
The vast majority of multi-cellular organisms are anisogamous, meaning that male and female sex cells differ in size. It remains an open question how this asymmetric state evolved, presumably from the symmetric isogamous state where all gametes are roughly the same size (drawn from the same distribution). Here, we use tools from the study of nonlinear dynamical systems to develop a simple mathematical model for this phenomenon. Using theoretical analysis and numerical simulation, we demonstrate that competition between individuals that is linked to the mean gamete size will almost inevitably result in a stable anisogamous equilibrium, and thus isogamy may naturally lead to anisogamy.
△ Less
Submitted 16 December, 2021;
originally announced December 2021.
-
A Mathematical Model for the Origin of Name Brands and Generics
Authors:
Joseph D. Johnson,
Adam M. Redlich,
Daniel M. Abrams
Abstract:
Firms in the U.S. spend over 200 billion dollars each year advertising their products to consumers, around one percent of the country's gross domestic product. It is of great interest to understand how that aggregate expenditure affects prices, market efficiency, and overall welfare. Here, we present a mathematical model for the dynamics of competition through advertising and find a surprising pre…
▽ More
Firms in the U.S. spend over 200 billion dollars each year advertising their products to consumers, around one percent of the country's gross domestic product. It is of great interest to understand how that aggregate expenditure affects prices, market efficiency, and overall welfare. Here, we present a mathematical model for the dynamics of competition through advertising and find a surprising prediction: when advertising is relatively cheap compared to the maximum benefit advertising offers, rational firms split into two groups, one with significantly less advertising (a "generic" group) and one with significantly more advertising (a "name brand" group). Our model predicts that this segmentation will also be reflected in price distributions; we use large consumer data sets to test this prediction and find good qualitative agreement.
△ Less
Submitted 14 December, 2021;
originally announced December 2021.
-
Shattering $k$-sets with Permutations
Authors:
J. Robert Johnson,
Belinda Wickes
Abstract:
Many concepts from extremal set theory have analogues for families of permutations. This paper is concerned with the notion of shattering for permutations. A family $\mathcal{P}$ of permutations of an $n$-element set $X$ shatters a $k$-set from $X$ if it appears in each of the $k!$ possible orders in some permutation in $\mathcal{P}$. The smallest family $\mathcal{P}$ which shatters every $k$-subs…
▽ More
Many concepts from extremal set theory have analogues for families of permutations. This paper is concerned with the notion of shattering for permutations. A family $\mathcal{P}$ of permutations of an $n$-element set $X$ shatters a $k$-set from $X$ if it appears in each of the $k!$ possible orders in some permutation in $\mathcal{P}$. The smallest family $\mathcal{P}$ which shatters every $k$-subset of $X$ is known to have size $Θ(\log n)$.
Our aim is to introduce and study two natural partial versions of this shattering problem.
Our first main result concerns the case where our family must contain only $t$ out of $k!$ of the possible orders. When $k=3$ we show that there are three distinct regimes depending on $t$: constant, $Θ(\log\log n)$, $Θ(\log n)$. We also show that for larger $k$ these same regimes exist although they may not cover all values of $t$.
Our second direction concerns the problem of determining the largest number of $k$-sets that can be totally shattered by a family with given size. We show that for any $n$, a family of $6$ permutations is enough to shatter a proportion between $\frac{17}{42}$ and $\frac{11}{14}$ of all triples.
△ Less
Submitted 5 April, 2023; v1 submitted 3 December, 2021;
originally announced December 2021.
-
Algebraic characterizations of some relative notions of size
Authors:
Cory Christopherson,
John H. Johnson Jr
Abstract:
We obtain algebraic characterizations of relative notions of size in a discrete semigroup that generalize the usual combinatorial notions of syndetic, thick, and piecewise syndetic sets. "Filtered" syndetic and piecewise syndetic sets were defined and applied earlier by Shuungula, Zelenyuk, and Zelenyuk [24]. Other instances of these relative notions of size have appeared explicitly (and more ofte…
▽ More
We obtain algebraic characterizations of relative notions of size in a discrete semigroup that generalize the usual combinatorial notions of syndetic, thick, and piecewise syndetic sets. "Filtered" syndetic and piecewise syndetic sets were defined and applied earlier by Shuungula, Zelenyuk, and Zelenyuk [24]. Other instances of these relative notions of size have appeared explicitly (and more often implicitly) in the literature related to the algebraic structure of the Stone-Čech compactification. Building on this prior work, we observe a natural duality and demonstrate how these notions of size may be composed to characterize previous notions of size (like piecewise syndetic sets) and serve as a convenient description for new notions of size.
△ Less
Submitted 19 July, 2021; v1 submitted 20 May, 2021;
originally announced May 2021.
-
Determinantal Formulas for SEM Expansions of Schubert Polynomials
Authors:
Hassan Hatam,
Joseph Johnson,
Ricky Ini Liu,
Maria Macaulay
Abstract:
We show that for any permutation $w$ that avoids a certain set of 13 patterns of lengths 5 and 6, the Schubert polynomial $\mathfrak S_w$ can be expressed as the determinant of a matrix of elementary symmetric polynomials in a manner similar to the Jacobi-Trudi identity. For such $w$, this determinantal formula is equivalent to a (signed) subtraction-free expansion of $\mathfrak S_w$ in the basis…
▽ More
We show that for any permutation $w$ that avoids a certain set of 13 patterns of lengths 5 and 6, the Schubert polynomial $\mathfrak S_w$ can be expressed as the determinant of a matrix of elementary symmetric polynomials in a manner similar to the Jacobi-Trudi identity. For such $w$, this determinantal formula is equivalent to a (signed) subtraction-free expansion of $\mathfrak S_w$ in the basis of standard elementary monomials.
△ Less
Submitted 9 April, 2021;
originally announced April 2021.
-
Stochastic Entry Guidance
Authors:
Jack Ridderhof,
Panagiotis Tsiotras,
Breanna J. Johnson
Abstract:
In this paper, closed-loop entry guidance in a randomly perturbed atmosphere, using bank angle control, is posed as a stochastic optimal control problem. The entry trajectory, as well as the closed-loop controls, are both modeled as random processes with statistics determined by the entry dynamics, the entry guidance, and the probabilistic structure of altitude-dependent atmospheric density variat…
▽ More
In this paper, closed-loop entry guidance in a randomly perturbed atmosphere, using bank angle control, is posed as a stochastic optimal control problem. The entry trajectory, as well as the closed-loop controls, are both modeled as random processes with statistics determined by the entry dynamics, the entry guidance, and the probabilistic structure of altitude-dependent atmospheric density variations. The entry guidance, which is parameterized as a sequence of linear feedback gains, is designed to steer the probability distribution of the entry trajectories while satisfying bounds on the allowable control inputs and on the maximum allowable state errors. Numerical simulations of a Mars entry scenario demonstrate improved range targeting performance with approximately 50% lower 1st and 99th percentile final range errors when using the developed stochastic guidance scheme as compared to the existing Apollo final phase algorithm.
△ Less
Submitted 17 January, 2022; v1 submitted 8 March, 2021;
originally announced March 2021.
-
Generalizing Kirchhoff laws for Signed Graphs
Authors:
Lucas J. Rusnak,
Josephine Reynes,
Skyler J. Johnson,
Peter Ye
Abstract:
Kirchhoff-type Laws for signed graphs are characterized by generalizing transpedances through the incidence-oriented structure of bidirected graphs. The classical $2$-arborescence interpretation of Tutte is shown to be equivalent to single-element Boolean classes of reduced incidence-based cycle covers, called contributors. A generalized contributor-transpedance is introduced using entire Boolean…
▽ More
Kirchhoff-type Laws for signed graphs are characterized by generalizing transpedances through the incidence-oriented structure of bidirected graphs. The classical $2$-arborescence interpretation of Tutte is shown to be equivalent to single-element Boolean classes of reduced incidence-based cycle covers, called contributors. A generalized contributor-transpedance is introduced using entire Boolean classes that naturally cancel in a graph; classical conservation is proven to be property of the trivial Boolean classes. The contributor-transpedances on signed graphs are shown to produce non-conservative Kirchhoff-type Laws, where every contributor possesses the unique source-sink path property. Finally, the maximum value of a contributor-transpedance is calculated through the signless Laplacian.
△ Less
Submitted 26 September, 2020;
originally announced September 2020.
-
Residual Torsion-Free Nilpotence, Bi-Orderability and Pretzel Knots
Authors:
Jonathan Johnson
Abstract:
The residual torsion-free nilpotence of the commutator subgroup of a knot group has played a key role in studying the bi-orderability of knot groups. A technique developed by Mayland provides a sufficient condition for the commutator subgroup of a knot group to be residually-torsion-free nilpotent using work of Baumslag. In this paper, we apply Mayland's technique to several genus one pretzel knot…
▽ More
The residual torsion-free nilpotence of the commutator subgroup of a knot group has played a key role in studying the bi-orderability of knot groups. A technique developed by Mayland provides a sufficient condition for the commutator subgroup of a knot group to be residually-torsion-free nilpotent using work of Baumslag. In this paper, we apply Mayland's technique to several genus one pretzel knots and a family of pretzel knots with arbitrarily high genus. As a result, we obtain a large number of new examples of knots with bi-orderable knot groups. These are the first examples of bi-orderable knot groups for knots which are not fibered or alternating.
△ Less
Submitted 9 August, 2021; v1 submitted 31 August, 2020;
originally announced August 2020.
-
Synchronizing Times for $k$-sets in Automata
Authors:
Natalie C. Behague,
J. Robert Johnson
Abstract:
An automaton is synchronizing if there is a word that maps all states onto the same state. Černý's conjecture on the length of the shortest such word is probably the most famous open problem in automata theory. We consider the closely related question of determining the minimum length of a word that maps $k$ states onto a single state. For synchronizing automata, we improve the upper bound on the…
▽ More
An automaton is synchronizing if there is a word that maps all states onto the same state. Černý's conjecture on the length of the shortest such word is probably the most famous open problem in automata theory. We consider the closely related question of determining the minimum length of a word that maps $k$ states onto a single state. For synchronizing automata, we improve the upper bound on the minimum length of a word that sends some triple to a a single state from $0.5n^2$ to $\approx 0.19n^2$. We further extend this to an improved bound on the length of such a word for 4 states and 5 states. In the case of non-synchronizing automata, we give an example to show that the minimum length of a word that sends $k$ states to a single state can be as large as $Θ\left(n^{k-1}\right)$.
△ Less
Submitted 8 August, 2022; v1 submitted 27 August, 2020;
originally announced August 2020.
-
Characterization of the lengths of binary circular words containing no squares other than 00, 11, and 0101
Authors:
James D. Currie,
Jesse T. Johnson
Abstract:
We characterize exactly the lengths of binary circular words containing no squares other than 00, 11, and 0101. Key words: combinatorics on words, circular words, necklaces, square-free words, non-repetitive sequences
We characterize exactly the lengths of binary circular words containing no squares other than 00, 11, and 0101. Key words: combinatorics on words, circular words, necklaces, square-free words, non-repetitive sequences
△ Less
Submitted 19 May, 2020;
originally announced May 2020.
-
There are level ternary circular square-free words of length $n$ for $n\ne 5,7,9,10,14,17.$
Authors:
James D. Currie,
Jesse T. Johnson
Abstract:
A word is level if each letter appears in it the same number of times, plus or minus 1. We give a complete characterization of the lengths for which level ternary circular square-free words exist. Key words: combinatorics on words, circular words, necklaces, square-free words, non-repetitive sequences
A word is level if each letter appears in it the same number of times, plus or minus 1. We give a complete characterization of the lengths for which level ternary circular square-free words exist. Key words: combinatorics on words, circular words, necklaces, square-free words, non-repetitive sequences
△ Less
Submitted 19 May, 2020; v1 submitted 13 May, 2020;
originally announced May 2020.
-
Efficient, arbitrarily high precision hardware logarithmic arithmetic for linear algebra
Authors:
Jeff Johnson
Abstract:
The logarithmic number system (LNS) is arguably not broadly used due to exponential circuit overheads for summation tables relative to arithmetic precision. Methods to reduce this overhead have been proposed, yet still yield designs with high chip area and power requirements. Use remains limited to lower precision or high multiply/add ratio cases, while much of linear algebra (near 1:1 multiply/ad…
▽ More
The logarithmic number system (LNS) is arguably not broadly used due to exponential circuit overheads for summation tables relative to arithmetic precision. Methods to reduce this overhead have been proposed, yet still yield designs with high chip area and power requirements. Use remains limited to lower precision or high multiply/add ratio cases, while much of linear algebra (near 1:1 multiply/add ratio) does not qualify. We present a dual-base approximate logarithmic arithmetic comparable to floating point in use, yet unlike LNS it is easily fully pipelined, extendable to arbitrary precision with $O(n^2)$ overhead, and energy efficient at a 1:1 multiply/add ratio. Compared to float32 or float64 vector inner product with FMA, our design is respectively 2.3x and 4.6x more energy efficient in 7 nm CMOS. It depends on exp and log evaluation 5.4x and 3.2x more energy efficient, at 0.23x and 0.37x the chip area for equivalent accuracy versus standard hyperbolic CORDIC using shift-and-add and approximated ODE integration in the style of Revol and Yakoubsohn. This technique is a novel design alternative for low power, high precision hardened linear algebra in computer vision, graphics and machine learning applications.
△ Less
Submitted 14 May, 2020; v1 submitted 16 April, 2020;
originally announced April 2020.
-
Residual Torsion-Free Nilpotence, Bi-Orderability and Two-Bridge Links
Authors:
Jonathan Johnson
Abstract:
Residual torsion-free nilpotence has proven to be an important property for knot groups with applications to bi-orderability and ribbon concordance. Mayland proposed a strategy to show that a two-bridge knot group has a commutator subgroup which is a union of an ascending chain of parafree groups. This paper proves Mayland's assertion and expands the result to the subgroups of two-bridge link grou…
▽ More
Residual torsion-free nilpotence has proven to be an important property for knot groups with applications to bi-orderability and ribbon concordance. Mayland proposed a strategy to show that a two-bridge knot group has a commutator subgroup which is a union of an ascending chain of parafree groups. This paper proves Mayland's assertion and expands the result to the subgroups of two-bridge link groups that correspond to the kernels of maps to $\mathbb{Z}$. We call these kernels the Alexander subgroups of the links. As a result, we show the bi-orderability of a large family of two-bridge link groups. This proof makes use of a modified version of a graph theoretic construction of Hirasawa and Murasugi in order to understand the structure of the Alexander subgroup for a two-bridge link group.
△ Less
Submitted 8 July, 2021; v1 submitted 18 December, 2019;
originally announced December 2019.
-
Correlation for permutations
Authors:
J. Robert Johnson,
Imre Leader,
Eoin Long
Abstract:
In this note we investigate correlation inequalities for `up-sets' of permutations, in the spirit of the Harris--Kleitman inequality. We focus on two well-studied partial orders on $S_n$, giving rise to differing notions of up-sets. Our first result shows that, under the strong Bruhat order on $S_n$, up-sets are positively correlated (in the Harris--Kleitman sense). Thus, for example, for a (unifo…
▽ More
In this note we investigate correlation inequalities for `up-sets' of permutations, in the spirit of the Harris--Kleitman inequality. We focus on two well-studied partial orders on $S_n$, giving rise to differing notions of up-sets. Our first result shows that, under the strong Bruhat order on $S_n$, up-sets are positively correlated (in the Harris--Kleitman sense). Thus, for example, for a (uniformly) random permutation $π$, the event that no point is displaced by more than a fixed distance $d$ and the event that $π$ is the product of at most $k$ adjacent transpositions are positively correlated. In contrast, under the weak Bruhat order we show that this completely fails: surprisingly, there are two up-sets each of measure $1/2$ whose intersection has arbitrarily small measure.
We also prove analogous correlation results for a class of non-uniform measures, which includes the Mallows measures. Some applications and open problems are discussed.
△ Less
Submitted 21 April, 2020; v1 submitted 9 September, 2019;
originally announced September 2019.
-
A Coupled Oscillator Model for the Origin of Bimodality and Multimodality
Authors:
Joseph D. Johnson,
Daniel M. Abrams
Abstract:
Perhaps because of the elegance of the central limit theorem, it is often assumed that distributions in nature will approach singly-peaked, unimodal shapes reminiscent of the Gaussian normal distribution. However, many systems behave differently, with variables following apparently bimodal or multimodal distributions. Here we argue that multimodality may emerge naturally as a result of repulsive o…
▽ More
Perhaps because of the elegance of the central limit theorem, it is often assumed that distributions in nature will approach singly-peaked, unimodal shapes reminiscent of the Gaussian normal distribution. However, many systems behave differently, with variables following apparently bimodal or multimodal distributions. Here we argue that multimodality may emerge naturally as a result of repulsive or inhibitory coupling dynamics, and we show rigorously how it emerges for a broad class of coupling functions in variants of the paradigmatic Kuramoto model.
△ Less
Submitted 24 February, 2020; v1 submitted 13 May, 2019;
originally announced May 2019.
-
On the Diophantine Equation 1/a + 1/b = (q+1) / pq
Authors:
Jeremiah W. Johnson
Abstract:
Let $p$ and $q$ be distinct primes such that $q+1 | p-1$. In this paper we find all integer solutions $a$, $b$ to the equation $1/a + 1/b = (q+1)/pq$ using only elementary methods.
Let $p$ and $q$ be distinct primes such that $q+1 | p-1$. In this paper we find all integer solutions $a$, $b$ to the equation $1/a + 1/b = (q+1)/pq$ using only elementary methods.
△ Less
Submitted 6 May, 2019;
originally announced May 2019.
-
Subspace Match Probably Does Not Accurately Assess the Similarity of Learned Representations
Authors:
Jeremiah Johnson
Abstract:
Learning informative representations of data is one of the primary goals of deep learning, but there is still little understanding as to what representations a neural network actually learns. To better understand this, subspace match was recently proposed as a method for assessing the similarity of the representations learned by neural networks. It has been shown that two networks with the same ar…
▽ More
Learning informative representations of data is one of the primary goals of deep learning, but there is still little understanding as to what representations a neural network actually learns. To better understand this, subspace match was recently proposed as a method for assessing the similarity of the representations learned by neural networks. It has been shown that two networks with the same architecture trained from different initializations learn representations that at hidden layers show low similarity when assessed with subspace match, even when the output layers show high similarity and the networks largely exhibit similar performance on classification tasks. In this note, we present a simple example motivated by standard results in commutative algebra to illustrate how this can happen, and show that although the subspace match at a hidden layer may be 0, the representations learned may be isomorphic as vector spaces. This leads us to conclude that a subspace match comparison of learned representations may well be uninformative, and it points to the need for better methods of understanding learned representations.
△ Less
Submitted 3 January, 2019;
originally announced January 2019.
-
Rethinking floating point for deep learning
Authors:
Jeff Johnson
Abstract:
Reducing hardware overhead of neural networks for faster or lower power inference and training is an active area of research. Uniform quantization using integer multiply-add has been thoroughly investigated, which requires learning many quantization parameters, fine-tuning training or other prerequisites. Little effort is made to improve floating point relative to this baseline; it remains energy…
▽ More
Reducing hardware overhead of neural networks for faster or lower power inference and training is an active area of research. Uniform quantization using integer multiply-add has been thoroughly investigated, which requires learning many quantization parameters, fine-tuning training or other prerequisites. Little effort is made to improve floating point relative to this baseline; it remains energy inefficient, and word size reduction yields drastic loss in needed dynamic range. We improve floating point to be more energy efficient than equivalent bit width integer hardware on a 28 nm ASIC process while retaining accuracy in 8 bits with a novel hybrid log multiply/linear add, Kulisch accumulation and tapered encodings from Gustafson's posit format. With no network retraining, and drop-in replacement of all math and float32 parameters via round-to-nearest-even only, this open-sourced 8-bit log float is within 0.9% top-1 and 0.2% top-5 accuracy of the original float32 ResNet-50 CNN model on ImageNet. Unlike int8 quantization, it is still a general purpose floating point arithmetic, interpretable out-of-the-box. Our 8/38-bit log float multiply-add is synthesized and power profiled at 28 nm at 0.96x the power and 1.12x the area of 8/32-bit integer multiply-add. In 16 bits, our log float multiply-add is 0.59x the power and 0.68x the area of IEEE 754 float16 fused multiply-add, maintaining the same signficand precision and dynamic range, proving useful for training ASICs as well.
△ Less
Submitted 1 November, 2018;
originally announced November 2018.
-
The $Q_2$-free process in the hypercube
Authors:
J. Robert Johnson,
Trevor Pinto
Abstract:
The generation of a random triangle-saturated graph via the triangle-free process has been studied extensively. In this short note our aim is to introduce an analogous process in the hypercube. Specifically, we consider the $Q_2$-free process in $Q_d$ and the random subgraph of $Q_d$ it generates. Our main result is that with high probability the graph resulting from this process has at least…
▽ More
The generation of a random triangle-saturated graph via the triangle-free process has been studied extensively. In this short note our aim is to introduce an analogous process in the hypercube. Specifically, we consider the $Q_2$-free process in $Q_d$ and the random subgraph of $Q_d$ it generates. Our main result is that with high probability the graph resulting from this process has at least $cd^{2/3} 2^d$ edges. We also discuss a heuristic argument based on the differential equations method which suggests a stronger conjecture, and discuss the issues with making this rigorous. We conclude with some open questions related to this process.
△ Less
Submitted 13 October, 2020; v1 submitted 24 April, 2018;
originally announced April 2018.
-
Counting Roots of Polynomials over $\mathbb{Z}/p^2\mathbb{Z}$
Authors:
Trajan Hammonds,
Jeremy Johnson,
Angela Patini,
Robert M. Walker
Abstract:
Until recently, the only known method of finding the roots of polynomials over prime power rings, other than fields, was brute force. One reason for this is the lack of a division algorithm, obstructing the use of greatest common divisors. Fix a prime $p \in \mathbb{Z}$ and $f \in ( \mathbb{Z}/p^n \mathbb{Z} ) [x]$ any nonzero polynomial of degree $d$ whose coefficients are not all divisible by…
▽ More
Until recently, the only known method of finding the roots of polynomials over prime power rings, other than fields, was brute force. One reason for this is the lack of a division algorithm, obstructing the use of greatest common divisors. Fix a prime $p \in \mathbb{Z}$ and $f \in ( \mathbb{Z}/p^n \mathbb{Z} ) [x]$ any nonzero polynomial of degree $d$ whose coefficients are not all divisible by $p$. For the case $n=2$, we prove a new efficient algorithm to count the roots of $f$ in $\mathbb{Z}/p^2\mathbb{Z}$ within time polynomial in $(d+\operatorname{size}(f)+\log{p})$, and record a concise formula for the number of roots, formulated by Cheng, Gao, Rojas, and Wan.
△ Less
Submitted 12 December, 2017; v1 submitted 15 August, 2017;
originally announced August 2017.
-
Elastic Splines II: unicity of optimal s-curves and $G^2$ regularity of splines
Authors:
Albert Borbely,
Michael J. Johnson
Abstract:
Given points $P_1,P_2,\ldots,P_m$ in the complex plane, we are concerned with the problem of finding an interpolating curve with minimal bending energy (i.e., an optimal interpolating curve). It was shown previously that existence is assured if one requires that the pieces of the interpolating curve be s-curves. In the present article we also impose the restriction that these s-curves have chord a…
▽ More
Given points $P_1,P_2,\ldots,P_m$ in the complex plane, we are concerned with the problem of finding an interpolating curve with minimal bending energy (i.e., an optimal interpolating curve). It was shown previously that existence is assured if one requires that the pieces of the interpolating curve be s-curves. In the present article we also impose the restriction that these s-curves have chord angles not exceeding $π/2$ in magnitude. With this setup, we have identified a sufficient condition for the $G^2$ regularity of optimal interpolating curves. This sufficient condition relates to the stencil angles $\{ψ_j\}$, where $ψ_j$ is defined as the angular change in direction from segment $[P_{j-1},P_j]$ to segment $[P_j,P_{j+1}]$. A distinguished angle $Ψ$ ($\approx 37^\circ$) is identified, and we show that if the stencil angles satisfy $|ψ_j|<Ψ$, then optimal interpolating curves are globally $G^2$.
As with the previous article, most of our effort is concerned with the geometric Hermite interpolation problem of finding an optimal s-curve which connects $P_1$ to $P_2$ with prescribed chord angles $(α,β)$. Whereas existence was previously shown, and sometimes uniqueness, the present article begins by establishing uniqueness when $|α|,|β|\leqπ/2$ and $|α-β|<π$.
△ Less
Submitted 31 December, 2016;
originally announced January 2017.
-
Revisiting the nilpotent polynomial Hales-Jewett theorem
Authors:
John H. Johnson,
Florian Karl Richter
Abstract:
Answering a question posed by Bergelson and Leibman in [6], we establish a nilpotent version of the polynomial Hales-Jewett theorem that contains the main theorem in [6] as a special case. Important to the formulation and the proof of our main theorem is the notion of a relative syndetic set (relative with respect to a closed non-empty subsets of $β\mathbf{G}$) [25]. As a corollary of our main the…
▽ More
Answering a question posed by Bergelson and Leibman in [6], we establish a nilpotent version of the polynomial Hales-Jewett theorem that contains the main theorem in [6] as a special case. Important to the formulation and the proof of our main theorem is the notion of a relative syndetic set (relative with respect to a closed non-empty subsets of $β\mathbf{G}$) [25]. As a corollary of our main theorem we prove an extension of the restricted van der Waerden Theorem to nilpotent groups, which involves nilprogressions.
△ Less
Submitted 21 November, 2018; v1 submitted 18 July, 2016;
originally announced July 2016.
-
Transitive Avoidance Games
Authors:
J. Robert Johnson,
Imre Leader,
Mark Walters
Abstract:
Positional games are a well-studied class of combinatorial game. In their usual form, two players take turns to play moves in a set (`the board'), and certain subsets are designated as `winning': the first person to occupy such a set wins the game. For these games, it is well known that (with correct play) the game cannot be a second-player win.
In the avoidance (or misère) form, the first perso…
▽ More
Positional games are a well-studied class of combinatorial game. In their usual form, two players take turns to play moves in a set (`the board'), and certain subsets are designated as `winning': the first person to occupy such a set wins the game. For these games, it is well known that (with correct play) the game cannot be a second-player win.
In the avoidance (or misère) form, the first person to occupy such a set \emph{loses} the game. Here it would be natural to expect that the game cannot be a first-player win, at least if the game is transitive, meaning that all points of the board look the same. Our main result is that, contrary to this expectation, there are transitive games that are first-player wins, for all board sizes which are not prime or a power of 2.
Further, we show that such games can have additional properties such as stronger transitivity conditions, fast winning times, and `small' winning sets.
△ Less
Submitted 11 July, 2016;
originally announced July 2016.
-
Multicolour Ramsey Numbers of Odd Cycles
Authors:
A. Nicholas Day,
J. Robert Johnson
Abstract:
We show that for any positive integer $r$ there exists an integer $k$ and a $k$-colouring of the edges of $K_{2^{k}+1}$ with no monochromatic odd cycle of length less than $r$. This makes progress on a problem of Erdős and Graham and answers a question of Chung. We use these colourings to give new lower bounds on the $k$-colour Ramsey number of the odd cycle and prove that, for all odd $r$ and all…
▽ More
We show that for any positive integer $r$ there exists an integer $k$ and a $k$-colouring of the edges of $K_{2^{k}+1}$ with no monochromatic odd cycle of length less than $r$. This makes progress on a problem of Erdős and Graham and answers a question of Chung. We use these colourings to give new lower bounds on the $k$-colour Ramsey number of the odd cycle and prove that, for all odd $r$ and all $k$ sufficiently large, there exists a constant $ε= ε(r) > 0$ such that $R_{k}(C_{r}) > (r-1)(2+ε)^{k-1}$.
△ Less
Submitted 16 January, 2017; v1 submitted 24 February, 2016;
originally announced February 2016.
-
Neighbors of knots in the Gordian graph
Authors:
Ryan Blair,
Marion Campisi,
Jesse Johnson,
Scott A. Taylor,
Maggy Tomova
Abstract:
We show that every knot is one crossing change away from a knot of arbitrarily high bridge number and arbitrarily high bridge distance.
We show that every knot is one crossing change away from a knot of arbitrarily high bridge number and arbitrarily high bridge distance.
△ Less
Submitted 10 June, 2016; v1 submitted 1 May, 2015;
originally announced May 2015.
-
Bridge numbers of knots in the page of an open book
Authors:
R. Sean Bowman,
Jesse Johnson
Abstract:
Given any closed, connected, orientable $3$--manifold and integers $g\geq g(M), D > 0$, we show the existence of knots in $M$ whose genus $g$ bridge number is greater than $D$. These knots lie in a page of an open book decomposition of $M$, and the proof proceeds by examining the action of the map induced by the monodromy on the arc and curve complex of a page. A corollary is that there are Berge…
▽ More
Given any closed, connected, orientable $3$--manifold and integers $g\geq g(M), D > 0$, we show the existence of knots in $M$ whose genus $g$ bridge number is greater than $D$. These knots lie in a page of an open book decomposition of $M$, and the proof proceeds by examining the action of the map induced by the monodromy on the arc and curve complex of a page. A corollary is that there are Berge knots of arbitrarily large genus one bridge number.
△ Less
Submitted 14 February, 2015;
originally announced February 2015.
-
New polynomial and multidimensional extensions of classical partition results
Authors:
Vitaly Bergelson,
John H. Johnson Jr.,
Joel Moreira
Abstract:
In the 1970s Deuber introduced the notion of $(m,p,c)$-sets in $\mathbb{N}$ and showed that these sets are partition regular and contain all linear partition regular configurations in $\mathbb{N}$. In this paper we obtain enhancements and extensions of classical results on $(m,p,c)$-sets in two directions. First, we show, with the help of ultrafilter techniques, that Deuber's results extend to pol…
▽ More
In the 1970s Deuber introduced the notion of $(m,p,c)$-sets in $\mathbb{N}$ and showed that these sets are partition regular and contain all linear partition regular configurations in $\mathbb{N}$. In this paper we obtain enhancements and extensions of classical results on $(m,p,c)$-sets in two directions. First, we show, with the help of ultrafilter techniques, that Deuber's results extend to polynomial configurations in abelian groups. In particular, we obtain new partition regular polynomial configurations in $\mathbb{Z}^d$. Second, we give two proofs of a generalization of Deuber's results to general commutative semigroups. We also obtain a polynomial version of the central sets theorem of Furstenberg, extend the theory of $(m,p,c)$-systems of Deuber, Hindman and Lefmann and generalize a classical theorem of Rado regarding partition regularity of linear systems of equations over $\mathbb{N}$ to commutative semigroups.
△ Less
Submitted 11 May, 2016; v1 submitted 10 January, 2015;
originally announced January 2015.