Search | arXiv e-print repository

doi 10.1016/j.aop.2017.11.013

Benford's Law and Continuous Dependent Random Variables

Authors: Thealexa Becker, David Burt, Taylor C. Corcoran, Alec Greaves-Tunnell, Joseph R. Iafrate, Joy Jing, Steven J. Miller, Jaclyn D. Porfilio, Ryan Ronan, Jirapat Samranvedhya, Frederick W. Strauch, Blaine Talbut

Abstract: Many mathematical, man-made and natural systems exhibit a leading-digit bias, where a first digit (base 10) of 1 occurs not 11\% of the time, as one would expect if all digits were equally likely, but rather 30\%. This phenomenon is known as Benford's Law. Analyzing which datasets adhere to Benford's Law and how quickly Benford behavior sets in are the two most important problems in the field. Mos… ▽ More Many mathematical, man-made and natural systems exhibit a leading-digit bias, where a first digit (base 10) of 1 occurs not 11\% of the time, as one would expect if all digits were equally likely, but rather 30\%. This phenomenon is known as Benford's Law. Analyzing which datasets adhere to Benford's Law and how quickly Benford behavior sets in are the two most important problems in the field. Most previous work studied systems of independent random variables, and relied on the independence in their analyses. Inspired by natural processes such as particle decay, we study the dependent random variables that emerge from models of decomposition of conserved quantities. We prove that in many instances the distribution of lengths of the resulting pieces converges to Benford behavior as the number of divisions grow, and give several conjectures for other fragmentation processes. The main difficulty is that the resulting random variables are dependent. We handle this by using tools from Fourier analysis and irrationality exponents to obtain quantified convergence rates as well as introducing and developing techniques to measure and control the dependencies. The construction of these tools is one of the major motivations of this work, as our approach can be applied to many other dependent systems. As an example, we show that the $n!$ entries in the determinant expansions of $n\times n$ matrices with entries independently drawn from nice random variables converges to Benford's Law. △ Less

Submitted 9 March, 2018; v1 submitted 22 September, 2013; originally announced September 2013.

Comments: Version 4.0, 33 pages, 7 figures, keywords: Benford's Law, Fourier transform, Mellin transform, dependent random variables, fragmentation. This replaces Benford's Law and Continuous Dependent Random Variables, arXiv:1111.0568

MSC Class: 60A10; 11K06 (primary); (secondary) 60E10

Journal ref: Annals of Physics 388 (2018), 350--381

arXiv:1111.0568

Benford's Law and Continuous Dependent Random Variables

Authors: Thealexa Becker, Alec Greaves-Tunnell, Steven J. Miller, Ryan Ronan, Frederick W. Strauch

Abstract: Many systems exhibit a digit bias. For example, the first digit base 10 of the Fibonacci numbers, or of $2^n$, equals 1 not 10% or 11% of the time, as one would expect if all digits were equally likely, but about 30% of the time. This phenomenon, known as Benford's Law, has many applications, ranging from detecting tax fraud for the IRS to analyzing round-off errors in computer science. The cent… ▽ More Many systems exhibit a digit bias. For example, the first digit base 10 of the Fibonacci numbers, or of $2^n$, equals 1 not 10% or 11% of the time, as one would expect if all digits were equally likely, but about 30% of the time. This phenomenon, known as Benford's Law, has many applications, ranging from detecting tax fraud for the IRS to analyzing round-off errors in computer science. The central question is determining which data sets follow Benford's law. Inspired by natural processes such as particle decay, our work examines models for the decomposition of conserved quantities. We prove that in many instances the distribution of lengths of the resulting pieces converges to Benford behavior as the number of divisions grow. The main difficulty is that the resulting random variables are dependent, which we handle by a careful analysis of the dependencies and tools from Fourier analysis to obtain quantified convergence rates. △ Less

Submitted 22 September, 2013; v1 submitted 2 November, 2011; originally announced November 2011.

Comments: Version 1.0, 16 pages, 1 figure. This paper is being withdrawn and replaced with an expanded version with many more authors

MSC Class: 11K06; 60A10 (primary); 60E10 (secondary)

arXiv:1008.4812 [pdf, ps, other]

The Limiting Spectral Measure for Ensembles of Symmetric Block Circulant Matrices

Authors: Murat Kologlu, Gene S. Kopp, Steven J. Miller, Frederick W. Strauch, Wentao Xiong

Abstract: Given an ensemble of NxN random matrices, a natural question to ask is whether or not the empirical spectral measures of typical matrices converge to a limiting spectral measure as N --> oo. While this has been proved for many thin patterned ensembles sitting inside all real symmetric matrices, frequently there is no nice closed form expression for the limiting measure. Further, current theorems p… ▽ More Given an ensemble of NxN random matrices, a natural question to ask is whether or not the empirical spectral measures of typical matrices converge to a limiting spectral measure as N --> oo. While this has been proved for many thin patterned ensembles sitting inside all real symmetric matrices, frequently there is no nice closed form expression for the limiting measure. Further, current theorems provide few pictures of transitions between ensembles. We consider the ensemble of symmetric m-block circulant matrices with entries i.i.d.r.v. These matrices have toroidal diagonals periodic of period m. We view m as a "dial" we can "turn" from the thin ensemble of symmetric circulant matrices, whose limiting eigenvalue density is a Gaussian, to all real symmetric matrices, whose limiting eigenvalue density is a semi-circle. The limiting eigenvalue densities f_m show a visually stunning convergence to the semi-circle as m tends to infinity, which we prove. In contrast to most studies of patterned matrix ensembles, our paper gives explicit closed form expressions for the densities. We prove that f_m is the product of a Gaussian and a degree 2m-2 polynomial; the formula equals that of the m x m Gaussian Unitary Ensemble (GUE). The proof is by the moments. The new feature, which allows us to obtain closed form expressions, is converting the central combinatorial problem in the moment calculation into an equivalent counting problem in algebraic topology. We end with a generalization of the m-block circulant pattern, dropping the assumption that the m random variables be distinct. We prove that the limiting spectral distribution exists and is determined by the pattern of the independent elements within an m-period, depending on not only the frequency at which each element appears, but also the way the elements are arranged. △ Less

Submitted 28 June, 2011; v1 submitted 27 August, 2010; originally announced August 2010.

Comments: 39 pages, 10 figures; version 3.1 (includes a new appendix on generalized m-block circulant ensembles)

arXiv:1006.0163 [pdf, ps, other]

A combinatorial identity for studying Sato-Tate type problems

Authors: Steven J. Miller, M. Ram Murty, Frederick W. Strauch

Abstract: We derive a combinatorial identity which is useful in studying the distribution of Fourier coefficients of L-functions by allowing us to pass from knowledge of moments of the coefficients to the distribution of the coefficients. We derive a combinatorial identity which is useful in studying the distribution of Fourier coefficients of L-functions by allowing us to pass from knowledge of moments of the coefficients to the distribution of the coefficients. △ Less

Submitted 1 June, 2010; originally announced June 2010.

Comments: This paper contains the proof of a combinatorial identity used to study effective equidistribution laws for the Fourier coefficients of elliptic curve L-functions investigated by the first two authors in https://arxiv.boxedpaper.com/abs/1004.2753

MSC Class: 05A40; 05A10 (primary) 33C05; 11K38; 14H52; 11M41 (secondary)

Showing 1–4 of 4 results for author: Strauch, F W