A Note On the Size of Largest Bins Using Placement With Linear Transformations
Authors:
Martin Babka
Abstract:
We study the placement of n balls into n bins where balls and bins are represented as two vector spaces over $\mathbb{Z}_2$. The placement is done according to a linear transformation between the two vector spaces. We analyze the expected size of a largest bin. The only currently known upper bound is O(log n log log n) by Alon et al. and holds for placing n log n balls into n bins. We show that this bound can be improved to O(log n) in the case when n balls are placed into n bins. We use the same basic technique as Alon et al. but give a tighter analysis for this case.
Submitted 8 October, 2018;
originally announced October 2018.
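The placement scheme described in the abstract above can be illustrated with a minimal sketch. It is not taken from the paper: the bit-mask representation of the matrix and the names `random_linear_map`, `apply_map`, and `max_load` are assumptions made for illustration. Balls are integers read as bit vectors over $\mathbb{Z}_2$, and each output bit is the parity of a masked input.

```python
import random
from collections import Counter


def random_linear_map(u_bits, b_bits, rng):
    """Pick a random b_bits x u_bits matrix over Z_2, one integer bit mask per row.

    Hypothetical helper: the paper does not prescribe this representation.
    """
    return [rng.getrandbits(u_bits) for _ in range(b_bits)]


def apply_map(rows, x):
    """Multiply the matrix by the bit vector x over Z_2 and return the bin index."""
    h = 0
    for i, row in enumerate(rows):
        # Inner product over Z_2 is the parity of the bitwise AND.
        parity = bin(row & x).count("1") & 1
        h |= parity << i
    return h


def max_load(rows, balls):
    """Return the size of a largest bin after placing every ball."""
    loads = Counter(apply_map(rows, x) for x in balls)
    return max(loads.values())
```

Calling `max_load(random_linear_map(20, 10, random.Random()), balls)` for a set `balls` of 1024 distinct 20-bit integers gives one sample of the quantity whose expectation the abstract bounds; averaging over many random maps gives an empirical estimate.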
Expected number of uniformly distributed balls in a most loaded bin using placement with simple linear functions
Authors:
Martin Babka
Abstract:
We estimate the size of a most loaded bin in the setting when the balls are placed into the bins using a random linear function in a finite field. The balls are chosen from a transformed interval. We show that in this setting the expected load of the most loaded bins is constant.
This is an interesting fact because using fully random hash functions with the same class of input sets leads to an expectation of $\Theta\left(\frac{\log m}{\log \log m}\right)$ balls in most loaded bins where $m$ is the number of balls and bins.
Although this family of functions is quite common, the size of the largest bins was not known even in this simple case.
Submitted 31 December, 2014;
originally announced January 2015.
On Online Labeling with Polynomially Many Labels
Authors:
Martin Babka,
Jan Bulánek,
Vladimír Čunát,
Michal Koucký,
Michael Saks
Abstract:
In the online labeling problem with parameters n and m we are presented with a sequence of n keys from a totally ordered universe U and must assign each arriving key a label from the label set {1,2,...,m} so that the order of labels (strictly) respects the ordering on U. As new keys arrive it may be necessary to change the labels of some items; such changes may be done at any time at unit cost for each change. The goal is to minimize the total cost. An alternative formulation of this problem is the file maintenance problem, in which the items, instead of being labeled, are maintained in sorted order in an array of length m, and we pay unit cost for moving an item.
For the case m=cn for constant c>1, there are known algorithms that use at most O(n log(n)^2) relabelings in total [Itai, Konheim, Rodeh, 1981], and it was shown recently that this is asymptotically optimal [Bulánek, Koucký, Saks, 2012]. For the case of m=Θ(n^C) for C>1, algorithms are known that use O(n log n) relabelings. A matching lower bound was claimed in [Dietz, Seiferas, Zhang, 2004]. That proof involved two distinct steps: a lower bound for a problem they call prefix bucketing and a reduction from prefix bucketing to online labeling. The reduction seems to be incorrect, leaving a (seemingly significant) gap in the proof. In this paper we close the gap by presenting a correct reduction to prefix bucketing. Furthermore we give a simplified and improved analysis of the prefix bucketing lower bound. This improvement allows us to extend the lower bounds for online labeling to the case where the number m of labels is superpolynomial in n. In particular, for superpolynomial m we get an asymptotically optimal lower bound Ω((n log n) / (log log m - log log n)).
Submitted 11 October, 2012;
originally announced October 2012.
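To make the cost model of the online labeling problem in the abstract above concrete, here is a minimal sketch of a naive labeler. It is not any of the algorithms cited in the abstract: the class `NaiveLabeler`, the midpoint rule, and the even-respacing fallback are assumptions chosen for illustration, and the sketch assumes distinct keys and counts the first label given to a new key as one unit of cost.

```python
import bisect


class NaiveLabeler:
    """Keep labels from {1, ..., m} in the same order as the keys.

    Hypothetical illustration only; it makes no attempt to match the
    bounds discussed in the abstract.
    """

    def __init__(self, m):
        self.m = m
        self.keys = []    # keys in sorted order
        self.labels = []  # labels[i] belongs to keys[i]; strictly increasing
        self.cost = 0     # total number of label assignments and changes

    def insert(self, key):
        if len(self.keys) + 1 > self.m:
            raise ValueError("more keys than labels")
        i = bisect.bisect_left(self.keys, key)
        lo = self.labels[i - 1] if i > 0 else 0
        hi = self.labels[i] if i < len(self.keys) else self.m + 1
        self.keys.insert(i, key)
        if hi - lo >= 2:
            # A free label exists between the neighbours: take the midpoint.
            self.labels.insert(i, (lo + hi) // 2)
            self.cost += 1
            return
        # No gap: respace all labels evenly and pay for every label that changes.
        old = self.labels[:i] + [None] + self.labels[i:]
        n = len(self.keys)
        new = [(j + 1) * (self.m + 1) // (n + 1) for j in range(n)]
        self.cost += sum(1 for a, b in zip(old, new) if a != b)
        self.labels = new
```

Feeding keys in increasing order exhausts the gap at the right end quickly and triggers the respacing step, which is the kind of adversarial sequence that lower bounds for this problem reason about.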