Search | arXiv e-print repository

A Note On the Size of Largest Bins Using Placement With Linear Transformations

Abstract: We study the placement of n balls into n bins where balls and bins are represented as two vector spaces over Z 2 . The placement is done according to a linear transformation between the two vector spaces. We analyze the expected size of a largest bin. The only currently known upper bound is O(log n log log n) by Alon et al. and holds for placing n log n balls into n bins. We show that this bound c… ▽ More We study the placement of n balls into n bins where balls and bins are represented as two vector spaces over Z 2 . The placement is done according to a linear transformation between the two vector spaces. We analyze the expected size of a largest bin. The only currently known upper bound is O(log n log log n) by Alon et al. and holds for placing n log n balls into n bins. We show that this bound can be improved to O(log n) in the case when n balls are placed into n bins. We use the same basic technique as Alon et al. but give a tighter analysis for this case. △ Less

Submitted 8 October, 2018; originally announced October 2018.

arXiv:1501.00162 [pdf, other]

Expected number of uniformly distributed balls in a most loaded bin using placement with simple linear functions

Authors: Martin Babka

Abstract: We estimate the size of a most loaded bin in the setting when the balls are placed into the bins using a random linear function in a finite field. The balls are chosen from a transformed interval. We show that in this setting the expected load of the most loaded bins is constant. This is an interesting fact because using fully random hash functions with the same class of input sets leads to an e… ▽ More We estimate the size of a most loaded bin in the setting when the balls are placed into the bins using a random linear function in a finite field. The balls are chosen from a transformed interval. We show that in this setting the expected load of the most loaded bins is constant. This is an interesting fact because using fully random hash functions with the same class of input sets leads to an expectation of $Θ\left(\frac{\log m}{\log \log m}\right)$ balls in most loaded bins where $m$ is the number of balls and bins. Although the family of the functions is quite common the size of largest bins was not known even in this simple case. △ Less

Submitted 31 December, 2014; originally announced January 2015.

ACM Class: E.2

arXiv:1210.3197 [pdf, ps, other]

On Online Labeling with Polynomially Many Labels

Authors: Martin Babka, Jan Bulánek, Vladimír Čunát, Michal Koucký, Michael Saks

Abstract: In the online labeling problem with parameters n and m we are presented with a sequence of n keys from a totally ordered universe U and must assign each arriving key a label from the label set {1,2,...,m} so that the order of labels (strictly) respects the ordering on U. As new keys arrive it may be necessary to change the labels of some items; such changes may be done at any time at unit cost for… ▽ More In the online labeling problem with parameters n and m we are presented with a sequence of n keys from a totally ordered universe U and must assign each arriving key a label from the label set {1,2,...,m} so that the order of labels (strictly) respects the ordering on U. As new keys arrive it may be necessary to change the labels of some items; such changes may be done at any time at unit cost for each change. The goal is to minimize the total cost. An alternative formulation of this problem is the file maintenance problem, in which the items, instead of being labeled, are maintained in sorted order in an array of length m, and we pay unit cost for moving an item. For the case m=cn for constant c>1, there are known algorithms that use at most O(n log(n)^2) relabelings in total [Itai, Konheim, Rodeh, 1981], and it was shown recently that this is asymptotically optimal [Bulánek, Koucký, Saks, 2012]. For the case of m=Θ(n^C) for C>1, algorithms are known that use O(n log n) relabelings. A matching lower bound was claimed in [Dietz, Seiferas, Zhang, 2004]. That proof involved two distinct steps: a lower bound for a problem they call prefix bucketing and a reduction from prefix bucketing to online labeling. The reduction seems to be incorrect, leaving a (seemingly significant) gap in the proof. In this paper we close the gap by presenting a correct reduction to prefix bucketing. Furthermore we give a simplified and improved analysis of the prefix bucketing lower bound. This improvement allows us to extend the lower bounds for online labeling to the case where the number m of labels is superpolynomial in n. In particular, for superpolynomial m we get an asymptotically optimal lower bound Ω((n log n) / (log log m - log log n)). △ Less

Submitted 11 October, 2012; originally announced October 2012.

Comments: 15 pages, Presented at European Symposium on Algorithms 2012

Showing 1–3 of 3 results for author: Babka, M