-
arXiv:2505.14371
Layer-wise Quantization for Quantized Optimistic Dual Averaging
Abstract: Modern deep neural networks exhibit heterogeneity across numerous layers of various types such as residuals, multi-head attention, etc., due to varying structures (dimensions, activation functions, etc.), distinct representation characteristics, which impact predictions. We develop a general layer-wise quantization framework with tight variance and code-length bounds, adapting to the heterogeneiti…
Submitted 20 May, 2025; originally announced May 2025.
Comments: Accepted at the International Conference on Machine Learning (ICML 2025)
-
NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization
Abstract: As the size and complexity of models and datasets grow, so does the need for communication-efficient variants of stochastic gradient descent that can be deployed to perform parallel model training. One popular communication-compression method for data-parallel SGD is QSGD (Alistarh et al., 2017), which quantizes and encodes gradients to reduce communication costs. The baseline variant of QSGD prov…
Submitted 1 May, 2021; v1 submitted 28 April, 2021; originally announced April 2021.
Comments: This entry is redundant and was created in error. See arXiv:1908.06077 for the latest version
-
Graph Symmetry Detection and Canonical Labeling: Differences and Synergies
Abstract: Symmetries of combinatorial objects are known to complicate search algorithms, but such obstacles can often be removed by detecting symmetries early and discarding symmetric subproblems. Canonical labeling of combinatorial objects facilitates easy equivalence checking through quick matching. All existing canonical labeling software also finds symmetries, but the fastest symmetry-finding software d…
Submitted 30 August, 2012; originally announced August 2012.
Comments: 15 pages, 10 figures, 1 table, Turing-100
MSC Class: 68R10
Journal ref: H. Katebi, K. A. Sakallah and I. L. Markov, "Graph Symmetry Detection and Canonical Labeling: Differences and Synergies" in Proc. Turing-100, EPIC vol. 10, pp. 181-195, Manchester, UK, 2012
-
Conflict Anticipation in the Search for Graph Automorphisms
Abstract: Effective search for graph automorphisms allows identifying symmetries in many discrete structures, ranging from chemical molecules to microprocessor circuits. Using this type of structure can enhance visualization as well as speed up computational optimization and verification. Competitive algorithms for the graph automorphism problem are based on efficient partition refinement augmented with gro…
Submitted 30 August, 2012; originally announced August 2012.
Comments: 15 pages, 9 figures, 1 table, Int'l Conf. on Logic for Programming, Artificial Intelligence and Reasoning (LPAR)
MSC Class: 68R10
Journal ref: H. Katebi, K. A. Sakallah and I. L. Markov, "Conflict Anticipation in the Search for Graph Automorphisms" in Proc. Int'l Conf. on Logic for Programming, Artificial Intelligence and Reasoning (LPAR), pp. 243-257, Merida, Venezuela, 2012
-
arXiv:0707.3622
Constant-degree graph expansions that preserve the treewidth
Abstract: Many hard algorithmic problems dealing with graphs, circuits, formulas and constraints admit polynomial-time upper bounds if the underlying graph has small treewidth. The same problems often encourage reducing the maximal degree of vertices to simplify theoretical arguments or address practical concerns. Such degree reduction can be performed through a sequence of splittings of vertices, resulti…
Submitted 24 July, 2007; originally announced July 2007.
Comments: 12 pages, 6 figures, the main result used by quant-ph/0511070
ACM Class: F.2.2; G.2.2
Journal ref: Algorithmica, Volume 59, Number 4, 461-470, 2011