-
QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities
Authors:
Sai Kiran Narayanaswami,
Gopalakrishnan Srinivasan,
Balaraman Ravindran
Abstract:
As machine learning gets deployed more and more widely, and model sizes continue to grow, improving computational efficiency during model inference has become a key challenge. In many commonly used model architectures, including Transformers, a significant portion of the inference computation is comprised of exponential non-linearities such as Softmax. In this work, we develop QuAKE, a collection…
▽ More
As machine learning gets deployed more and more widely, and model sizes continue to grow, improving computational efficiency during model inference has become a key challenge. In many commonly used model architectures, including Transformers, a significant portion of the inference computation is comprised of exponential non-linearities such as Softmax. In this work, we develop QuAKE, a collection of novel operators that leverage certain properties of IEEE-754 floating point representations to quickly approximate the exponential function without requiring specialized hardware, extra memory, or precomputation. We propose optimizations that enhance the efficiency of QuAKE in commonly used exponential non-linearities such as Softmax, GELU, and the Logistic function. Our benchmarks demonstrate substantial inference speed improvements between 10% and 35% on server CPUs, and 5% and 45% on embedded and mobile-scale CPUs for a variety of model architectures and sizes. Evaluations of model performance on standard datasets and tasks from various domains show that QuAKE operators are able to provide sizable speed benefits with little to no loss of performance on downstream tasks.
△ Less
Submitted 30 November, 2024;
originally announced December 2024.
-
Reflections on Euler's reflection formula and an additive analogue of Legendre's duplication formula
Authors:
Ritesh Goenka,
Gopala Krishna Srinivasan
Abstract:
In this note, we look at some of the less explored aspects of the gamma function. We provide a new proof of Euler's reflection formula and discuss its significance in the theory of special functions. We also discuss a result of Landau concerning the determination of values of the gamma function using functional identities. We show that his result is sharp and extend it to complex arguments. In 184…
▽ More
In this note, we look at some of the less explored aspects of the gamma function. We provide a new proof of Euler's reflection formula and discuss its significance in the theory of special functions. We also discuss a result of Landau concerning the determination of values of the gamma function using functional identities. We show that his result is sharp and extend it to complex arguments. In 1848, Oskar Schlömilch gave an interesting additive analogue of the duplication formula. We prove a generalized version of this formula using the theory of hypergeometric functions.
△ Less
Submitted 17 May, 2020;
originally announced May 2020.
-
Free groups, covering spaces and Artin's theorem
Authors:
Gopala Krishna Srinivasan
Abstract:
In this expository note we provide a proof of Artin's theorem which states that the commutator subgroup of a free group on two generators is not finitely generated. The proof employs the infinite grid as in two other proofs in the literature mentioned in the note but takes a somewhat different approach which seems to be of didactic value.
In this expository note we provide a proof of Artin's theorem which states that the commutator subgroup of a free group on two generators is not finitely generated. The proof employs the infinite grid as in two other proofs in the literature mentioned in the note but takes a somewhat different approach which seems to be of didactic value.
△ Less
Submitted 14 May, 2019; v1 submitted 13 April, 2019;
originally announced April 2019.
-
The Exterior Derivative - A direct approach
Authors:
Gopala Krishna Srinivasan
Abstract:
In this note we provide a direct approach to the most basic operator in this theory namely the exterior derivative. The crucial ingredient is a calculus lemma based on determinants. We maintain the view that in a first course at least this direct approach is preferable to the more abstract one based on characterization of the exterior derivative in terms of its properties.
In this note we provide a direct approach to the most basic operator in this theory namely the exterior derivative. The crucial ingredient is a calculus lemma based on determinants. We maintain the view that in a first course at least this direct approach is preferable to the more abstract one based on characterization of the exterior derivative in terms of its properties.
△ Less
Submitted 28 August, 2018;
originally announced August 2018.
-
Reduced-Order Modeling through Machine Learning Approaches for Brittle Fracture Applications
Authors:
A. Hunter,
B. A. Moore,
M. K. Mudunuru,
V. T. Chau,
R. L. Miller,
R. B. Tchoua,
C. Nyshadham,
S. Karra,
D. O. Malley,
E. Rougier,
H. S. Viswanathan,
G. Srinivasan
Abstract:
In this paper, five different approaches for reduced-order modeling of brittle fracture in geomaterials, specifically concrete, are presented and compared. Four of the five methods rely on machine learning (ML) algorithms to approximate important aspects of the brittle fracture problem. In addition to the ML algorithms, each method incorporates different physics-based assumptions in order to reduc…
▽ More
In this paper, five different approaches for reduced-order modeling of brittle fracture in geomaterials, specifically concrete, are presented and compared. Four of the five methods rely on machine learning (ML) algorithms to approximate important aspects of the brittle fracture problem. In addition to the ML algorithms, each method incorporates different physics-based assumptions in order to reduce the computational complexity while maintaining the physics as much as possible. This work specifically focuses on using the ML approaches to model a 2D concrete sample under low strain rate pure tensile loading conditions with 20 preexisting cracks present. A high-fidelity finite element-discrete element model is used to both produce a training dataset of 150 simulations and an additional 35 simulations for validation. Results from the ML approaches are directly compared against the results from the high-fidelity model. Strengths and weaknesses of each approach are discussed and the most important conclusion is that a combination of physics-informed and data-driven features are necessary for emulating the physics of crack propagation, interaction and coalescence. All of the models presented here have runtimes that are orders of magnitude faster than the original high-fidelity model and pave the path for developing accurate reduced order models that could be used to inform larger length-scale models with important sub-scale physics that often cannot be accounted for due to computational cost.
△ Less
Submitted 5 June, 2018;
originally announced June 2018.
-
A unified approach to the integrals of Mellin--Barnes--Hecke type
Authors:
Gopala Krishna Srinivasan
Abstract:
In this paper we provide a unified approach to a family of integrals of Mellin--Barnes type using distribution theory and Fourier transforms.
Interesting features arise in many of the cases which call for the application of pull-backs of distributions via smooth submersive maps defined by Hörmander. We derive by this method the integrals of Hecke and Sonine relating to various types of Bessel fu…
▽ More
In this paper we provide a unified approach to a family of integrals of Mellin--Barnes type using distribution theory and Fourier transforms.
Interesting features arise in many of the cases which call for the application of pull-backs of distributions via smooth submersive maps defined by Hörmander. We derive by this method the integrals of Hecke and Sonine relating to various types of Bessel functions which have found applications in analytic and algebraic number theory.
△ Less
Submitted 30 August, 2012;
originally announced August 2012.
-
On a remarkable formula of Ramanujan
Authors:
Debraj Chakrabarti,
Gopala Krishna Srinivasan
Abstract:
A simple proof of Ramanujan's formula for the Fourier transform of the square of the modulus of the Gamma function restricted to a vertical line in the right half-plane is given.
The result is extended to vertical lines in the left half-plane by solving an inhomogeneous ODE. We then use it to calculate the jump across the imaginary axis.
A simple proof of Ramanujan's formula for the Fourier transform of the square of the modulus of the Gamma function restricted to a vertical line in the right half-plane is given.
The result is extended to vertical lines in the left half-plane by solving an inhomogeneous ODE. We then use it to calculate the jump across the imaginary axis.
△ Less
Submitted 22 June, 2012;
originally announced June 2012.