Search | arXiv e-print repository

QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities

Authors: Sai Kiran Narayanaswami, Gopalakrishnan Srinivasan, Balaraman Ravindran

Abstract: As machine learning gets deployed more and more widely, and model sizes continue to grow, improving computational efficiency during model inference has become a key challenge. In many commonly used model architectures, including Transformers, a significant portion of the inference computation is comprised of exponential non-linearities such as Softmax. In this work, we develop QuAKE, a collection… ▽ More As machine learning gets deployed more and more widely, and model sizes continue to grow, improving computational efficiency during model inference has become a key challenge. In many commonly used model architectures, including Transformers, a significant portion of the inference computation is comprised of exponential non-linearities such as Softmax. In this work, we develop QuAKE, a collection of novel operators that leverage certain properties of IEEE-754 floating point representations to quickly approximate the exponential function without requiring specialized hardware, extra memory, or precomputation. We propose optimizations that enhance the efficiency of QuAKE in commonly used exponential non-linearities such as Softmax, GELU, and the Logistic function. Our benchmarks demonstrate substantial inference speed improvements between 10% and 35% on server CPUs, and 5% and 45% on embedded and mobile-scale CPUs for a variety of model architectures and sizes. Evaluations of model performance on standard datasets and tasks from various domains show that QuAKE operators are able to provide sizable speed benefits with little to no loss of performance on downstream tasks. △ Less

Submitted 30 November, 2024; originally announced December 2024.

arXiv:2005.08237 [pdf, ps, other]

doi 10.1007/s12045-021-1136-x

Reflections on Euler's reflection formula and an additive analogue of Legendre's duplication formula

Authors: Ritesh Goenka, Gopala Krishna Srinivasan

Abstract: In this note, we look at some of the less explored aspects of the gamma function. We provide a new proof of Euler's reflection formula and discuss its significance in the theory of special functions. We also discuss a result of Landau concerning the determination of values of the gamma function using functional identities. We show that his result is sharp and extend it to complex arguments. In 184… ▽ More In this note, we look at some of the less explored aspects of the gamma function. We provide a new proof of Euler's reflection formula and discuss its significance in the theory of special functions. We also discuss a result of Landau concerning the determination of values of the gamma function using functional identities. We show that his result is sharp and extend it to complex arguments. In 1848, Oskar Schlömilch gave an interesting additive analogue of the duplication formula. We prove a generalized version of this formula using the theory of hypergeometric functions. △ Less

Submitted 17 May, 2020; originally announced May 2020.

MSC Class: 33B15 (Primary) 44A05; 33C05 (Secondary)

Journal ref: Resonance 26 (2021) 367-386

arXiv:1904.07689 [pdf, ps, other]

Free groups, covering spaces and Artin's theorem

Authors: Gopala Krishna Srinivasan

Abstract: In this expository note we provide a proof of Artin's theorem which states that the commutator subgroup of a free group on two generators is not finitely generated. The proof employs the infinite grid as in two other proofs in the literature mentioned in the note but takes a somewhat different approach which seems to be of didactic value. In this expository note we provide a proof of Artin's theorem which states that the commutator subgroup of a free group on two generators is not finitely generated. The proof employs the infinite grid as in two other proofs in the literature mentioned in the note but takes a somewhat different approach which seems to be of didactic value. △ Less

Submitted 14 May, 2019; v1 submitted 13 April, 2019; originally announced April 2019.

arXiv:1808.09627 [pdf, ps, other]

The Exterior Derivative - A direct approach

Authors: Gopala Krishna Srinivasan

Abstract: In this note we provide a direct approach to the most basic operator in this theory namely the exterior derivative. The crucial ingredient is a calculus lemma based on determinants. We maintain the view that in a first course at least this direct approach is preferable to the more abstract one based on characterization of the exterior derivative in terms of its properties. In this note we provide a direct approach to the most basic operator in this theory namely the exterior derivative. The crucial ingredient is a calculus lemma based on determinants. We maintain the view that in a first course at least this direct approach is preferable to the more abstract one based on characterization of the exterior derivative in terms of its properties. △ Less

Submitted 28 August, 2018; originally announced August 2018.

MSC Class: 58A10

arXiv:1806.01949 [pdf, ps, other]

Reduced-Order Modeling through Machine Learning Approaches for Brittle Fracture Applications

Authors: A. Hunter, B. A. Moore, M. K. Mudunuru, V. T. Chau, R. L. Miller, R. B. Tchoua, C. Nyshadham, S. Karra, D. O. Malley, E. Rougier, H. S. Viswanathan, G. Srinivasan

Abstract: In this paper, five different approaches for reduced-order modeling of brittle fracture in geomaterials, specifically concrete, are presented and compared. Four of the five methods rely on machine learning (ML) algorithms to approximate important aspects of the brittle fracture problem. In addition to the ML algorithms, each method incorporates different physics-based assumptions in order to reduc… ▽ More In this paper, five different approaches for reduced-order modeling of brittle fracture in geomaterials, specifically concrete, are presented and compared. Four of the five methods rely on machine learning (ML) algorithms to approximate important aspects of the brittle fracture problem. In addition to the ML algorithms, each method incorporates different physics-based assumptions in order to reduce the computational complexity while maintaining the physics as much as possible. This work specifically focuses on using the ML approaches to model a 2D concrete sample under low strain rate pure tensile loading conditions with 20 preexisting cracks present. A high-fidelity finite element-discrete element model is used to both produce a training dataset of 150 simulations and an additional 35 simulations for validation. Results from the ML approaches are directly compared against the results from the high-fidelity model. Strengths and weaknesses of each approach are discussed and the most important conclusion is that a combination of physics-informed and data-driven features are necessary for emulating the physics of crack propagation, interaction and coalescence. All of the models presented here have runtimes that are orders of magnitude faster than the original high-fidelity model and pave the path for developing accurate reduced order models that could be used to inform larger length-scale models with important sub-scale physics that often cannot be accounted for due to computational cost. △ Less

Submitted 5 June, 2018; originally announced June 2018.

Comments: 25 pages, 8 figures

arXiv:1208.6079 [pdf, ps, other]

A unified approach to the integrals of Mellin--Barnes--Hecke type

Authors: Gopala Krishna Srinivasan

Abstract: In this paper we provide a unified approach to a family of integrals of Mellin--Barnes type using distribution theory and Fourier transforms. Interesting features arise in many of the cases which call for the application of pull-backs of distributions via smooth submersive maps defined by Hörmander. We derive by this method the integrals of Hecke and Sonine relating to various types of Bessel fu… ▽ More In this paper we provide a unified approach to a family of integrals of Mellin--Barnes type using distribution theory and Fourier transforms. Interesting features arise in many of the cases which call for the application of pull-backs of distributions via smooth submersive maps defined by Hörmander. We derive by this method the integrals of Hecke and Sonine relating to various types of Bessel functions which have found applications in analytic and algebraic number theory. △ Less

Submitted 30 August, 2012; originally announced August 2012.

Comments: The paper has been accepted for publication in Expositiones Mathematicae

arXiv:1206.5122 [pdf, ps, other]

On a remarkable formula of Ramanujan

Authors: Debraj Chakrabarti, Gopala Krishna Srinivasan

Abstract: A simple proof of Ramanujan's formula for the Fourier transform of the square of the modulus of the Gamma function restricted to a vertical line in the right half-plane is given. The result is extended to vertical lines in the left half-plane by solving an inhomogeneous ODE. We then use it to calculate the jump across the imaginary axis. A simple proof of Ramanujan's formula for the Fourier transform of the square of the modulus of the Gamma function restricted to a vertical line in the right half-plane is given. The result is extended to vertical lines in the left half-plane by solving an inhomogeneous ODE. We then use it to calculate the jump across the imaginary axis. △ Less

Submitted 22 June, 2012; originally announced June 2012.

Comments: To appear in Archiv der Mathematik

MSC Class: 33B15

Showing 1–7 of 7 results for author: Srinivasan, G