Showing 1–1 of 1 results for author: Omtzigt, E T L

Search v0.5.6 released 2020-02-24

arXiv:2408.13400 [pdf, ps, other]

math.NA

Iterative Refinement with Low-Precision Posits

Authors: James Quinlan, E. Theodore L. Omtzigt

Abstract: This research investigates using a mixed-precision iterative refinement method using posit numbers instead of the standard IEEE floating-point format. The method is applied to solve a general linear system represented by the equation $Ax = b$, where $A$ is a large sparse matrix. Various scaling techniques, such as row and column equilibration, map the matrix entries to higher-density regions of ma… ▽ More This research investigates using a mixed-precision iterative refinement method using posit numbers instead of the standard IEEE floating-point format. The method is applied to solve a general linear system represented by the equation $Ax = b$, where $A$ is a large sparse matrix. Various scaling techniques, such as row and column equilibration, map the matrix entries to higher-density regions of machine numbers before performing the $O(n^3)$ factorization operation. Low-precision LU factorization followed by forward/backward substitution provides an initial estimate. The results demonstrate that a 16-bit posit configuration combined with equilibration produces accuracy comparable to IEEE half-precision (fp16), indicating a potential for achieving a balance between efficiency and accuracy. △ Less

Submitted 27 August, 2024; v1 submitted 23 August, 2024; originally announced August 2024.

Comments: preprint CoNGA'24

MSC Class: 65F05 (Primary) 65F50 (Secondary)

Search v0.5.6 released 2020-02-24