-
A Unified Hardware Accelerator for Fast Fourier Transform and Number Theoretic Transform
Authors:
Rishabh Shrivastava,
Chaitanya Prasad Ratnala,
Durga Manasa Puli,
Utsav Banerjee
Abstract:
The Number Theoretic Transform (NTT) is an indispensable tool for computing efficient polynomial multiplications in post-quantum lattice-based cryptography. It has strong resemblance with the Fast Fourier Transform (FFT), which is the most widely used algorithm in digital signal processing. In this work, we demonstrate a unified hardware accelerator supporting both 512-point complex FFT as well as…
▽ More
The Number Theoretic Transform (NTT) is an indispensable tool for computing efficient polynomial multiplications in post-quantum lattice-based cryptography. It has strong resemblance with the Fast Fourier Transform (FFT), which is the most widely used algorithm in digital signal processing. In this work, we demonstrate a unified hardware accelerator supporting both 512-point complex FFT as well as 256-point NTT for the recently standardized NIST post-quantum key encapsulation and digital signature algorithms ML-KEM and ML-DSA respectively. Our proposed architecture effectively utilizes the arithmetic circuitry required for complex FFT, and the only additional circuits required are for modular reduction along with modifications in the control logic. Our implementation achieves performance comparable to state-of-the-art ML-KEM / ML-DSA NTT accelerators on FPGA, thus demonstrating how an FFT accelerator can be augmented to support NTT and the unified hardware can be used for both digital signal processing and post-quantum lattice-based cryptography applications.
△ Less
Submitted 15 April, 2025;
originally announced April 2025.
-
aTENNuate: Optimized Real-time Speech Enhancement with Deep SSMs on Raw Audio
Authors:
Yan Ru Pei,
Ritik Shrivastava,
FNU Sidharth
Abstract:
We present aTENNuate, a simple deep state-space autoencoder configured for efficient online raw speech enhancement in an end-to-end fashion. The network's performance is primarily evaluated on raw speech denoising, with additional assessments on tasks such as super-resolution and de-quantization. We benchmark aTENNuate on the VoiceBank + DEMAND and the Microsoft DNS1 synthetic test sets. The netwo…
▽ More
We present aTENNuate, a simple deep state-space autoencoder configured for efficient online raw speech enhancement in an end-to-end fashion. The network's performance is primarily evaluated on raw speech denoising, with additional assessments on tasks such as super-resolution and de-quantization. We benchmark aTENNuate on the VoiceBank + DEMAND and the Microsoft DNS1 synthetic test sets. The network outperforms previous real-time denoising models in terms of PESQ score, parameter count, MACs, and latency. Even as a raw waveform processing model, the model maintains high fidelity to the clean signal with minimal audible artifacts. In addition, the model remains performant even when the noisy input is compressed down to 4000Hz and 4 bits, suggesting general speech enhancement capabilities in low-resource environments. Try it out by pip install attenuate
△ Less
Submitted 19 May, 2025; v1 submitted 5 September, 2024;
originally announced September 2024.
-
Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization
Authors:
Amay Saxena,
Chih-Yuan Chiu,
Joseph Menke,
Ritika Shrivastava,
Shankar Sastry
Abstract:
Simultaneous Localization and Mapping (SLAM) algorithms perform visual-inertial estimation via filtering or batch optimization methods. Empirical evidence suggests that filtering algorithms are computationally faster, while optimization methods are more accurate. This work presents an optimization-based framework that unifies these approaches, and allows users to flexibly implement different desig…
▽ More
Simultaneous Localization and Mapping (SLAM) algorithms perform visual-inertial estimation via filtering or batch optimization methods. Empirical evidence suggests that filtering algorithms are computationally faster, while optimization methods are more accurate. This work presents an optimization-based framework that unifies these approaches, and allows users to flexibly implement different design choices, e.g., the number and types of variables maintained in the algorithm at each time. We prove that filtering methods correspond to specific design choices in our generalized framework. We then reformulate the Multi-State Constrained Kalman Filter (MSCKF), implement the reformulation on challenging image sequence datasets in simulation, and contrast its performance with that of sliding window based filters. Using these results, we explain the relative performance characteristics of these two classes of algorithms in the context of our algorithm. Finally, we illustrate that under different design choices, the empirical performance of our algorithm interpolates between those of state-of-the-art approaches.
△ Less
Submitted 3 August, 2022; v1 submitted 11 December, 2021;
originally announced December 2021.
-
Application of Support Vector Machines for Seismogram Analysis and Differentiation
Authors:
Rohit Kumar Shrivastava
Abstract:
Support Vector Machines (SVM) is a computational technique which has been used in various fields of sciences as a classifier with k-class classification capability, k being 2,3,4, etc. Seismograms of volcanic tremors often contain noises which can prove harmful for correct interpretation. The PCAB station (located in the northern region of Panarea island, Italy) has been recording seismic signals…
▽ More
Support Vector Machines (SVM) is a computational technique which has been used in various fields of sciences as a classifier with k-class classification capability, k being 2,3,4, etc. Seismograms of volcanic tremors often contain noises which can prove harmful for correct interpretation. The PCAB station (located in the northern region of Panarea island, Italy) has been recording seismic signals from a pump installed nearby, corrupting the useful signals from Strombolli volcano. SVM with k=2 classification technique after optimization through grid search has been instrumental in identification and classification of the seismic signals coming from pump, reaching a score of 99.7149% of patterns which match the actual membership of class (determined through cross-validation). The predicted labels of SVM has been used to estimate the pump's duration of activity leading to the declaration of corresponding seismograms redundant (not fit for processing and interpretation). However, when the same trained SVM was used to determine whether the seismogram used by Pino et al., 2011 recorded at the same PCAB station on 4th April, 2003 contained pump's signals or not, SVM showed 100% absence of pump's signals thereby authenticating the research work done in the latter.
△ Less
Submitted 5 March, 2020;
originally announced March 2020.