Showing 1–5 of 5 results for author: Giri, R

  1. arXiv:2502.17453  [pdf, other]

    nlin.CD stat.AP

    Permutation extropy: a time series complexity measure

    Authors: Ritik Roshan Giri, Suchandan Kayal

    Abstract: On account of a greater need for understanding the complexity of time series like physiological time series, financial time series, and many more that enter into picture for their inculpation with real-world problems, several complexity parameters have already been proposed in the literature. Permutation entropy, Lyapunov exponents are such complexity parameters out of many. In this article, we in…

    Submitted 8 February, 2025; originally announced February 2025.

  2. arXiv:2008.04470  [pdf, other]

    eess.AS cs.LG cs.NE cs.SD stat.ML

    PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss

    Authors: Umut Isik, Ritwik Giri, Neerad Phansalkar, Jean-Marc Valin, Karim Helwani, Arvindh Krishnaswamy

    Abstract: Neural network applications generally benefit from larger-sized models, but for current speech enhancement models, larger scale networks often suffer from decreased robustness to the variety of real-world use cases beyond what is encountered in training data. We introduce several innovations that lead to better large neural networks for speech enhancement. The novel PoCoNet architecture is a convo…

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: 5 pages, 3 figures, INTERSPEECH 2020

  3. arXiv:2001.11542  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Channel-Attention Dense U-Net for Multichannel Speech Enhancement

    Authors: Bahareh Tolooshams, Ritwik Giri, Andrew H. Song, Umut Isik, Arvindh Krishnaswamy

    Abstract: Supervised deep learning has gained significant attention for speech enhancement recently. The state-of-the-art deep learning methods perform the task by learning a ratio/binary mask that is applied to the mixture in the time-frequency domain to produce the clean speech. Despite the great performance in the single-channel setting, these frameworks lag in performance in the multichannel setting as…

    Submitted 30 January, 2020; originally announced January 2020.

  4. arXiv:1604.02181  [pdf, ps, other]

    stat.ML

    A Unified Framework for Sparse Non-Negative Least Squares using Multiplicative Updates and the Non-Negative Matrix Factorization Problem

    Authors: Igor Fedorov, Alican Nalci, Ritwik Giri, Bhaskar D. Rao, Truong Q. Nguyen, Harinath Garudadri

    Abstract: We study the sparse non-negative least squares (S-NNLS) problem. S-NNLS occurs naturally in a wide variety of applications where an unknown, non-negative quantity must be recovered from linear measurements. We present a unified framework for S-NNLS based on a rectified power exponential scale mixture prior on the sparse codes. We show that the proposed framework encompasses a large class of S-NNLS…

    Submitted 2 January, 2018; v1 submitted 7 April, 2016; originally announced April 2016.

    Comments: To appear in Signal Processing

  5. Type I and Type II Bayesian Methods for Sparse Signal Recovery using Scale Mixtures

    Authors: Ritwik Giri, Bhaskar D. Rao

    Abstract: In this paper, we propose a generalized scale mixture family of distributions, namely the Power Exponential Scale Mixture (PESM) family, to model the sparsity inducing priors currently in use for sparse signal recovery (SSR). We show that the successful and popular methods such as LASSO, Reweighted $\ell_1$ and Reweighted $\ell_2$ methods can be formulated in a unified manner in a maximum a poste…

    Submitted 17 July, 2015; originally announced July 2015.

    Comments: Under Review