Skip to main content

Showing 1–5 of 5 results for author: Narayanaswamy, V

Searching in archive eess. Search in all archives.
.
  1. arXiv:2308.00491  [pdf, other

    eess.IV cs.CV

    An L2-Normalized Spatial Attention Network For Accurate And Fast Classification Of Brain Tumors In 2D T1-Weighted CE-MRI Images

    Authors: Grace Billingsley, Julia Dietlmeier, Vivek Narayanaswamy, Andreas Spanias, Noel E. OConnor

    Abstract: We propose an accurate and fast classification network for classification of brain tumors in MRI images that outperforms all lightweight methods investigated in terms of accuracy. We test our model on a challenging 2D T1-weighted CE-MRI dataset containing three types of brain tumors: Meningioma, Glioma and Pituitary. We introduce an l2-normalized spatial attention mechanism that acts as a regulari… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted to be published in: IEEE International Conference on Image Processing (ICIP), Kuala Lumpur October 8-11, 2023

  2. arXiv:2104.07161  [pdf, other

    cs.SD cs.LG eess.AS

    On the Design of Deep Priors for Unsupervised Audio Restoration

    Authors: Vivek Sivaraman Narayanaswamy, Jayaraman J. Thiagarajan, Andreas Spanias

    Abstract: Unsupervised deep learning methods for solving audio restoration problems extensively rely on carefully tailored neural architectures that carry strong inductive biases for defining priors in the time or spectral domain. In this context, lot of recent success has been achieved with sophisticated convolutional network constructions that recover audio signals in the spectral domain. However, in prac… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

  3. arXiv:2005.13769  [pdf, other

    eess.AS cs.SD stat.ML

    Unsupervised Audio Source Separation using Generative Priors

    Authors: Vivek Narayanaswamy, Jayaraman J. Thiagarajan, Rushil Anirudh, Andreas Spanias

    Abstract: State-of-the-art under-determined audio source separation systems rely on supervised end-end training of carefully tailored neural network architectures operating either in the time or the spectral domain. However, these methods are severely challenged in terms of requiring access to expensive source level labeled data and being specific to a given set of sources and the mixing process, which dema… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

    Comments: 5 pages, 2 figures

  4. arXiv:1904.04161  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Audio Source Separation via Multi-Scale Learning with Dilated Dense U-Nets

    Authors: Vivek Sivaraman Narayanaswamy, Sameeksha Katoch, Jayaraman J. Thiagarajan, Huan Song, Andreas Spanias

    Abstract: Modern audio source separation techniques rely on optimizing sequence model architectures such as, 1D-CNNs, on mixture recordings to generalize well to unseen mixtures. Specifically, recent focus is on time-domain based architectures such as Wave-U-Net which exploit temporal context by extracting multi-scale features. However, the optimality of the feature extraction process in these architectures… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

  5. arXiv:1811.00183  [pdf, other

    stat.ML cs.LG cs.SD eess.AS

    Designing an Effective Metric Learning Pipeline for Speaker Diarization

    Authors: Vivek Sivaraman Narayanaswamy, Jayaraman J. Thiagarajan, Huan Song, Andreas Spanias

    Abstract: State-of-the-art speaker diarization systems utilize knowledge from external data, in the form of a pre-trained distance metric, to effectively determine relative speaker identities to unseen data. However, much of recent focus has been on choosing the appropriate feature extractor, ranging from pre-trained $i-$vectors to representations learned via different sequence modeling architectures (e.g.… ▽ More

    Submitted 31 October, 2018; originally announced November 2018.