Skip to main content

Showing 1–4 of 4 results for author: Rajan, P

Searching in archive eess. Search in all archives.
.
  1. arXiv:2111.10897  [pdf, other

    cs.SD cs.AI cs.LG eess.AS eess.SP

    Health Monitoring of Industrial machines using Scene-Aware Threshold Selection

    Authors: Arshdeep Singh, Raju Arvind, Padmanabhan Rajan

    Abstract: This paper presents an autoencoder based unsupervised approach to identify anomaly in an industrial machine using sounds produced by the machine. The proposed framework is trained using log-melspectrogram representations of the sound signal. In classification, our hypothesis is that the reconstruction error computed for an abnormal machine is larger than that of the a normal machine, since only no… ▽ More

    Submitted 21 November, 2021; originally announced November 2021.

    Comments: 5 pages, 4 figures, 1 Table

  2. arXiv:1903.10713  [pdf, other

    eess.AS cs.LG cs.SD

    Multiscale CNN based Deep Metric Learning for Bioacoustic Classification: Overcoming Training Data Scarcity Using Dynamic Triplet Loss

    Authors: Anshul Thakur, Daksh Thapar, Padmanabhan Rajan, Aditya Nigam

    Abstract: This paper proposes multiscale convolutional neural network (CNN)-based deep metric learning for bioacoustic classification, under low training data conditions. The proposed CNN is characterized by the utilization of four different filter sizes at each level to analyze input feature maps. This multiscale nature helps in describing different bioacoustic events effectively: smaller filters help in l… ▽ More

    Submitted 27 March, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

    Comments: Under Review at JASA. Primitive version of paper. We are still working on getting better performances out of the comparative methods

  3. arXiv:1902.09765  [pdf, other

    eess.AS cs.LG cs.SD

    Directional Embedding Based Semi-supervised Framework For Bird Vocalization Segmentation

    Authors: Anshul Thakur, Padmanabhan Rajan

    Abstract: This paper proposes a data-efficient, semi-supervised, two-pass framework for segmenting bird vocalizations. The framework utilizes a binary classification model to categorize frames of an input audio recording into the background or bird vocalization. The first pass of the framework automatically generates training labels from the input recording itself, while model training and classification is… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: Accepted for publication in Applied Acoustics

  4. arXiv:1902.02498  [pdf, other

    eess.AS cs.LG cs.SD

    Conv-codes: Audio Hashing For Bird Species Classification

    Authors: Anshul Thakur, Pulkit Sharma, Vinayak Abrol, Padmanabhan Rajan

    Abstract: In this work, we propose a supervised, convex representation based audio hashing framework for bird species classification. The proposed framework utilizes archetypal analysis, a matrix factorization technique, to obtain convex-sparse representations of a bird vocalization. These convex representations are hashed using Bloom filters with non-cryptographic hash functions to obtain compact binary co… ▽ More

    Submitted 7 February, 2019; originally announced February 2019.

    Comments: Accepted for presentation at ICASSP 2019