Showing 1–16 of 16 results for author: P, A

  1. arXiv:2411.05886  [pdf, other]

    eess.IV cs.CV

    UnDIVE: Generalized Underwater Video Enhancement Using Generative Priors

    Authors: Suhas Srinath, Aditya Chandrasekar, Hemang Jamadagni, Rajiv Soundararajan, Prathosh A P

    Abstract: With the rise of marine exploration, underwater imaging has gained significant attention as a research topic. Underwater video enhancement has become crucial for real-time computer vision tasks in marine exploration. However, most existing methods focus on enhancing individual frames and neglect video temporal dynamics, leading to visually poor enhancements. Furthermore, the lack of ground-truth r…

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: Accepted to IEEE/CVF WACV 2025

  2. arXiv:2404.05763  [pdf]

    eess.IV cs.CV

    Deep Learning-Based Brain Image Segmentation for Automated Tumour Detection

    Authors: Suman Sourabh, Murugappan Valliappan, Narayana Darapaneni, Anwesh R P

    Abstract: Introduction: The present study on the development and evaluation of an automated brain tumor segmentation technique based on deep learning using the 3D U-Net model. Objectives: The objective is to leverage state-of-the-art convolutional neural networks (CNNs) on a large dataset of brain MRI scans for segmentation. Methods: The proposed methodology applies pre-processing techniques for enhanced pe…

    Submitted 6 April, 2024; originally announced April 2024.

  3. arXiv:2404.04635  [pdf]

    eess.IV cs.CV

    A Deep Look Into -- Automated Lung X-Ray Abnormality Detection System

    Authors: Nagullas KS, Vivekanand. V, Narayana Darapaneni, Anwesh R P

    Abstract: Introduction: Automated Lung X-Ray Abnormality Detection System is the application which distinguish the normal x-ray images from infected x-ray images and highlight area considered for prediction, with the recent pandemic a need to have a non-conventional method and faster detecting diseases, for which X ray serves the purpose. Objectives: As of current situation any viral disease that is infectio…

    Submitted 6 April, 2024; originally announced April 2024.

  4. arXiv:2403.08261  [pdf, other]

    cs.CV cs.AI eess.IV

    CoroNetGAN: Controlled Pruning of GANs via Hypernetworks

    Authors: Aman Kumar, Khushboo Anand, Shubham Mandloi, Ashutosh Mishra, Avinash Thakur, Neeraj Kasera, Prathosh A P

    Abstract: Generative Adversarial Networks (GANs) have proven to exhibit remarkable performance and are widely used across many generative computer vision applications. However, the unprecedented demand for the deployment of GANs on resource-constrained edge devices still poses a challenge due to huge number of parameters involved in the generation process. This has led to focused attention on the area of co…

    Submitted 13 March, 2024; originally announced March 2024.

  5. arXiv:2309.12054  [pdf]

    eess.SY

    HiveLink, an IoT based Smart Bee Hive Monitoring System

    Authors: Ajwin Dsouza, Aditya P, Sameer Hegde

    Abstract: HiveLink, the IoT-based Smart Bee Hive Monitoring System addresses the challenges faced by beekeepers in managing the influence of environmental impact, diseases, and collapse in honey bee colonies. Integrated with advanced sensors, the system monitors temperature, humidity, hive weight, and diurnal cycle. Leveraging IoT technology, the system provides real-time data, remote connectivity, and acti…

    Submitted 21 September, 2023; originally announced September 2023.

  6. arXiv:2206.07484  [pdf, ps, other]

    eess.SP cs.AI cs.LG

    Intelligent analysis of EEG signals to assess consumer decisions: A Study on Neuromarketing

    Authors: Nikunj Phutela, Abhilash P, Kaushik Sreevathsan, B N Krupa

    Abstract: Neuromarketing is an emerging field that combines neuroscience and marketing to understand the factors that influence consumer decisions better. The study proposes a method to understand consumers' positive and negative reactions to advertisements (ads) and products by analysing electroencephalogram (EEG) signals. These signals are recorded using a low-cost single electrode headset from volunteers…

    Submitted 29 May, 2022; originally announced June 2022.

    Comments: 7 pages, 6 figures

  7. arXiv:2111.10765  [pdf, other]

    eess.SP

    Orthogonal Delay Scale Space Modulation: A New Technique for Wideband Time-Varying Channels

    Authors: Arunkumar K. P., Chandra R. Murthy

    Abstract: Orthogonal Time Frequency Space (OTFS) modulation is a recently proposed scheme for time-varying narrowband channels in terrestrial radio-frequency communications. Underwater acoustic (UWA) and ultra-wideband (UWB) communication systems, on the other hand, confront wideband time-varying channels. Unlike narrowband channels, for which time contractions or dilations due to Doppler effect can be appr…

    Submitted 8 May, 2022; v1 submitted 21 November, 2021; originally announced November 2021.

    Comments: 18 pages, 21 figures, accepted for publication in the IEEE Transactions on Signal Processing

  8. Fronthaul Compression for Uplink Massive MIMO using Matrix Decomposition

    Authors: Aswathylakshmi P, Radha Krishna Ganti

    Abstract: Massive MIMO opens up attractive possibilities for next generation wireless systems with its large number of antennas offering spatial diversity and multiplexing gain. However, the fronthaul link that connects a massive MIMO Remote Radio Head (RRH) and carries IQ samples to the Baseband Unit (BBU) of the base station can throttle the network capacity/speed if appropriate data compression technique…

    Submitted 24 October, 2021; originally announced October 2021.

    Comments: 7 pages, 3 figures

    Journal ref: Proceedings of the 2022 IEEE Wireless Communications and Networking Conference (WCNC), 2524-2529

  9. arXiv:2109.05494  [pdf, other]

    cs.CL cs.SD eess.AS

    Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages

    Authors: Anoop C S, Prathosh A P, A G Ramakrishnan

    Abstract: Building an automatic speech recognition (ASR) system from scratch requires a large amount of annotated speech data, which is difficult to collect in many languages. However, there are cases where the low-resource language shares a common acoustic space with a high-resource language having enough annotated data to build an ASR. In such cases, we show that the domain-independent acoustic models lea…

    Submitted 16 September, 2021; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: Submitted to ASRU 2021

  10. arXiv:2008.09466  [pdf, other]

    cs.LG eess.IV stat.ML

    RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns

    Authors: Arnab Kumar Mondal, Prathosh A. P

    Abstract: Voice Activity Detection (VAD) refers to the task of identification of regions of human speech in digital signals such as audio and video. While VAD is a necessary first step in many speech processing systems, it poses challenges when there are high levels of ambient noise during the audio recording. To improve the performance of VAD in such conditions, several methods utilizing the visual informa…

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: Accepted in IEEE Sensor Letters

  11. arXiv:2004.10174  [pdf]

    eess.SY cs.CY

    Internet of Things(IoT) Based Multilevel Drunken Driving Detection and Prevention System Using Raspberry Pi 3

    Authors: Viswanatha V, Venkata Siva Reddy R, Ashwini Kumari P, Pradeep Kumar S

    Abstract: In this paper, the proposed system has demonstrated three ways of detecting alcohol level in the body of the car driver and prevent car driver from driving the vehicle by turning off the ignition system. It also sends messages to concerned people. In order to detect breath alcohol level MQ-3 sensor is included in this module along with a heartbeat sensor which can detect the heart beat rate of dri…

    Submitted 21 April, 2020; originally announced April 2020.

  12. arXiv:2003.09293  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    U-Det: A Modified U-Net architecture with bidirectional feature network for lung nodule segmentation

    Authors: Nikhil Varma Keetha, Samson Anosh Babu P, Chandra Sekhara Rao Annavarapu

    Abstract: Early diagnosis and analysis of lung cancer involve a precise and efficient lung nodule segmentation in computed tomography (CT) images. However, the anonymous shapes, visual features, and surroundings of the nodule in the CT image pose a challenging problem to the robust segmentation of the lung nodules. This article proposes U-Det, a resource-efficient model architecture, which is an end to end…

    Submitted 20 March, 2020; originally announced March 2020.

    Comments: 14 pages, 7 figures, 5 tables

  13. arXiv:1911.06298  [pdf]

    q-bio.QM cs.CV eess.IV

    Fetal Head and Abdomen Measurement Using Convolutional Neural Network, Hough Transform, and Difference of Gaussian Revolved along Elliptical Path (Dogell) Algorithm

    Authors: Kezia Irene, Aditya Yudha P., Harlan Haidi, Nurul Faza, Winston Chandra

    Abstract: The number of fetal-neonatal death in Indonesia is still high compared to developed countries. This is caused by the absence of maternal monitoring during pregnancy. This paper presents an automated measurement for fetal head circumference (HC) and abdominal circumference (AC) from the ultrasonography (USG) image. This automated measurement is beneficial to detect early fetal abnormalities during…

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: 5 pages, 9 figures

  14. arXiv:1903.12248  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Adversarial Approximate Inference for Speech to Electroglottograph Conversion

    Authors: Prathosh A. P., Varun Srivastava, Mayank Mishra

    Abstract: Speech produced by human vocal apparatus conveys substantial non-semantic information including the gender of the speaker, voice quality, affective state, abnormalities in the vocal apparatus etc. Such information is attributed to the properties of the voice source signal, which is usually estimated from the speech signal. However, most of the source estimation techniques depend heavily on the goo…

    Submitted 7 September, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing

  15. QR Approximation for Massive MIMO Fronthaul Compression

    Authors: Aswathylakshmi P, Radha Krishna Ganti

    Abstract: Massive MIMO's immense potential to serve large number of users at fast data rates also comes with the caveat of requiring tremendous processing power. This favours a centralized radio access network (C-RAN) architecture that concentrates the processing power at a common baseband unit (BBU) connected to multiple remote radio heads (RRH) via fronthaul links. The high bandwidths of 5G make the front…

    Submitted 12 March, 2019; originally announced March 2019.

    Journal ref: Proceedings of the 2019 IEEE Globecom Workshops, 1-7

  16. arXiv:1804.10147  [pdf, other]

    cs.SD eess.AS

    Detection of Glottal Closure Instants from Raw Speech using Convolutional Neural Networks

    Authors: Mohit Goyal, Varun Srivastava, Prathosh A. P

    Abstract: Glottal Closure Instants (GCIs) correspond to the temporal locations of significant excitation to the vocal tract occurring during the production of voiced speech. GCI detection from speech signals is a well-studied problem given its importance in speech processing. Most of the existing approaches for GCI detection adopt a two-stage approach (i) Transformation of speech signal into a representativ…

    Submitted 9 July, 2019; v1 submitted 26 April, 2018; originally announced April 2018.

    Comments: Updated submission. Figures Added. Accepted in Interspeech 2019