
Showing 1–9 of 9 results for author: Karki, M

  1. arXiv:2408.10128  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language

    Authors: Manjil Karki, Pratik Shakya, Sandesh Acharya, Ravi Pandit, Dinesh Gothe

    Abstract: Voice cloning is a prominent feature in personalized speech interfaces. A neural vocal cloning system can mimic someone's voice using just a few audio samples. Both speaker encoding and speaker adaptation are topics of research in the field of voice cloning. Speaker adaptation relies on fine-tuning a multi-speaker generative model, which involves training a separate model to infer a new speaker em…

    Submitted 23 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: 6 pages, 10 figures

    MSC Class: 91F20 ACM Class: I.2.7

  2. arXiv:2003.13868  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Lesion Conditional Image Generation for Improved Segmentation of Intracranial Hemorrhage from CT Images

    Authors: Manohar Karki, Junghwan Cho, Seokhwan Ko

    Abstract: Data augmentation can effectively resolve a scarcity of images when training machine-learning algorithms. It can make them more robust to unseen images. We present a lesion conditional Generative Adversarial Network LcGAN to generate synthetic Computed Tomography (CT) images for data augmentation. A lesion conditional image (segmented mask) is an input to both the generator and the discriminator o…

    Submitted 30 March, 2020; originally announced March 2020.

  3. DeepSat V2: Feature Augmented Convolutional Neural Nets for Satellite Image Classification

    Authors: Qun Liu, Saikat Basu, Sangram Ganguly, Supratik Mukhopadhyay, Robert DiBiano, Manohar Karki, Ramakrishna Nemani

    Abstract: Satellite image classification is a challenging problem that lies at the crossroads of remote sensing, computer vision, and machine learning. Due to the high variability inherent in satellite data, most of the current object classification approaches are not suitable for handling satellite datasets. The progress of satellite image analytics has also been inhibited by the lack of a single labeled h…

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: This is an Accepted Manuscript of an article published by Taylor & Francis Group in Remote Sensing Letters. arXiv admin note: text overlap with arXiv:1509.03602

  4. arXiv:1806.08037  [pdf, other]

    cs.CV cs.LG

    Pixel-level Reconstruction and Classification for Noisy Handwritten Bangla Characters

    Authors: Manohar Karki, Qun Liu, Robert DiBiano, Saikat Basu, Supratik Mukhopadhyay

    Abstract: Classification techniques for images of handwritten characters are susceptible to noise. Quadtrees can be an efficient representation for learning from sparse features. In this paper, we improve the effectiveness of probabilistic quadtrees by using a pixel level classifier to extract the character pixels and remove noise from handwritten character images. The pixel level denoiser (a deep belief ne…

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: Paper was accepted at the 16th International Conference on Frontiers in Handwriting Recognition (ICFHR 2018)

  5. arXiv:1701.08918  [pdf]

    cs.CV

    Feature Selection based on PCA and PSO for Multimodal Medical Image Fusion using DTCWT

    Authors: Padmavathi K, Mahima Bhat, Maya V Karki

    Abstract: Multimodal medical image fusion helps to increase efficiency in medical diagnosis. This paper presents multimodal medical image fusion by selecting relevant features using Principal Component Analysis (PCA) and Particle Swarm Optimization techniques (PSO). DTCWT is used for decomposition of the images into low and high frequency coefficients. Fusion rules such as combination of minimum, maximum an…

    Submitted 31 January, 2017; originally announced January 2017.

    Comments: 8 pages, 6 figures

  6. arXiv:1612.01981  [pdf, other]

    cs.CV cs.LG

    Core Sampling Framework for Pixel Classification

    Authors: Manohar Karki, Robert DiBiano, Saikat Basu, Supratik Mukhopadhyay

    Abstract: The intermediate map responses of a Convolutional Neural Network (CNN) contain information about an image that can be used to extract contextual knowledge about it. In this paper, we present a core sampling framework that is able to use these activation maps from several layers as features to another neural network using transfer learning to provide an understanding of an input image. Our framewor…

    Submitted 6 December, 2016; originally announced December 2016.

  7. arXiv:1605.02699  [pdf, other]

    cs.CV cs.LG stat.ML

    A Theoretical Analysis of Deep Neural Networks for Texture Classification

    Authors: Saikat Basu, Manohar Karki, Robert DiBiano, Supratik Mukhopadhyay, Sangram Ganguly, Ramakrishna Nemani, Shreekant Gayaka

    Abstract: We investigate the use of Deep Neural Networks for the classification of image datasets where texture features are important for generating class-conditional discriminative representations. To this end, we first derive the size of the feature space for some standard textural features extracted from the input dataset and then use the theory of Vapnik-Chervonenkis dimension to show that hand-crafted…

    Submitted 21 June, 2016; v1 submitted 9 May, 2016; originally announced May 2016.

    Comments: Accepted in International Joint Conference on Neural Networks, IJCNN 2016

  8. arXiv:1509.03602  [pdf, other]

    cs.CV

    DeepSat - A Learning framework for Satellite Imagery

    Authors: Saikat Basu, Sangram Ganguly, Supratik Mukhopadhyay, Robert DiBiano, Manohar Karki, Ramakrishna Nemani

    Abstract: Satellite image classification is a challenging problem that lies at the crossroads of remote sensing, computer vision, and machine learning. Due to the high variability inherent in satellite data, most of the current object classification approaches are not suitable for handling satellite datasets. The progress of satellite image analytics has also been inhibited by the lack of a single labeled h…

    Submitted 11 September, 2015; originally announced September 2015.

    Comments: Paper was accepted at ACM SIGSPATIAL 2015

  9. arXiv:1509.03413  [pdf, other]

    cs.CV

    Learning Sparse Feature Representations using Probabilistic Quadtrees and Deep Belief Nets

    Authors: Saikat Basu, Manohar Karki, Sangram Ganguly, Robert DiBiano, Supratik Mukhopadhyay, Ramakrishna Nemani

    Abstract: Learning sparse feature representations is a useful instrument for solving an unsupervised learning problem. In this paper, we present three labeled handwritten digit datasets, collectively called n-MNIST. Then, we propose a novel framework for the classification of handwritten digits that learns sparse representations using probabilistic quadtrees and Deep Belief Nets. On the MNIST and n-MNIST da…

    Submitted 11 September, 2015; originally announced September 2015.

    Comments: Published in the European Symposium on Artificial Neural Networks, ESANN 2015