Skip to main content

Showing 1–50 of 59 results for author: Monga, V

.
  1. arXiv:2508.03751  [pdf, ps, other

    cs.CV

    Modular Transformer Architecture for Precision Agriculture Imaging

    Authors: Brian Gopalan, Nathalia Nascimento, Vishal Monga

    Abstract: This paper addresses the critical need for efficient and accurate weed segmentation from drone video in precision agriculture. A quality-aware modular deep-learning framework is proposed that addresses common image degradation by analyzing quality conditions-such as blur and noise-and routing inputs through specialized pre-processing and transformer models optimized for each degradation type. The… ▽ More

    Submitted 7 August, 2025; v1 submitted 4 August, 2025; originally announced August 2025.

    Comments: Preprint of paper submitted to IEEE-AIOT 2025

  2. arXiv:2402.12872  [pdf, other

    eess.IV eess.SP

    Deep, convergent, unrolled half-quadratic splitting for image deconvolution

    Authors: Yanan Zhao, Yuelong Li, Haichuan Zhang, Vishal Monga, Yonina C. Eldar

    Abstract: In recent years, algorithm unrolling has emerged as a powerful technique for designing interpretable neural networks based on iterative algorithms. Imaging inverse problems have particularly benefited from unrolling-based deep network design since many traditional model-based approaches rely on iterative optimization. Despite exciting progress, typical unrolling approaches heuristically design lay… ▽ More

    Submitted 25 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted with mandatory minor revisions by Transactions on Computational Imaging

  3. arXiv:2308.14904  [pdf, other

    cs.CV cs.LG

    Maturity-Aware Active Learning for Semantic Segmentation with Hierarchically-Adaptive Sample Assessment

    Authors: Amirsaeed Yazdani, Xuelu Li, Vishal Monga

    Abstract: Active Learning (AL) for semantic segmentation is challenging due to heavy class imbalance and different ways of defining "sample" (pixels, areas, etc.), leaving the interpretation of the data distribution ambiguous. We propose "Maturity-Aware Distribution Breakdown-based Active Learning'' (MADBAL), an AL method that benefits from a hierarchical approach to define a multiview data distribution, wh… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted to the 34th British Machine Vision Conference (BMVC 2023)

    MSC Class: 68-06 ACM Class: I.4.6; I.5.1

  4. Iterative, Deep Synthetic Aperture Sonar Image Segmentation

    Authors: Yung-Chen Sun, Isaac D. Gerg, Vishal Monga

    Abstract: Synthetic aperture sonar (SAS) systems produce high-resolution images of the seabed environment. Moreover, deep learning has demonstrated superior ability in finding robust features for automating imagery analysis. However, the success of deep learning is conditioned on having lots of labeled training data, but obtaining generous pixel-level annotations of SAS imagery is often practically infeasib… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2107.14563

  5. arXiv:2203.09580  [pdf, other

    cs.CV cs.AI

    Surface Defect Detection and Evaluation for Marine Vessels using Multi-Stage Deep Learning

    Authors: Li Yu, Kareem Metwaly, James Z. Wang, Vishal Monga

    Abstract: Detecting and evaluating surface coating defects is important for marine vessel maintenance. Currently, the assessment is carried out manually by qualified inspectors using international standards and their own experience. Automating the processes is highly challenging because of the high level of variation in vessel type, paint surface, coatings, lighting condition, weather condition, paint color… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

  6. arXiv:2203.03079  [pdf, other

    cs.CV cs.LG

    GlideNet: Global, Local and Intrinsic based Dense Embedding NETwork for Multi-category Attributes Prediction

    Authors: Kareem Metwaly, Aerin Kim, Elliot Branson, Vishal Monga

    Abstract: Attaching attributes (such as color, shape, state, action) to object categories is an important computer vision problem. Attribute prediction has seen exciting recent progress and is often formulated as a multi-label classification problem. Yet significant challenges remain in: 1) predicting diverse attributes over multiple categories, 2) modeling attributes-category dependency, 3) capturing both… ▽ More

    Submitted 14 March, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: CVPR 2022, 16 pages (including supplementary), CAR Dataset, VAW Dataset, http://signal.ee.psu.edu/research/glidenet.html

  7. arXiv:2111.08243  [pdf, other

    cs.CV cs.LG cs.RO

    CAR -- Cityscapes Attributes Recognition A Multi-category Attributes Dataset for Autonomous Vehicles

    Authors: Kareem Metwaly, Aerin Kim, Elliot Branson, Vishal Monga

    Abstract: Self-driving vehicles are the future of transportation. With current advancements in this field, the world is getting closer to safe roads with almost zero probability of having accidents and eliminating human errors. However, there is still plenty of research and development necessary to reach a level of robustness. One important aspect is to understand a scene fully including all details. As som… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

  8. arXiv:2108.06637  [pdf, other

    cs.CV

    Deep Algorithm Unrolling for Biomedical Imaging

    Authors: Yuelong Li, Or Bar-Shira, Vishal Monga, Yonina C. Eldar

    Abstract: In this chapter, we review biomedical applications and breakthroughs via leveraging algorithm unrolling, an important technique that bridges between traditional iterative algorithms and modern deep learning techniques. To provide context, we start by tracing the origin of algorithm unrolling and providing a comprehensive tutorial on how to unroll iterative algorithms into deep networks. We then ex… ▽ More

    Submitted 14 August, 2021; originally announced August 2021.

  9. arXiv:2107.14563  [pdf, other

    cs.CV

    Iterative, Deep, and Unsupervised Synthetic Aperture Sonar Image Segmentation

    Authors: Yung-Chen Sun, Isaac D. Gerg, Vishal Monga

    Abstract: Deep learning has not been routinely employed for semantic segmentation of seabed environment for synthetic aperture sonar (SAS) imagery due to the implicit need of abundant training data such methods necessitate. Abundant training data, specifically pixel-level labels for all images, is usually not available for SAS imagery due to the complex logistics (e.g., diver survey, chase boat, precision p… ▽ More

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: IEEE OCEANS 2021

  10. arXiv:2105.02209  [pdf, other

    cs.CV cs.AI

    Physically Inspired Dense Fusion Networks for Relighting

    Authors: Amirsaeed Yazdani, Tiantong Guo, Vishal Monga

    Abstract: Image relighting has emerged as a problem of significant research interest inspired by augmented reality applications. Physics-based traditional methods, as well as black box deep learning models, have been developed. The existing deep networks have exploited training to achieve a new state of the art; however, they may perform poorly when training is limited or does not represent problem phenomen… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: Rank second in NTIRE 2021 One-to-one depth guided image relighting challenge, accepted by CVPRW 2021

  11. arXiv:2104.14713  [pdf, other

    eess.IV

    Simultaneous Denoising and Localization Network for Photoacoustic Target Localization

    Authors: Amirsaeed Yazdani, Sumit Agrawal, Kerrick Johnstonbaugh, Sri-Rajasekhar Kothapalli, Vishal Monga

    Abstract: A significant research problem of recent interest is the localization of targets like vessels, surgical needles, and tumors in photoacoustic (PA) images. To achieve accurate localization, a high photoacoustic signal-to-noise ratio (SNR) is required. However, this is not guaranteed for deep targets, as optical scattering causes an exponential decay in optical fluence with respect to tissue depth. T… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: Accepted by IEEE Transactions on Medical Imaging

  12. arXiv:2104.10705  [pdf, other

    eess.IV cs.CV

    Multi-Class Micro-CT Image Segmentation Using Sparse Regularized Deep Networks

    Authors: Amirsaeed Yazdani, Yung-Chen Sun, Nicholas B. Stephens, Timothy Ryan, Vishal Monga

    Abstract: It is common in anthropology and paleontology to address questions about extant and extinct species through the quantification of osteological features observable in micro-computed tomographic (micro-CT) scans. In cases where remains were buried, the grey values present in these scans may be classified as belonging to air, dirt, or bone. While various intensity-based methods have been proposed to… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: 5 pages, 6 figures, accepted in 2020 54th Asilomar Conference on Signals, Systems, and Computers

  13. arXiv:2103.10312  [pdf, other

    cs.CV

    Real-Time, Deep Synthetic Aperture Sonar (SAS) Autofocus

    Authors: Isaac D. Gerg, Vishal Monga

    Abstract: Synthetic aperture sonar (SAS) requires precise time-of-flight measurements of the transmitted/received waveform to produce well-focused imagery. It is not uncommon for errors in these measurements to be present resulting in image defocusing. To overcome this, an \emph{autofocus} algorithm is employed as a post-processing step after image reconstruction to improve image focus. A particular class o… ▽ More

    Submitted 1 June, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: Four pages. Accepted to IGARSS 2021. Fixed Eq 9

  14. arXiv:2010.15687   

    eess.IV cs.CV

    Deep Autofocus for Synthetic Aperture Sonar

    Authors: Isaac Gerg, Vishal Monga

    Abstract: Synthetic aperture sonar (SAS) requires precise positional and environmental information to produce well-focused output during the image reconstruction step. However, errors in these measurements are commonly present resulting in defocused imagery. To overcome these issues, an \emph{autofocus} algorithm is employed as a post-processing step after image reconstruction for the purpose of improving i… ▽ More

    Submitted 30 July, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: superseded by another work

  15. arXiv:2010.13317  [pdf, other

    cs.CV

    Structural Prior Driven Regularized Deep Learning for Sonar Image Classification

    Authors: Isaac D. Gerg, Vishal Monga

    Abstract: Deep learning has been recently shown to improve performance in the domain of synthetic aperture sonar (SAS) image classification. Given the constant resolution with range of a SAS, it is no surprise that deep learning techniques perform so well. Despite deep learning's recent success, there are still compelling open challenges in reducing the high false alarm rate and enabling success when traini… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: To appear in TGRS, 2021

  16. arXiv:2005.03457  [pdf, other

    cs.CV

    NTIRE 2020 Challenge on NonHomogeneous Dehazing

    Authors: Codruta O. Ancuti, Cosmin Ancuti, Florin-Alexandru Vasluianu, Radu Timofte, Jing Liu, Haiyan Wu, Yuan Xie, Yanyun Qu, Lizhuang Ma, Ziling Huang, Qili Deng, Ju-Chin Chao, Tsung-Shan Yang, Peng-Wen Chen, Po-Min Hsu, Tzu-Yi Liao, Chung-En Sun, Pei-Yuan Wu, Jeonghyeok Do, Jongmin Park, Munchurl Kim, Kareem Metwaly, Xuelu Li, Tiantong Guo, Vishal Monga , et al. (27 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 Challenge on NonHomogeneous Dehazing of images (restoration of rich details in hazy image). We focus on the proposed solutions and their results evaluated on NH-Haze, a novel dataset consisting of 55 pairs of real haze free and nonhomogeneous hazy images recorded outdoor. NH-Haze is the first realistic nonhomogeneous haze dataset that provides ground truth images.… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: CVPR Workshops Proceedings 2020

  17. arXiv:2004.01817  [pdf, other

    cs.CV

    Group Based Deep Shared Feature Learning for Fine-grained Image Classification

    Authors: Xuelu Li, Vishal Monga

    Abstract: Fine-grained image classification has emerged as a significant challenge because objects in such images have small inter-class visual differences but with large variations in pose, lighting, and viewpoints, etc. Most existing work focuses on highly customized feature extraction via deep network architectures which have been shown to deliver state of the art performance. Given that images from dist… ▽ More

    Submitted 3 April, 2020; originally announced April 2020.

  18. arXiv:1912.10557  [pdf, other

    eess.IV cs.CV cs.LG eess.SP

    Algorithm Unrolling: Interpretable, Efficient Deep Learning for Signal and Image Processing

    Authors: Vishal Monga, Yuelong Li, Yonina C. Eldar

    Abstract: Deep neural networks provide unprecedented performance gains in many real world problems in signal and image processing. Despite these gains, future development and practical deployment of deep networks is hindered by their blackbox nature, i.e., lack of interpretability, and by the need for very large training sets. An emerging technique called algorithm unrolling or unfolding offers promise in e… ▽ More

    Submitted 7 August, 2020; v1 submitted 22 December, 2019; originally announced December 2019.

  19. arXiv:1910.10908  [pdf, other

    eess.SP math.NA

    Thresholded Non-Uniform Fourier Frame-Based Reconstruction for Stripmap SAR

    Authors: John McKay, Anne Gelb, Suren Jayasuriya, Vishal Monga

    Abstract: Fourier domain methods are fast algorithms for SAR imaging. They typically involve an interpolation in the frequency domain to re-grid non-uniform data so inverse fast Fourier transforms can be performed. In this paper, we apply a frame reconstruction algorithm, extending the non-uniform fast Fourier transform, to stripmap SAR data. Further, we present an improved thresholded frame reconstruction… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

  20. Deep Retinal Image Segmentation with Regularization Under Geometric Priors

    Authors: Venkateswararao Cherukuri, Vijay Kumar BG, Raja Bala, Vishal Monga

    Abstract: Vessel segmentation of retinal images is a key diagnostic capability in ophthalmology. This problem faces several challenges including low contrast, variable vessel size and thickness, and presence of interfering pathology such as micro-aneurysms and hemorrhages. Early approaches addressing this problem employed hand-crafted filters to capture vessel structures, accompanied by morphological post-p… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Accepted to IEEE TIP

  21. Deep MR Brain Image Super-Resolution Using Spatio-Structural Priors

    Authors: Venkateswararao Cherukuri, Tiantong Guo, Steve. J. Schiff, Vishal Monga

    Abstract: High resolution Magnetic Resonance (MR) images are desired for accurate diagnostics. In practice, image resolution is restricted by factors like hardware and processing constraints. Recently, deep learning methods have been shown to produce compelling state-of-the-art results for image enhancement/super-resolution. Paying particular attention to desired hi-resolution MR image structure, we propose… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: Accepted to IEEE transactions on Image Processing

  22. Adaptive Transform Domain Image Super-resolution Via Orthogonally Regularized Deep Networks

    Authors: Tiantong Guo, Hojjat S. Mousavi, Vishal Monga

    Abstract: Deep learning methods, in particular, trained Convolutional Neural Networks (CNN) have recently been shown to produce compelling results for single image Super-Resolution (SR). Invariably, a CNN is learned to map the Low Resolution (LR) image to its corresponding High Resolution (HR) version in the spatial domain. We propose a novel network structure for learning the SR mapping function in an imag… ▽ More

    Submitted 22 April, 2019; originally announced April 2019.

  23. Transmit MIMO Radar Beampattern Design Via Optimization on the Complex Circle Manifol

    Authors: Khaled Alhujaili, Vishal Monga, Muralidhar Rangaswamy

    Abstract: The ability of Multiple-Input Multiple-Output (MIMO) radar systems to adapt waveforms across antennas allows flexibility in the transmit beampattern design. In cognitive radar, a popular cost function is to minimize the deviation against an idealized beampattern (which is arrived at with knowledge of the environment). The optimization of the transmit beampattern becomes particularly challenging in… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

  24. Robust Alignment for Panoramic Stitching via an Exact Rank Constraint

    Authors: Yuelong Li, Mohammad Tofighi, Vishal Monga

    Abstract: We study the problem of image alignment for panoramic stitching. Unlike most existing approaches that are feature-based, our algorithm works on pixels directly, and accounts for errors across the whole images globally. Technically, we formulate the alignment problem as rank-1 and sparse matrix decomposition over transformed images, and develop an efficient algorithm for solving this challenging no… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

    Comments: Accepted for publication in IEEE Transactions on Image Processing

  25. arXiv:1902.05399  [pdf, other

    cs.CV cs.LG stat.ML

    An Algorithm Unrolling Approach to Deep Image Deblurring

    Authors: Yuelong Li, Mohammad Tofighi, Vishal Monga, Yonina C. Eldar

    Abstract: While neural networks have achieved vastly enhanced performance over traditional iterative methods in many cases, they are generally empirically designed and the underlying structures are difficult to interpret. The algorithm unrolling approach has helped connect iterative algorithms to neural network architectures. However, such connections have not been made yet for blind image deblurring. In th… ▽ More

    Submitted 15 February, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

  26. arXiv:1902.03493  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Deep Algorithm Unrolling for Blind Image Deblurring

    Authors: Yuelong Li, Mohammad Tofighi, Junyi Geng, Vishal Monga, Yonina C. Eldar

    Abstract: Blind image deblurring remains a topic of enduring interest. Learning based approaches, especially those that employ neural networks have emerged to complement traditional model based methods and in many cases achieve vastly enhanced performance. That said, neural network approaches are generally empirically designed and the underlying structures are difficult to interpret. In recent years, a prom… ▽ More

    Submitted 29 May, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

  27. arXiv:1901.07061  [pdf, other

    eess.IV cs.LG stat.ML

    Prior Information Guided Regularized Deep Learning for Cell Nucleus Detection

    Authors: Mohammad Tofighi, Tiantong Guo, Jairam K. P. Vanamala, Vishal Monga

    Abstract: Cell nuclei detection is a challenging research topic because of limitations in cellular image quality and diversity of nuclear morphology, i.e. varying nuclei shapes, sizes, and overlaps between multiple cell nuclei. This has been a topic of enduring interest with promising recent success shown by deep learning methods. These methods train Convolutional Neural Networks (CNNs) with a training set… ▽ More

    Submitted 21 January, 2019; originally announced January 2019.

    Comments: Accepted for Publication

    Journal ref: IEEE Transactions on Medical Imaging, January 2019

  28. arXiv:1811.11627  [pdf, other

    eess.SP

    Spatio-Spectral Radar Beampattern Design for Co-existence with Wireless Communication Systems

    Authors: Bosung Kang, Omar Aldayel, Vishal Monga, Muralidhar Rangaswamy

    Abstract: We address the problem of designing a transmit beampattern for multiple-input multiple-output (MIMO) radar considering co-existence with wireless communication systems. The designed beampattern is able to manage the transmit energy in spatial directions as well as in spectral frequency bands of interest by minimizing the deviation of the designed beampattern versus a desired one under a spectral c… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.

  29. arXiv:1810.02812  [pdf, other

    eess.SP cs.LG stat.ML

    Classifying Multi-channel UWB SAR Imagery via Tensor Sparsity Learning Techniques

    Authors: Tiep Vu, Lam Nguyen, Vishal Monga

    Abstract: Using low-frequency (UHF to L-band) ultra-wideband (UWB) synthetic aperture radar (SAR) technology for detecting buried and obscured targets, e.g. bomb or mine, has been successfully demonstrated recently. Despite promising recent progress, a significant open challenge is to distinguish obscured targets from other (natural and manmade) clutter sources in the scene. The problem becomes exacerbated… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

  30. arXiv:1809.03140  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Deep MR Image Super-Resolution Using Structural Priors

    Authors: Venkateswararao Cherukuri, Tiantong Guo, Steven J. Schiff, Vishal Monga

    Abstract: High resolution magnetic resonance (MR) images are desired for accurate diagnostics. In practice, image resolution is restricted by factors like hardware, cost and processing constraints. Recently, deep learning methods have been shown to produce compelling state of the art results for image super-resolution. Paying particular attention to desired hi-resolution MR image structure, we propose a new… ▽ More

    Submitted 10 September, 2018; originally announced September 2018.

    Comments: Accepted to IEEE ICIP 2018

  31. arXiv:1807.03135  [pdf, other

    cs.CV cs.LG stat.ML

    Deep Networks with Shape Priors for Nucleus Detection

    Authors: Mohammad Tofighi, Tiantong Guo, Jairam K. P. Vanamala, Vishal Monga

    Abstract: Detection of cell nuclei in microscopic images is a challenging research topic, because of limitations in cellular image quality and diversity of nuclear morphology, i.e. varying nuclei shapes, sizes, and overlaps between multiple cell nuclei. This has been a topic of enduring interest with promising recent success shown by deep learning methods. These methods train for example convolutional neura… ▽ More

    Submitted 29 June, 2018; originally announced July 2018.

    Comments: Accepted paper to 2018 IEEE International Conference on Image Processing (ICIP 2018)

  32. arXiv:1803.07220  [pdf, other

    eess.IV

    Collaborative Sparse Priors for Infrared Image Multi-view ATR

    Authors: Xuelu Li, Vishal Monga

    Abstract: Feature extraction from infrared (IR) images remains a challenging task. Learning based methods that can work on raw imagery/patches have therefore assumed significance. We propose a novel multi-task extension of the widely used sparse-representation-classification (SRC) method in both single and multi-view set-ups. That is, the test sample could be a single IR image or images from different views… ▽ More

    Submitted 3 May, 2018; v1 submitted 19 March, 2018; originally announced March 2018.

    Comments: 4 pages, 3 figures, conference paper

  33. arXiv:1802.02721  [pdf, other

    cs.CV

    Deep Image Super Resolution via Natural Image Priors

    Authors: Hojjat S. Mousavi, Tiantong Guo, Vishal Monga

    Abstract: Single image super-resolution (SR) via deep learning has recently gained significant attention in the literature. Convolutional neural networks (CNNs) are typically learned to represent the mapping between low-resolution (LR) and high-resolution (HR) images/patches with the help of training examples. Most existing deep networks for SR produce high quality results when training data is abundant. Ho… ▽ More

    Submitted 8 February, 2018; originally announced February 2018.

  34. arXiv:1802.02018  [pdf, other

    cs.CV

    Orthogonally Regularized Deep Networks For Image Super-resolution

    Authors: Tiantong Guo, Hojjat S. Mousavi, Vishal Monga

    Abstract: Deep learning methods, in particular trained Convolutional Neural Networks (CNNs) have recently been shown to produce compelling state-of-the-art results for single image Super-Resolution (SR). Invariably, a CNN is learned to map the low resolution (LR) image to its corresponding high resolution (HR) version in the spatial domain. Aiming for faster inference and more efficient solutions than solvi… ▽ More

    Submitted 6 February, 2018; originally announced February 2018.

  35. arXiv:1801.05458  [pdf, other

    eess.IV cs.CV

    Deep Network for Simultaneous Decomposition and Classification in UWB-SAR Imagery

    Authors: Tiep Vu, Lam Nguyen, Tiantong Guo, Vishal Monga

    Abstract: Classifying buried and obscured targets of interest from other natural and manmade clutter objects in the scene is an important problem for the U.S. Army. Targets of interest are often represented by signals captured using low-frequency (UHF to L-band) ultra-wideband (UWB) synthetic aperture radar (SAR) technology. This technology has been used in various applications, including ground penetration… ▽ More

    Submitted 22 February, 2018; v1 submitted 16 January, 2018; originally announced January 2018.

  36. arXiv:1801.02548  [pdf, other

    cs.CV

    Bridging the Gap: Simultaneous Fine Tuning for Data Re-Balancing

    Authors: John McKay, Isaac Gerg, Vishal Monga

    Abstract: There are many real-world classification problems wherein the issue of data imbalance (the case when a data set contains substantially more samples for one/many classes than the rest) is unavoidable. While under-sampling the problematic classes is a common solution, this is not a compelling option when the large data class is itself diverse and/or the limited data class is especially small. We sug… ▽ More

    Submitted 8 January, 2018; originally announced January 2018.

    Comments: Submitted to IGARSS 2018, 4 Pages, 8 Figures

  37. arXiv:1712.08227  [pdf, other

    eess.IV

    Analysis-synthesis model learning with shared features: a new framework for histopathological image classification

    Authors: Xuelu Li, Vishal Monga, U. K. Arvind Rao

    Abstract: Automated histopathological image analysis offers exciting opportunities for the early diagnosis of several medical conditions including cancer. There are however stiff practical challenges: 1.) discriminative features from such images for separating diseased vs. healthy classes are not readily apparent, and 2.) distinct classes, e.g. healthy vs. stages of disease continue to share several geometr… ▽ More

    Submitted 21 December, 2017; originally announced December 2017.

    Comments: 2018 ISBI conference accepted paper

  38. arXiv:1712.03993  [pdf, other

    eess.IV

    Learning Based Segmentation of CT Brain Images: Application to Post-Operative Hydrocephalic Scans

    Authors: Venkateswararao Cherukuri, Peter Ssenyonga, Benjamin C. Warf, Abhaya V. Kulkarni, Vishal Monga, Steven J. Schiff

    Abstract: Objective: Hydrocephalus is a medical condition in which there is an abnormal accumulation of cerebrospinal fluid (CSF) in the brain. Segmentation of brain imagery into brain tissue and CSF (before and after surgery, i.e. pre-op vs. postop) plays a crucial role in evaluating surgical treatment. Segmentation of pre-op images is often a relatively straightforward problem and has been well researched… ▽ More

    Submitted 11 December, 2017; originally announced December 2017.

    Comments: IEEE Transactions on Biomedical Engineering, 2018

    Journal ref: IEEE Transactions on Biomedical Engineering 65.8 (2018): 1871-1884

  39. Blind Image Deblurring Using Row-Column Sparse Representations

    Authors: Mohammad Tofighi, Yuelong Li, Vishal Monga

    Abstract: Blind image deblurring is a particularly challenging inverse problem where the blur kernel is unknown and must be estimated en route to recover the deblurred image. The problem is of strong practical relevance since many imaging devices such as cellphone cameras, must rely on deblurring algorithms to yield satisfactory image quality. Despite significant research effort, handling large motions rema… ▽ More

    Submitted 5 December, 2017; originally announced December 2017.

    Comments: Accepted to IEEE Signal Processing Letters, December 2017

  40. arXiv:1707.02336  [pdf, other

    cs.CV

    Fast Stochastic Hierarchical Bayesian MAP for Tomographic Imaging

    Authors: John McKay, Raghu G. Raj, Vishal Monga

    Abstract: Any image recovery algorithm attempts to achieve the highest quality reconstruction in a timely manner. The former can be achieved in several ways, among which are by incorporating Bayesian priors that exploit natural image tendencies to cue in on relevant phenomena. The Hierarchical Bayesian MAP (HB-MAP) is one such approach which is known to produce compelling results albeit at a substantial com… ▽ More

    Submitted 7 July, 2017; originally announced July 2017.

    Comments: 5 Pages, 4 Figures, Conference (Accepted to Asilomar 2017)

  41. arXiv:1706.09858  [pdf, other

    cs.CV

    What's Mine is Yours: Pretrained CNNs for Limited Training Sonar ATR

    Authors: John McKay, Isaac Gerg, Vishal Monga, Raghu Raj

    Abstract: Finding mines in Sonar imagery is a significant problem with a great deal of relevance for seafaring military and commercial endeavors. Unfortunately, the lack of enormous Sonar image data sets has prevented automatic target recognition (ATR) algorithms from some of the same advances seen in other computer vision fields. Namely, the boom in convolutional neural nets (CNNs) which have been able to… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

    Comments: Accepted to OCEANS 2017 - Anchorage (Conference)

  42. Robust Sonar ATR Through Bayesian Pose Corrected Sparse Classification

    Authors: John McKay, Vishal Monga, Raghu G. Raj

    Abstract: Sonar imaging has seen vast improvements over the last few decades due in part to advances in synthetic aperture Sonar (SAS). Sophisticated classification techniques can now be used in Sonar automatic target recognition (ATR) to locate mines and other threatening objects. Among the most promising of these methods is sparse reconstruction-based classification (SRC) which has shown an impressive res… ▽ More

    Submitted 26 June, 2017; originally announced June 2017.

    Comments: 14 Pages, 16 Figures, Accepted TGARS

  43. arXiv:1706.08575  [pdf, other

    math.NA cs.CV

    Using Frame Theoretic Convolutional Gridding for Robust Synthetic Aperture Sonar Imaging

    Authors: John McKay, Anne Gelb, Vishal Monga, Raghu Raj

    Abstract: Recent progress in synthetic aperture sonar (SAS) technology and processing has led to significant advances in underwater imaging, outperforming previously common approaches in both accuracy and efficiency. There are, however, inherent limitations to current SAS reconstruction methodology. In particular, popular and efficient Fourier domain SAS methods require a 2D interpolation which is often ill… ▽ More

    Submitted 26 June, 2017; originally announced June 2017.

    Comments: Accepted to OCEANS 2017 - Anchorage (Conference)

  44. A Maximum A Posteriori Estimation Framework for Robust High Dynamic Range Video Synthesis

    Authors: Yuelong Li, Chul Lee, Vishal Monga

    Abstract: High dynamic range (HDR) image synthesis from multiple low dynamic range (LDR) exposures continues to be actively researched. The extension to HDR video synthesis is a topic of significant current interest due to potential cost benefits. For HDR video, a stiff practical challenge presents itself in the form of accurate correspondence estimation of objects between video frames. In particular, loss… ▽ More

    Submitted 8 December, 2016; originally announced December 2016.

  45. Fast Low-rank Shared Dictionary Learning for Image Classification

    Authors: Tiep Vu, Vishal Monga

    Abstract: Despite the fact that different objects possess distinct class-specific features, they also usually share common patterns. This observation has been exploited partially in a recently proposed dictionary learning framework by separating the particularity and the commonality (COPAR). Inspired by this, we propose a novel method to explicitly and simultaneously learn a set of common patterns as well a… ▽ More

    Submitted 15 July, 2017; v1 submitted 26 October, 2016; originally announced October 2016.

    Comments: Accepted version

  46. arXiv:1610.08495  [pdf, other

    cs.LG stat.ML

    Adaptive matching pursuit for sparse signal recovery

    Authors: Tiep H. Vu, Hojjat S. Mousavi, Vishal Monga

    Abstract: Spike and Slab priors have been of much recent interest in signal processing as a means of inducing sparsity in Bayesian inference. Applications domains that benefit from the use of these priors include sparse recovery, regression and classification. It is well-known that solving for the sparse coefficient vector to maximize these priors results in a hard non-convex and mixed integer programming p… ▽ More

    Submitted 12 September, 2016; originally announced October 2016.

    Comments: ICASSP

  47. Sparsity-based Color Image Super Resolution via Exploiting Cross Channel Constraints

    Authors: Hojjat S. Mousavi, Vishal Monga

    Abstract: Sparsity constrained single image super-resolution (SR) has been of much recent interest. A typical approach involves sparsely representing patches in a low-resolution (LR) input image via a dictionary of example LR patches, and then using the coefficients of this representation to generate the high-resolution (HR) output via an analogous HR dictionary. However, most existing sparse representation… ▽ More

    Submitted 4 October, 2016; originally announced October 2016.

  48. arXiv:1602.05540  [pdf, other

    stat.ME

    Robust Covariance Estimation under Imperfect Constraints using an Expected Likelihood Approach

    Authors: Bosung Kang, Vishal Monga, Muralidhar Rangaswamy, Yuri I. Abramovich

    Abstract: We address the problem of structured covariance matrix estimation for radar space-time adaptive processing (STAP). A priori knowledge of the interference environment has been exploited in many previous works to enable accurate estimators even when training is not generous. Specifically, recent work has shown that employing practical constraints such as the rank of clutter subspace and the conditio… ▽ More

    Submitted 15 February, 2016; originally announced February 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1602.05069

  49. arXiv:1602.00310  [pdf, other

    cs.CV

    Learning a low-rank shared dictionary for object classification

    Authors: Tiep H. Vu, Vishal Monga

    Abstract: Despite the fact that different objects possess distinct class-specific features, they also usually share common patterns. Inspired by this observation, we propose a novel method to explicitly and simultaneously learn a set of common patterns as well as class-specific features for classification. Our dictionary learning framework is hence characterized by both a shared dictionary and particular (c… ▽ More

    Submitted 17 May, 2016; v1 submitted 31 January, 2016; originally announced February 2016.

    Comments: 4 page + 1 reference page

  50. arXiv:1601.03323  [pdf, other

    cs.CV

    Localized Dictionary design for Geometrically Robust Sonar ATR

    Authors: John McKay, Vishal Monga, Raghu Raj

    Abstract: Advancements in Sonar image capture have opened the door to powerful classification schemes for automatic target recognition (ATR. Recent work has particularly seen the application of sparse reconstruction-based classification (SRC) to sonar ATR, which provides compelling accuracy rates even in the presence of noise and blur. Existing sparsity based sonar ATR techniques however assume that the tes… ▽ More

    Submitted 13 January, 2016; originally announced January 2016.

    Comments: Submitted to IGARSS 2016