Showing 1–50 of 108 results for author: Ray, N

  1. arXiv:2505.19774  [pdf, ps, other]

    eess.AS

    DuRep: Dual-Mode Speech Representation Learning via ASR-Aware Distillation

    Authors: Prabash Reddy Male, Swayambhu Nath Ray, Harish Arsikere, Akshat Jaiswal, Prakhar Swarup, Prantik Sen, Debmalya Chakrabarty, K V Vijay Girish, Nikhil Bhave, Frederick Weber, Sambuddha Bhattacharya, Sri Garimella

    Abstract: Recent advancements in speech encoders have drawn attention due to their integration with Large Language Models for various speech tasks. While most research has focused on either causal or full-context speech encoders, there's limited exploration to effectively handle both streaming and non-streaming applications, while achieving state-of-the-art performance. We introduce DuRep, a Dual-mode Speec…

    Submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2408.15187  [pdf, ps, other]

    math.AG

    On Weak bounded negativity conjecture

    Authors: Snehajit Misra, Nabanita Ray

    Abstract: In the first part of this article, we give bounds on self-intersections $C^2$ of integral curves $C$ on blow-ups $Bl_nX$ of surfaces $X$ with the anti-canonical divisor $-K_X$ effective. In the last part, we prove the weak bounded negativity for self-intersections $C^2$ of integral curves $C$ in a family of surfaces $f:Y\longrightarrow B$ where $B$ is a smooth curve.

    Submitted 27 August, 2024; originally announced August 2024.

  3. arXiv:2407.17530  [pdf, other]

    cs.CV

    Learning Instance-Specific Parameters of Black-Box Models Using Differentiable Surrogates

    Authors: Arnisha Khondaker, Nilanjan Ray

    Abstract: Tuning parameters of a non-differentiable or black-box compute is challenging. Existing methods rely mostly on random sampling or grid sampling from the parameter space. Further, with all the current methods, it is not possible to supply any input specific parameters to the black-box. To the best of our knowledge, for the first time, we are able to learn input-specific parameters for a black box i…

    Submitted 26 November, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: 10 pages, 9 figures

  4. arXiv:2405.18684  [pdf, other]

    cs.CV

    Learning Diffeomorphism for Image Registration with Time-Continuous Networks using Semigroup Regularization

    Authors: Mohammadjavad Matinkia, Nilanjan Ray

    Abstract: Diffeomorphic image registration (DIR) is a fundamental task in 3D medical image analysis that seeks topology-preserving deformations between image pairs. To ensure diffeomorphism, a common approach is to model the deformation field as the flow map solution of a differential equation, which is solved using efficient schemes such as scaling and squaring along with multiple smoothness regularization…

    Submitted 16 March, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 27 pages, 11 figures

    ACM Class: I.4.9; I.2.10; F.2.2

  5. arXiv:2405.17006  [pdf, ps, other]

    math.AG

    On Stability of Syzygy Bundles

    Authors: Snehajit Misra, Nabanita Ray

    Abstract: In this article, we investigate the stability of syzygy bundles corresponding to ample and globally generated vector bundles on smooth irreducible projective surfaces.

    Submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2404.00785  [pdf, other]

    cs.CV cs.LG q-bio.NC

    Disentangling Hippocampal Shape Variations: A Study of Neurological Disorders Using Mesh Variational Autoencoder with Contrastive Learning

    Authors: Jakaria Rabbi, Johannes Kiechle, Christian Beaulieu, Nilanjan Ray, Dana Cobzas

    Abstract: This paper presents a comprehensive study focused on disentangling hippocampal shape variations from diffusion tensor imaging (DTI) datasets within the context of neurological disorders. Leveraging a Mesh Variational Autoencoder (VAE) enhanced with Supervised Contrastive Learning, our approach aims to improve interpretability by disentangling two distinct latent variables corresponding to age and…

    Submitted 9 November, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: Length: 26 pages and Accepted for publication in the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2024:030

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 2 (2024)

  7. arXiv:2312.06822  [pdf, other]

    math.AP

    Well-posedness of an evaporation model for a spherical droplet exposed to an air flow

    Authors: Eberhard Bänsch, Martin Doß, Carsten Gräser, Nadja Ray

    Abstract: In this paper, we address the well-posedness of an evaporation model for a spherical liquid droplet taking into account the convective impact of an air flow in the ambient gas phase. From a mathematical perspective, we are dealing with a coupled ODE-PDE system for the droplet radius, the temperature distribution, and the vapor concentration. The nonlinear coupling arises from the evaporation rate…

    Submitted 11 December, 2023; originally announced December 2023.

    MSC Class: 35Q79 (Primary) 35A01; 35K55; 80A19; 80M10 (Secondary)

  8. arXiv:2310.16212  [pdf, other]

    cs.CV

    ShadowSense: Unsupervised Domain Adaptation and Feature Fusion for Shadow-Agnostic Tree Crown Detection from RGB-Thermal Drone Imagery

    Authors: Rudraksh Kapil, Seyed Mojtaba Marvasti-Zadeh, Nadir Erbilgin, Nilanjan Ray

    Abstract: Accurate detection of individual tree crowns from remote sensing data poses a significant challenge due to the dense nature of forest canopy and the presence of diverse environmental variations, e.g., overlapping canopies, occlusions, and varying lighting conditions. Additionally, the lack of data for training robust models adds another limitation in effectively studying complex forest conditions.…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted in IEEE/CVF Winter Applications of Computer Vision (WACV) 2024 main conference! 8 pages (11 with bibliography), 5 figures, 3 tables

  9. arXiv:2308.08864  [pdf, other]

    q-bio.PE

    A discrete-time dynamical model of prey and stage-structured predator with juvenile hunting incorporating negative effects of prey refuge

    Authors: Debasish Bhattacharjee, Nabajit Ray, Dipam Das, Hemanta Kumar Sarmah

    Abstract: This paper examines a discrete predator-prey model that incorporates prey refuge and its detrimental impact on the growth of the prey population. Age structure is taken into account for predator species. Furthermore, juvenile hunting as well as prey counter-attack are also considered. This paper provides a comprehensive analysis of the existence and stability conditions pertaining to all possible…

    Submitted 17 August, 2023; originally announced August 2023.

    MSC Class: 92D25; 92D40; 92D50; 92B05; 39A05; 39A28; 39A30

  10. arXiv:2308.01982  [pdf, other]

    eess.IV cs.CV q-bio.QM

    Predicting Ki67, ER, PR, and HER2 Statuses from H&E-stained Breast Cancer Images

    Authors: Amir Akbarnejad, Nilanjan Ray, Penny J. Barnes, Gilbert Bigras

    Abstract: Despite the advances in machine learning and digital pathology, it is not yet clear if machine learning methods can accurately predict molecular information merely from histomorphology. In a quest to answer this question, we built a large-scale dataset (185538 images) with reliable measurements for Ki67, ER, PR, and HER2 statuses. The dataset is composed of mirrored images of H&E and correspondin…

    Submitted 3 August, 2023; originally announced August 2023.

  11. arXiv:2307.13755  [pdf, other]

    cs.CV cs.AI

    Training-based Model Refinement and Representation Disagreement for Semi-Supervised Object Detection

    Authors: Seyed Mojtaba Marvasti-Zadeh, Nilanjan Ray, Nadir Erbilgin

    Abstract: Semi-supervised object detection (SSOD) aims to improve the performance and generalization of existing object detectors by utilizing limited labeled data and extensive unlabeled data. Despite many advances, recent SSOD methods are still challenged by inadequate model refinement using the classical exponential moving average (EMA) strategy, the consensus of Teacher-Student models in the latter stag…

    Submitted 26 October, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted in IEEE/CVF Winter Applications of Computer Vision (WACV) 2024

  12. arXiv:2306.13236  [pdf, other]

    cs.CV

    Document Image Cleaning using Budget-Aware Black-Box Approximation

    Authors: Ganesh Tata, Katyani Singh, Eric Van Oeveren, Nilanjan Ray

    Abstract: Recent work has shown that by approximating the behaviour of a non-differentiable black-box function using a neural network, the black-box can be integrated into a differentiable training pipeline for end-to-end training. This methodology is termed "differentiable bypass," and a successful application of this method involves training a document preprocessor to improve the performance of a black-b…

    Submitted 22 June, 2023; originally announced June 2023.

  13. Towards Early Prediction of Human iPSC Reprogramming Success

    Authors: Abhineet Singh, Ila Jasra, Omar Mouhammed, Nidheesh Dadheech, Nilanjan Ray, James Shapiro

    Abstract: This paper presents advancements in automated early-stage prediction of the success of reprogramming human induced pluripotent stem cells (iPSCs) as a potential source for regenerative cell therapies. The minuscule success rate of iPSC-reprogramming of around $0.01\%$ to $0.1\%$ makes it labor-intensive, time-consuming, and exorbitantly expensive to generate a stable iPSC line. Since that require…

    Submitted 11 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2023:014

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 2 (2023)

  14. arXiv:2303.13201  [pdf, ps, other]

    math.AG

    Positivity and base loci for vector bundles revisited

    Authors: Mihai Fulger, Nabanita Ray

    Abstract: We give equivalent descriptions for the augmented and diminished base loci of vector bundles in characteristic zero. We show that these base loci behave well under pullback, tensor product, and direct sum. Pathological behavior is observed on some nonsplit exact sequences.

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 16 pages, comments welcomed

    MSC Class: 14F06

  15. arXiv:2303.02857  [pdf, other]

    cs.CV

    Weakly Supervised Realtime Dynamic Background Subtraction

    Authors: Fateme Bahri, Nilanjan Ray

    Abstract: Background subtraction is a fundamental task in computer vision with numerous real-world applications, ranging from object tracking to video surveillance. Dynamic backgrounds pose a significant challenge here. Supervised deep learning-based techniques are currently considered state-of-the-art for this task. However, these methods require pixel-wise ground-truth labels, which can be time-consuming…

    Submitted 5 March, 2023; originally announced March 2023.

    Comments: 10 pages, 3 figures

  16. arXiv:2302.11066  [pdf, other]

    cs.LG cs.AI

    Reinforcement Learning for Block Decomposition of CAD Models

    Authors: Benjamin C. DiPrete, Rao V. Garimella, Cristina Garcia Cardona, Navamita Ray

    Abstract: We present a novel AI-assisted method for decomposing (segmenting) planar CAD (computer-aided design) models into well shaped rectangular blocks as a proof-of-principle of a general decomposition method applicable to complex 2D and 3D CAD models. The decomposed blocks are required for generating good quality meshes (tilings of quadrilaterals or hexahedra) suitable for numerical simulations of phys…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: AAAI-2022 Fall Symposium Series

  17. arXiv:2301.13320  [pdf, other]

    nlin.CG stat.ME

    Parameter estimation for cellular automata

    Authors: Alexey Kazarnikov, Nadja Ray, Heikki Haario, Joona Lappalainen, Andreas Rupp

    Abstract: Self-organizing complex systems can be modeled using cellular automaton models. However, the parametrization of these models is crucial and significantly determines the resulting structural pattern. In this research, we introduce and successfully apply a sound statistical method to estimate these parameters. The decisive difference to earlier applications of such approaches is that, in our case, b…

    Submitted 11 January, 2025; v1 submitted 30 January, 2023; originally announced January 2023.

  18. arXiv:2211.13126  [pdf, other]

    cs.CV cs.AI cs.LG

    Crown-CAM: Interpretable Visual Explanations for Tree Crown Detection in Aerial Images

    Authors: Seyed Mojtaba Marvasti-Zadeh, Devin Goodsman, Nilanjan Ray, Nadir Erbilgin

    Abstract: Visual explanation of "black-box" models allows researchers in explainable artificial intelligence (XAI) to interpret the model's decisions in a human-understandable manner. In this paper, we propose interpretable class activation mapping for tree crown detection (Crown-CAM) that overcomes inaccurate localization & computational complexity of previous methods while generating reliable visual exp…

    Submitted 26 April, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted manuscript in IEEE Geoscience and Remote Sensing Letters (GRSL)

  19. arXiv:2210.03829  [pdf, other]

    cs.LG cs.AI cs.CV

    Early Detection of Bark Beetle Attack Using Remote Sensing and Machine Learning: A Review

    Authors: Seyed Mojtaba Marvasti-Zadeh, Devin Goodsman, Nilanjan Ray, Nadir Erbilgin

    Abstract: This paper provides a comprehensive review of past and current advances in the early detection of bark beetle-induced tree mortality from three primary perspectives: bark beetle & host interactions, RS, and ML/DL. In contrast to prior efforts, this review encompasses all RS systems and emphasizes ML/DL methods to investigate their strengths and weaknesses. We parse existing literature based on mul…

    Submitted 24 November, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: ACM Computing Surveys, 56, 4, Article 97, April 2024. https://doi.org/10.1145/3625387

  20. arXiv:2209.09876  [pdf, ps, other]

    math.PR

    Distance-dependent chase-escape on trees

    Authors: Sarai Hernandez-Torres, Matthew Junge, Naina Ray, Nidhi Ray

    Abstract: We give a necessary and sufficient condition for species coexistence in a parasite-host growth process on infinite $d$-ary trees. The novelty of this work is that the spreading and death rates for hosts depend on the distance to the nearest parasite.

    Submitted 15 November, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: 9 pages, 3 figures

    MSC Class: 60K35

  21. arXiv:2208.13275  [pdf, other]

    eess.IV cs.CV cs.LG

    Unsupervised diffeomorphic cardiac image registration using parameterization of the deformation field

    Authors: Ameneh Sheikhjafari, Deepa Krishnaswamy, Michelle Noga, Nilanjan Ray, Kumaradevan Punithakumar

    Abstract: This study proposes an end-to-end unsupervised diffeomorphic deformable registration framework based on moving mesh parameterization. Using this parameterization, a deformation field can be modeled with its transformation Jacobian determinant and curl of end velocity field. The new model of the deformation field has three important advantages; firstly, it relaxes the need for an explicit regulariz…

    Submitted 28 August, 2022; originally announced August 2022.

    Comments: 12 pages, 6 figures, 4 tables

  22. arXiv:2208.07981  [pdf, other]

    cs.LG cs.HC eess.SP

    Tiny-HR: Towards an interpretable machine learning pipeline for heart rate estimation on edge devices

    Authors: Preetam Anbukarasu, Shailesh Nanisetty, Ganesh Tata, Nilanjan Ray

    Abstract: The focus of this paper is a proof of concept, machine learning (ML) pipeline that extracts heart rate from pressure sensor data acquired on low-power edge devices. The ML pipeline consists of an upsampler neural network, a signal quality classifier, and a 1D-convolutional neural network optimized for efficient and accurate heart rate estimation. The models were designed so the pipeline was less than…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: 10 pages, 6 figures, Preprint Submitted to IEEE Transactions on Consumer Electronics

  23. arXiv:2208.03337  [pdf, other]

    physics.comp-ph cs.LG

    Estimating relative diffusion from 3D micro-CT images using CNNs

    Authors: Stephan Gärttner, Florian Frank, Fabian Woller, Andreas Meier, Nadja Ray

    Abstract: In the past several years, convolutional neural networks (CNNs) have proven their capability to predict characteristic quantities in porous media research directly from pore-space geometries. Due to the frequently observed significant reduction in computation time in comparison to classical computational methods, bulk parameter prediction via CNNs is especially compelling, e.g. for effective diffu…

    Submitted 4 August, 2022; originally announced August 2022.

  24. arXiv:2207.07241  [pdf, other]

    cs.CV

    Classification of Bark Beetle-Induced Forest Tree Mortality using Deep Learning

    Authors: Rudraksh Kapil, Seyed Mojtaba Marvasti-Zadeh, Devin Goodsman, Nilanjan Ray, Nadir Erbilgin

    Abstract: Bark beetle outbreaks can dramatically impact forest ecosystems and services around the world. For the development of effective forest policies and management plans, the early detection of infested trees is essential. Despite the visual symptoms of bark beetle infestation, this task remains challenging, considering overlapping tree crowns and non-homogeneity in crown foliage discolouration. In thi…

    Submitted 21 August, 2022; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Extended abstract submitted to VAIB Workshop at ICPR 2022. 4 pages, 6 figures. The code and results are publicly available at https://github.com/rudrakshkapil09/BarkBeetle-Damage-Classification-DL

  25. arXiv:2207.04512  [pdf, other]

    cs.CV

    Learning-based Monocular 3D Reconstruction of Birds: A Contemporary Survey

    Authors: Seyed Mojtaba Marvasti-Zadeh, Mohammad N. S. Jahromi, Javad Khaghani, Devin Goodsman, Nilanjan Ray, Nadir Erbilgin

    Abstract: In nature, the collective behavior of animals, such as flying birds is dominated by the interactions between individuals of the same species. However, the study of such behavior among the bird species is a complex process that humans cannot perform using conventional visual observational techniques such as focal sampling in nature. For social animals such as birds, the mechanism of group formation…

    Submitted 28 July, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: Accepted in the International Conference on Pattern Recognition Workshops (VAIB'22)

  26. arXiv:2205.11289  [pdf, ps, other]

    math.AG math.RT

    Slope Semistability and Positive cones of Grassmann bundles

    Authors: Snehajit Misra, Nabanita Ray

    Abstract: Let $E$ be a vector bundle of rank $r$ on a smooth complex projective variety $X$. In this article, we compute the nef and pseudoeffective cones of divisors in the Grassmann bundle $Gr_X(k,E)$ parametrizing $k$-dimensional subspaces of the fibers of $E$, where $1\leq k \leq rank(E)$, under assumptions on $X$ as well as on the vector bundle $E$. In particular, we show that nef cone and the pseudoef…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Comments are welcome. arXiv admin note: text overlap with arXiv:2203.07007

  27. arXiv:2205.06655   

    cs.CL cs.SD eess.AS

    Unified Modeling of Multi-Domain Multi-Device ASR Systems

    Authors: Soumyajit Mitra, Swayambhu Nath Ray, Bharat Padi, Arunasish Sen, Raghavendra Bilgi, Harish Arsikere, Shalini Ghosh, Ajay Srinivasamurthy, Sri Garimella

    Abstract: Modern Automatic Speech Recognition (ASR) systems often use a portfolio of domain-specific models in order to get high accuracy for distinct user utterance types across different devices. In this paper, we propose an innovative approach that integrates the different per-domain per-device models into a unified model, using a combination of domain embedding, domain experts, mixture of experts and ad…

    Submitted 13 October, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: We will update the paper completely with our latest experiments and analysis

  28. arXiv:2203.04533  [pdf, ps, other]

    math.AG

    Seshadri constants of parabolic vector bundles

    Authors: Indranil Biswas, Krishna Hanumanthu, Snehajit Misra, Nabanita Ray

    Abstract: Let $X$ be a complex projective variety, and let $E_{\ast}$ be a parabolic vector bundle on $X$. We introduce the notion of \textit{parabolic Seshadri constants} of $E_{\ast}$. It is shown that these constants are analogous to the classical Seshadri constants of vector bundles, in particular, they have parallel definitions and properties. We prove a Seshadri criterion for parabolic ampleness of…

    Submitted 7 June, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: 25 pages; final version; to appear in Documenta Mathematica

    MSC Class: 14C20; 14J60

  29. arXiv:2202.05336  [pdf, other]

    eess.IV cs.CV cs.LG

    Dynamic Background Subtraction by Generative Neural Networks

    Authors: Fateme Bahri, Nilanjan Ray

    Abstract: Background subtraction is a significant task in computer vision and an essential step for many real world applications. One of the challenges for background subtraction methods is dynamic background, which constitutes stochastic movements in some parts of the background. In this paper, we have proposed a new background subtraction method, called DBSGen, which uses two generative neural networks, on…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 8 pages, 5 figures

  30. arXiv:2202.00749  [pdf, other]

    eess.IV cs.CV

    Towards Positive Jacobian: Learn to Postprocess Diffeomorphic Image Registration with Matrix Exponential

    Authors: Soumyadeep Pal, Matthew Tennant, Nilanjan Ray

    Abstract: We present a postprocessing layer for deformable image registration to make a registration field more diffeomorphic by encouraging Jacobians of the transformation to be positive. Diffeomorphic image registration is important for medical imaging studies because of the properties like invertibility, smoothness of the transformation, and topology preservation/non-folding of the grid. Violation of the…

    Submitted 1 February, 2022; originally announced February 2022.

  31. arXiv:2202.00675  [pdf, other]

    eess.IV cs.CV cs.LG

    A training-free recursive multiresolution framework for diffeomorphic deformable image registration

    Authors: Ameneh Sheikhjafari, Michelle Noga, Kumaradevan Punithakumar, Nilanjan Ray

    Abstract: Diffeomorphic deformable image registration is one of the crucial tasks in medical image analysis, which aims to find a unique transformation while preserving the topology and invertibility of the transformation. Deep convolutional neural networks (CNNs) have yielded well-suited approaches for image registration by learning the transformation priors from a large dataset. The improvement in the per…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: 15 pages, 5 figures, 3 tables, 1 algorithm, The International Journal of Research on Intelligent Systems for Real Life Complex Problems

    MSC Class: 68U10; 68T07

    ACM Class: I.4.9

  32. arXiv:2201.13265  [pdf, other]

    math.AP

    Local existence of strong solutions to micro-macro models for reactive transport in evolving porous media

    Authors: Stephan Gärttner, Peter Knabner, Nadja Ray

    Abstract: Two-scale models pose a promising approach in simulating reactive flow and transport in evolving porous media. Classically, homogenized flow and transport equations are solved on the macroscopic scale, while effective parameters are obtained from auxiliary cell problems on possibly evolving reference geometries (micro-scale). Despite their prospective success in rendering lab/field-scale simulatio…

    Submitted 31 January, 2022; originally announced January 2022.

    MSC Class: 35A01; 35B30; 35M30; 35Q49

  33. arXiv:2112.09820  [pdf]

    cs.LG stat.ML

    GPEX, A Framework For Interpreting Artificial Neural Networks

    Authors: Amir Akbarnejad, Gilbert Bigras, Nilanjan Ray

    Abstract: The analogy between Gaussian processes (GPs) and deep artificial neural networks (ANNs) has received a lot of interest, and has shown promise to unbox the blackbox of deep ANNs. Existing theoretical works put strict assumptions on the ANN (e.g. requiring all intermediate layers to be wide, or using specific activation functions). Accommodating those theoretical assumptions is hard in recent deep a…

    Submitted 10 January, 2024; v1 submitted 17 December, 2021; originally announced December 2021.

  34. arXiv:2110.08232  [pdf, other]

    cs.CV cs.LG

    Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction

    Authors: Sara Elkerdawy, Mostafa Elhoushi, Hong Zhang, Nilanjan Ray

    Abstract: Dynamic model pruning is a recent direction that allows for the inference of a different sub-network for each input sample during deployment. However, current dynamic methods rely on learning a continuous channel gating through regularization by inducing sparsity loss. This formulation introduces complexity in balancing different losses (e.g., task loss, regularization loss). In addition, regulariza…

    Submitted 28 June, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  35. arXiv:2109.01818  [pdf, other]

    cs.LG math.NA

    Estimating permeability of 3D micro-CT images by physics-informed CNNs based on DNS

    Authors: Stephan Gärttner, Faruk O. Alpak, Andreas Meier, Nadja Ray, Florian Frank

    Abstract: In recent years, convolutional neural networks (CNNs) have experienced an increasing interest in their ability to perform a fast approximation of effective hydrodynamic parameters in porous media research and applications. This paper presents a novel methodology for permeability prediction from micro-CT scans of geological rock samples. The training data set for CNNs dedicated to permeability pred…

    Submitted 13 April, 2022; v1 submitted 4 September, 2021; originally announced September 2021.

    MSC Class: 05C21; 68T07; 76D07; 76M10; 76S05

  36. arXiv:2106.14622  [pdf, other]

    cs.CL cs.LG

    Timestamping Documents and Beliefs

    Authors: Swayambhu Nath Ray

    Abstract: Most of the textual information available to us is temporally variable. In a world where information is dynamic, time-stamping it is a very important task. Documents are a good source of information and are used for many tasks like sentiment analysis, classification of reviews etc. The knowledge of creation date of documents facilitates several tasks like summarization, event extraction, tempo…

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: Master's Report

    ACM Class: I.2.7

  37. arXiv:2106.06183  [pdf, other]

    eess.AS cs.CL

    Improving RNN-T ASR Performance with Date-Time and Location Awareness

    Authors: Swayambhu Nath Ray, Soumyajit Mitra, Raghavendra Bilgi, Sri Garimella

    Abstract: In this paper, we explore the benefits of incorporating context into a Recurrent Neural Network Transducer (RNN-T) based Automatic Speech Recognition (ASR) model to improve the speech recognition for virtual assistants. Specifically, we use meta information extracted from the time at which the utterance is spoken and the approximate location information to make ASR context aware. We show that these contextua…

    Submitted 16 June, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: To appear in TSD 2021

  38. Novel Deep Learning Architecture for Heart Disease Prediction using Convolutional Neural Network

    Authors: Shadab Hussain, Santosh Kumar Nanda, Susmith Barigidad, Shadab Akhtar, Md Suaib, Niranjan K. Ray

    Abstract: Healthcare is one of the most important aspects of human life. Heart disease is known to be one of the deadliest diseases which is hampering the lives of many people around the world. Heart disease must be detected early so the loss of lives can be prevented. The availability of large-scale data for medical diagnosis has helped develop complex machine learning and deep learning-based models for…

    Submitted 26 December, 2021; v1 submitted 22 May, 2021; originally announced May 2021.

  39. arXiv:2105.07983  [pdf, other]

    cs.CV

    Unknown-box Approximation to Improve Optical Character Recognition Performance

    Authors: Ayantha Randika, Nilanjan Ray, Xiao Xiao, Allegra Latimer

    Abstract: Optical character recognition (OCR) is a widely used pattern recognition application in numerous domains. There are several feature-rich, general-purpose OCR solutions available for consumers, which can provide moderate to excellent accuracy levels. However, accuracy can diminish with difficult and uncommon document domains. Preprocessing of document images can be used to minimize the effect of do…

    Submitted 17 May, 2021; originally announced May 2021.

  40. Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End

    Authors: Swayambhu Nath Ray, Minhua Wu, Anirudh Raju, Pegah Ghahremani, Raghavendra Bilgi, Milind Rao, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Jasha Droppo

    Abstract: Comprehending the overall intent of an utterance helps a listener recognize the individual words spoken. Inspired by this fact, we perform a novel study of the impact of explicitly incorporating intent representations as additional information to improve a recurrent neural network-transducer (RNN-T) based automatic speech recognition (ASR) system. An audio-to-intent (A2I) model encodes the intent…

    Submitted 16 June, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: To appear in Interspeech 2021

    Journal ref: Proc. Interspeech, Sept. 2021, pp. 3455-3459

  41. arXiv:2102.03069  [pdf, other]

    cs.CG

    Foldover-free maps in 50 lines of code

    Authors: Vladimir Garanzha, Igor Kaporin, Liudmila Kudryavtseva, François Protais, Nicolas Ray, Dmitry Sokolov

    Abstract: Mapping a triangulated surface to 2D space (or a tetrahedral mesh to 3D space) is the most fundamental problem in geometry processing. In computational physics, untangling plays an important role in mesh generation: it takes a mesh as an input, and moves the vertices to get rid of foldovers. In fact, mesh untangling can be considered as a special case of mapping where the geometry of the object is t…

    Submitted 5 February, 2021; originally announced February 2021.

  42. arXiv:2012.00058  [pdf]

    cs.LG cs.DB

    PMLB v1.0: An open source dataset collection for benchmarking machine learning methods

    Authors: Joseph D. Romano, Trang T. Le, William La Cava, John T. Gregg, Daniel J. Goldberg, Natasha L. Ray, Praneel Chakraborty, Daniel Himmelstein, Weixuan Fu, Jason H. Moore

    Abstract: Motivation: Novel machine learning and statistical modeling studies rely on standardized comparisons to existing methods using well-studied benchmark datasets. Few tools exist that provide rapid access to many of these datasets through a standardized, user-friendly interface that integrates well with popular data science workflows. Results: This release of PMLB provides the largest collection of…

    Submitted 6 April, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: 4 pages, 1 figure. *: These authors contributed equally

    ACM Class: H.2.8

  43. arXiv:2009.06232  [pdf, ps, other]

    math.AG

    Stability and semi-stability of (2,2)-type surfaces

    Authors: A. J. Parameswaran, Nabanita Ray

    Abstract: We describe the GIT compactification of the moduli of (2,2)-type effective divisors of $\mathbb{P}^1\times\mathbb{P}^2$ (i.e., surfaces of the linear system $\vert \pi_1^*\mathcal{O}_{\mathbb{P}^1}(2)\otimes \pi_2^*\mathcal{O}_{\mathbb{P}^2}(2)\vert$) which are generically Del Pezzo surfaces of degree two. In order to get the compactification, we characterize stable and semi-stable (2,2)-type surface…

    Submitted 30 March, 2023; v1 submitted 14 September, 2020; originally announced September 2020.

    Comments: Comments are welcome! 28 pages

  44. arXiv:2008.04428  [pdf, other]

    cs.CV

    Locating Cephalometric X-Ray Landmarks with Foveated Pyramid Attention

    Authors: Logan Gilmour, Nilanjan Ray

    Abstract: CNNs, initially inspired by human vision, differ in a key way: they sample uniformly, rather than with highest density in a focal point. For very large images, this makes training untenable, as the memory and computation required for activation maps scale quadratically with the side length of an image. We propose an image pyramid based approach that extracts narrow glimpses of the input im…

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: Presented at MIDL 2020

  45. arXiv:2007.13683  [pdf, other]

    cs.CV

    Ordinary Differential Equation and Complex Matrix Exponential for Multi-resolution Image Registration

    Authors: Abhishek Nan, Matthew Tennant, Uriel Rubin, Nilanjan Ray

    Abstract: Autograd-based software packages have recently renewed interest in image registration using homography and other geometric models by gradient descent and optimization, e.g., AirLab and DRMIME. In this work, we emphasize using complex matrix exponential (CME) over real matrix exponential to compute transformation matrices. CME is theoretically more suitable and practically provides faster conver…

    Submitted 27 July, 2020; originally announced July 2020.

    Comments: Software: https://github.com/abnan/ODECME

  46. arXiv:2007.05667  [pdf, other]

    cs.CV

    To Filter Prune, or to Layer Prune, That Is The Question

    Authors: Sara Elkerdawy, Mostafa Elhoushi, Abhineet Singh, Hong Zhang, Nilanjan Ray

    Abstract: Recent advances in pruning of neural networks have made it possible to remove a large number of filters or weights without any perceptible drop in accuracy. The number of parameters and that of FLOPs are usually the reported metrics to measure the quality of the pruned models. However, the gain in speed for these pruned models is often overlooked in the literature due to the complex nature of late…

    Submitted 8 November, 2020; v1 submitted 10 July, 2020; originally announced July 2020.

  47. arXiv:2003.09085  [pdf, other]

    cs.CV cs.LG

    Small-Object Detection in Remote Sensing Images with End-to-End Edge-Enhanced GAN and Object Detector Network

    Authors: Jakaria Rabbi, Nilanjan Ray, Matthias Schubert, Subir Chowdhury, Dennis Chao

    Abstract: The detection performance of small objects in remote sensing images is not satisfactory compared to large objects, especially in low-resolution and noisy images. A generative adversarial network (GAN)-based model called enhanced super-resolution GAN (ESRGAN) shows remarkable image enhancement performance, but reconstructed images miss high-frequency edge information. Therefore, object detection pe…

    Submitted 28 April, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: This paper contains 27 pages and has been accepted for publication in the MDPI Remote Sensing journal. GitHub Repository: https://github.com/Jakaria08/EESRGAN (Implementation)

  48. arXiv:2001.09865  [pdf, other]

    cs.CV

    DRMIME: Differentiable Mutual Information and Matrix Exponential for Multi-Resolution Image Registration

    Authors: Abhishek Nan, Matthew Tennant, Uriel Rubin, Nilanjan Ray

    Abstract: In this work, we present a novel unsupervised image registration algorithm. It is differentiable end-to-end and can be used for both multi-modal and mono-modal registration. This is done using mutual information (MI) as a metric. The novelty here is that rather than using traditional ways of approximating MI, we use a neural estimator called MINE and supplement it with matrix exponential for trans…

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: Software: https://github.com/abnan/DRMIME

  49. arXiv:1910.11443  [pdf, other]

    cs.CV

    Animal Detection in Man-made Environments

    Authors: Abhineet Singh, Marcin Pietrasik, Gabriell Natha, Nehla Ghouaiel, Ken Brizel, Nilanjan Ray

    Abstract: Automatic detection of animals that have strayed into human inhabited areas has important security and road safety applications. This paper attempts to solve this problem using deep learning techniques from a variety of computer vision fields including object detection, tracking, segmentation and edge detection. Several interesting insights into transfer learning are elicited while adapting models…

    Submitted 14 January, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: to appear in WACV 2020, supplementary: [http://webdocs.cs.ualberta.ca/~vis/asingh1/docs/animal_detection_supp.pdf], demo: [https://youtu.be/ZkjcP8s0QVQ]

  50. arXiv:1910.05038  [pdf, ps, other]

    math.AG

    Weyl and Zariski chambers on projective surfaces

    Authors: Krishna Hanumanthu, Nabanita Ray

    Abstract: Let $X$ be a nonsingular complex projective surface. The Weyl and Zariski chambers give two interesting decompositions of the big cone of $X$. We study these two decompositions and determine when a Weyl chamber is contained in the interior of a Zariski chamber and vice versa. We also determine when a Weyl chamber can intersect non-trivially with a Zariski chamber.

    Submitted 28 April, 2020; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: 15 pages; minor revisions, a new reference; to appear in Forum Mathematicum

    MSC Class: 14C20