Skip to main content

Showing 1–50 of 107 results for author: Raman, S

Searching in archive cs. Search in all archives.
.
  1. Fingerprinting Deep Packet Inspection Devices by Their Ambiguities

    Authors: Diwen Xue, Armin Huremagic, Wayne Wang, Ram Sundara Raman, Roya Ensafi

    Abstract: Users around the world face escalating network interference such as censorship, throttling, and interception, largely driven by the commoditization and growing availability of Deep Packet Inspection (DPI) devices. Once reserved for a few well-resourced nation-state actors, the ability to interfere with traffic at scale is now within reach of nearly any network operator. Despite this proliferation,… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

    Comments: In: Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security, 2025

  2. arXiv:2509.04047  [pdf, ps, other

    cs.GR cs.CV cs.LG

    TensoIS: A Step Towards Feed-Forward Tensorial Inverse Subsurface Scattering for Perlin Distributed Heterogeneous Media

    Authors: Ashish Tiwari, Satyam Bhardwaj, Yash Bachwana, Parag Sarvoday Sahu, T. M. Feroz Ali, Bhargava Chintalapati, Shanmuganathan Raman

    Abstract: Estimating scattering parameters of heterogeneous media from images is a severely under-constrained and challenging problem. Most of the existing approaches model BSSRDF either through an analysis-by-synthesis approach, approximating complex path integrals, or using differentiable volume rendering techniques to account for heterogeneity. However, only a few studies have applied learning-based meth… ▽ More

    Submitted 4 September, 2025; originally announced September 2025.

    Comments: To appear in Pacific Graphics 2025 (CGF Journal Track), Project page: https://yashbachwana.github.io/TensoIS/

  3. arXiv:2508.18944  [pdf, ps, other

    cs.GR cs.CV

    PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads

    Authors: Shashikant Verma, Shanmuganathan Raman

    Abstract: Achieving realistic hair strand synthesis is essential for creating lifelike digital humans, but producing high-fidelity hair strand geometry remains a significant challenge. Existing methods require a complex setup for data acquisition, involving multi-view images captured in constrained studio environments. Additionally, these methods have longer hair volume estimation and strand synthesis times… ▽ More

    Submitted 26 August, 2025; originally announced August 2025.

  4. arXiv:2508.07536  [pdf, ps, other

    cs.LG

    Physics-Informed Multimodal Bearing Fault Classification under Variable Operating Conditions using Transfer Learning

    Authors: Tasfiq E. Alam, Md Manjurul Ahsan, Shivakumar Raman

    Abstract: Accurate and interpretable bearing fault classification is critical for ensuring the reliability of rotating machinery, particularly under variable operating conditions where domain shifts can significantly degrade model performance. This study proposes a physics-informed multimodal convolutional neural network (CNN) with a late fusion architecture, integrating vibration and motor current signals… ▽ More

    Submitted 10 August, 2025; originally announced August 2025.

  5. arXiv:2508.06845  [pdf, ps, other

    cs.CV cs.CE eess.IV

    Hybrid Machine Learning Framework for Predicting Geometric Deviations from 3D Surface Metrology

    Authors: Hamidreza Samadi, Md Manjurul Ahsan, Shivakumar Raman

    Abstract: This study addresses the challenge of accurately forecasting geometric deviations in manufactured components using advanced 3D surface analysis. Despite progress in modern manufacturing, maintaining dimensional precision remains difficult, particularly for complex geometries. We present a methodology that employs a high-resolution 3D scanner to acquire multi-angle surface data from 237 components… ▽ More

    Submitted 9 August, 2025; originally announced August 2025.

  6. COT-AD: Cotton Analysis Dataset

    Authors: Akbar Ali, Mahek Vyas, Soumyaratna Debnath, Chanda Grover Kamra, Jaidev Sanjay Khalane, Reuben Shibu Devanesan, Indra Deep Mastan, Subramanian Sankaranarayanan, Pankaj Khanna, Shanmuganathan Raman

    Abstract: This paper presents COT-AD, a comprehensive Dataset designed to enhance cotton crop analysis through computer vision. Comprising over 25,000 images captured throughout the cotton growth cycle, with 5,000 annotated images, COT-AD includes aerial imagery for field-scale detection and segmentation and high-resolution DSLR images documenting key diseases. The annotations cover pest and disease recogni… ▽ More

    Submitted 24 July, 2025; originally announced July 2025.

    Comments: Dataset publicly available at: https://ieee-dataport.org/documents/cot-adcotton-analysis-dataset. Accepted to IEEE International Conference on Image Processing (ICIP) 2025

    ACM Class: I.4.9; I.5.4; H.2.8

  7. arXiv:2507.05653  [pdf, ps, other

    cs.DC

    AAPA: An Archetype-Aware Predictive Autoscaler with Uncertainty Quantification for Serverless Workloads on Kubernetes

    Authors: Guilin Zhang, Srinivas Vippagunta, Raghavendra Nandagopal, Suchitra Raman, Jeff Xu, Marcus Pfeiffer, Shreeshankar Chatterjee, Ziqi Tan, Wulan Guo, Hailong Jiang

    Abstract: Serverless platforms such as Kubernetes are increasingly adopted in high-performance computing, yet autoscaling remains challenging under highly dynamic and heterogeneous workloads. Existing approaches often rely on uniform reactive policies or unconditioned predictive models, ignoring both workload semantics and prediction uncertainty. We present AAPA, an archetype-aware predictive autoscaler tha… ▽ More

    Submitted 16 July, 2025; v1 submitted 8 July, 2025; originally announced July 2025.

    Comments: 6 pages, 4 figures, 1 table. First three authors contributed equally. Correspondence to Hailong Jiang

  8. arXiv:2506.22850  [pdf, ps, other

    cs.CV

    DMD-Net: Deep Mesh Denoising Network

    Authors: Aalok Gangopadhyay, Shashikant Verma, Shanmuganathan Raman

    Abstract: We present Deep Mesh Denoising Network (DMD-Net), an end-to-end deep learning framework, for solving the mesh denoising problem. DMD-Net consists of a Graph Convolutional Neural Network in which aggregation is performed in both the primal as well as the dual graph. This is realized in the form of an asymmetric two-stream network, which contains a primal-dual fusion block that enables communication… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  9. arXiv:2506.22833  [pdf, ps, other

    cs.CV

    SemFaceEdit: Semantic Face Editing on Generative Radiance Manifolds

    Authors: Shashikant Verma, Shanmuganathan Raman

    Abstract: Despite multiple view consistency offered by 3D-aware GAN techniques, the resulting images often lack the capacity for localized editing. In response, generative radiance manifolds emerge as an efficient approach for constrained point sampling within volumes, effectively reducing computational demands and enabling the learning of fine details. This work introduces SemFaceEdit, a novel method that… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  10. arXiv:2506.18172  [pdf, ps, other

    eess.IV cs.AI cs.CV

    STACT-Time: Spatio-Temporal Cross Attention for Cine Thyroid Ultrasound Time Series Classification

    Authors: Irsyad Adam, Tengyue Zhang, Shrayes Raman, Zhuyu Qiu, Brandon Taraku, Hexiang Feng, Sile Wang, Ashwath Radhachandran, Shreeram Athreya, Vedrana Ivezic, Peipei Ping, Corey Arnold, William Speier

    Abstract: Thyroid cancer is among the most common cancers in the United States. Thyroid nodules are frequently detected through ultrasound (US) imaging, and some require further evaluation via fine-needle aspiration (FNA) biopsy. Despite its effectiveness, FNA often leads to unnecessary biopsies of benign nodules, causing patient discomfort and anxiety. To address this, the American College of Radiology Thy… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

  11. arXiv:2505.21385  [pdf, ps, other

    cs.HC

    Dynamic Vision from EEG Brain Recordings, How much does EEG know?

    Authors: Prajwal Singh, Anupam Sharma, Pankaj Pandey, Krishna Miyapuram, Shanmuganathan Raman

    Abstract: Reconstructing dynamic visual stimuli from brain EEG recordings is challenging due to the non-stationary and noisy nature of EEG signals and the limited availability of EEG-video datasets. Prior work has largely focused on static image reconstruction, leaving the open question of whether EEG carries sufficient information for dynamic video decoding. In this work, we present EEGVid, a framework tha… ▽ More

    Submitted 22 September, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  12. Hand Shadow Art: A Differentiable Rendering Perspective

    Authors: Aalok Gangopadhyay, Prajwal Singh, Ashish Tiwari, Shanmuganathan Raman

    Abstract: Shadow art is an exciting form of sculptural art that produces captivating artistic effects through the 2D shadows cast by 3D shapes. Hand shadows, also known as shadow puppetry or shadowgraphy, involve creating various shapes and figures using your hands and fingers to cast meaningful shadows on a wall. In this work, we propose a differentiable rendering-based approach to deform hand models such… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Published in Pacific Graphics 2023

  13. arXiv:2505.14892  [pdf, ps, other

    cs.CL cs.AI

    Scaling Laws for State Dynamics in Large Language Models

    Authors: Jacob X Li, Shreyas S Raman, Jessica Wan, Fahad Samman, Jazlyn Lin

    Abstract: Large Language Models (LLMs) are increasingly used in tasks requiring internal state tracking, yet their ability to model state transition dynamics remains poorly understood. We evaluate how well LLMs capture deterministic state dynamics across 3 domains: Box Tracking, Abstract DFA Sequences, and Complex Text Games, each formalizable as a finite-state system. Across tasks, we find that next-state… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 16 pages; 23 figures

    ACM Class: I.2.7; I.2.1; I.2.4; I.5.4

  14. arXiv:2504.02465  [pdf, other

    cs.GR cs.CV

    RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects

    Authors: Soumyaratna Debnath, Ashish Tiwari, Kaustubh Sadekar, Shanmuganathan Raman

    Abstract: Recent advancements in learning-based methods have opened new avenues for exploring and interpreting art forms, such as shadow art, origami, and sketch art, through computational models. One notable visual art form is 3D Anamorphic Art in which an ensemble of arbitrarily shaped 3D objects creates a realistic and meaningful expression when observed from a particular viewpoint and loses its coherenc… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: Conference on Computer Vision and Pattern Recognition (CVPR) 2025

  15. arXiv:2503.13344  [pdf, other

    cs.CV

    STEP: Simultaneous Tracking and Estimation of Pose for Animals and Humans

    Authors: Shashikant Verma, Harish Katti, Soumyaratna Debnath, Yamuna Swamy, Shanmuganathan Raman

    Abstract: We introduce STEP, a novel framework utilizing Transformer-based discriminative model prediction for simultaneous tracking and estimation of pose across diverse animal species and humans. We are inspired by the fact that the human brain exploits spatiotemporal continuity and performs concurrent localization and pose estimation despite the specialization of brain areas for form and motion processin… ▽ More

    Submitted 20 March, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

  16. arXiv:2502.17524  [pdf, other

    eess.SP cs.AI cs.LG

    Multimodal Bearing Fault Classification Under Variable Conditions: A 1D CNN with Transfer Learning

    Authors: Tasfiq E. Alam, Md Manjurul Ahsan, Shivakumar Raman

    Abstract: Bearings play an integral role in ensuring the reliability and efficiency of rotating machinery - reducing friction and handling critical loads. Bearing failures that constitute up to 90% of mechanical faults highlight the imperative need for reliable condition monitoring and fault detection. This study proposes a multimodal bearing fault classification approach that relies on vibration and motor… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  17. arXiv:2501.01174  [pdf, other

    cs.CV cs.AI

    L3D-Pose: Lifting Pose for 3D Avatars from a Single Camera in the Wild

    Authors: Soumyaratna Debnath, Harish Katti, Shashikant Verma, Shanmuganathan Raman

    Abstract: While 2D pose estimation has advanced our ability to interpret body movements in animals and primates, it is limited by the lack of depth information, constraining its application range. 3D pose estimation provides a more comprehensive solution by incorporating spatial depth, yet creating extensive 3D pose datasets for animals is challenging due to their dynamic and unpredictable behaviours in nat… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)

  18. arXiv:2412.16942  [pdf, other

    cs.CV

    BloomCoreset: Fast Coreset Sampling using Bloom Filters for Fine-Grained Self-Supervised Learning

    Authors: Prajwal Singh, Gautam Vashishtha, Indra Deep Mastan, Shanmuganathan Raman

    Abstract: The success of deep learning in supervised fine-grained recognition for domain-specific tasks relies heavily on expert annotations. The Open-Set for fine-grained Self-Supervised Learning (SSL) problem aims to enhance performance on downstream tasks by strategically sampling a subset of images (the Core-Set) from a large pool of unlabeled data (the Open-Set). In this paper, we propose a novel metho… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

    Comments: Accepted at ICASSP 2025

  19. arXiv:2412.16860  [pdf, other

    eess.IV cs.CV

    Diffusion-Based Approaches in Medical Image Generation and Analysis

    Authors: Abdullah al Nomaan Nafi, Md. Alamgir Hossain, Rakib Hossain Rifat, Md Mahabub Uz Zaman, Md Manjurul Ahsan, Shivakumar Raman

    Abstract: Data scarcity in medical imaging poses significant challenges due to privacy concerns. Diffusion models, a recent generative modeling technique, offer a potential solution by generating synthetic and realistic data. However, questions remain about the performance of convolutional neural network (CNN) models on original and synthetic datasets. If diffusion-generated samples can help CNN models perf… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

  20. arXiv:2412.05313  [pdf, ps, other

    cs.RO cs.AI cs.LG

    λ: A Benchmark for Data-Efficiency in Long-Horizon Indoor Mobile Manipulation Robotics

    Authors: Ahmed Jaafar, Shreyas Sundara Raman, Sudarshan Harithas, Yichen Wei, Sofia Juliani, Anneke Wernerfelt, Benedict Quartey, Ifrah Idrees, Jason Xinyu Liu, Stefanie Tellex

    Abstract: Learning to execute long-horizon mobile manipulation tasks is crucial for advancing robotics in household and workplace settings. However, current approaches are typically data-inefficient, underscoring the need for improved models that require realistically sized benchmarks to evaluate their efficiency. To address this, we introduce the LAMBDA (λ) benchmark-Long-horizon Actions for Mobile-manipul… ▽ More

    Submitted 1 August, 2025; v1 submitted 28 November, 2024; originally announced December 2024.

    Comments: Accepted to IROS 2025. Sudarshan Harithas and Yichen Wei contributed equally. 8 pages. 7 figures

  21. arXiv:2411.19903  [pdf, ps, other

    cs.CV cs.LG

    Incremental Multi-Scene Modeling via Continual Neural Graphics Primitives

    Authors: Prajwal Singh, Ashish Tiwari, Gautam Vashishtha, Shanmuganathan Raman

    Abstract: Neural radiance fields (NeRF) have revolutionized photorealistic rendering of novel views for 3D scenes. Despite their growing popularity and efficiency as 3D resources, NeRFs face scalability challenges due to the need for separate models per scene and the cumulative increase in training time for multiple scenes. The potential for incrementally encoding multiple 3D scenes into a single NeRF model… ▽ More

    Submitted 26 August, 2025; v1 submitted 29 November, 2024; originally announced November 2024.

  22. ScribGen: Generating Scribble Art Through Metaheuristics

    Authors: Soumyaratna Debnath, Ashish Tiwari, Shanmuganathan Raman

    Abstract: Art has long been a medium for individuals to engage with the world. Scribble art, a form of abstract visual expression, features spontaneous, gestural strokes made with pens or brushes. These dynamic and expressive compositions, created quickly and impulsively, reveal intricate patterns and hidden meanings upon closer inspection. While scribble art is often associated with spontaneous expression… ▽ More

    Submitted 13 November, 2024; originally announced November 2024.

    Comments: SIGGRAPH Asia 2024

  23. arXiv:2411.05286  [pdf, other

    cs.CE

    Metrology and Manufacturing-Integrated Digital Twin (MM-DT) for Advanced Manufacturing: Insights from CMM and FARO Arm Measurements

    Authors: Hamidreza Samadi, Md Manjurul Ahsan, Shivakumar Raman

    Abstract: Metrology, the science of measurement, plays a key role in Advanced Manufacturing (AM) to ensure quality control, process optimization, and predictive maintenance. However, it has often been overlooked in AM domains due to the current focus on automation and the complexity of integrated precise measurement systems. Over the years, Digital Twin (DT) technology in AM has gained much attention due to… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

  24. arXiv:2411.01299  [pdf, other

    cs.CE

    PMI-DT: Leveraging Digital Twins and Machine Learning for Predictive Modeling and Inspection in Manufacturing

    Authors: Chas Hamel, Md Manjurul Ahsan, Shivakumar Raman

    Abstract: Over the years, Digital Twin (DT) has become popular in Advanced Manufacturing (AM) due to its ability to improve production efficiency and quality. By creating virtual replicas of physical assets, DTs help in real-time monitoring, develop predictive models, and improve operational performance. However, integrating data from physical systems into reliable predictive models, particularly in precisi… ▽ More

    Submitted 2 November, 2024; originally announced November 2024.

  25. arXiv:2409.02716  [pdf, other

    cs.CV

    LIPIDS: Learning-based Illumination Planning In Discretized (Light) Space for Photometric Stereo

    Authors: Ashish Tiwari, Mihir Sutariya, Shanmuganathan Raman

    Abstract: Photometric stereo is a powerful method for obtaining per-pixel surface normals from differently illuminated images of an object. While several methods address photometric stereo with different image (or light) counts ranging from one to two to a hundred, very few focus on learning optimal lighting configuration. Finding an optimal configuration is challenging due to the vast number of possible li… ▽ More

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: Accepted in WACV 2025

  26. arXiv:2409.00877  [pdf, other

    cs.CV

    Digital Twins in Additive Manufacturing: A Systematic Review

    Authors: Md Manjurul Ahsan, Yingtao Liu, Shivakumar Raman, Zahed Siddique

    Abstract: Digital Twins (DTs) are becoming popular in Additive Manufacturing (AM) due to their ability to create virtual replicas of physical components of AM machines, which helps in real-time production monitoring. Advanced techniques such as Machine Learning (ML), Augmented Reality (AR), and simulation-based models play key roles in developing intelligent and adaptable DTs in manufacturing processes. How… ▽ More

    Submitted 1 November, 2024; v1 submitted 1 September, 2024; originally announced September 2024.

  27. arXiv:2409.00674  [pdf, other

    cs.CV

    MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo

    Authors: Ashish Tiwari, Satoshi Ikehata, Shanmuganathan Raman

    Abstract: Photometric stereo typically demands intricate data acquisition setups involving multiple light sources to recover surface normals accurately. In this paper, we propose MERLiN, an attention-based hourglass network that integrates single image-based inverse rendering and relighting within a single unified framework. We evaluate the performance of photometric stereo methods using these relit images… ▽ More

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: Accepted in ECCV 2024

  28. arXiv:2408.10207  [pdf, other

    cs.CV

    A Comprehensive Survey on Diffusion Models and Their Applications

    Authors: Md Manjurul Ahsan, Shivakumar Raman, Yingtao Liu, Zahed Siddique

    Abstract: Diffusion Models are probabilistic models that create realistic samples by simulating the diffusion process, gradually adding and removing noise from data. These models have gained popularity in domains such as image processing, speech synthesis, and natural language processing due to their ability to produce high-quality samples. As Diffusion Models are being adopted in various domains, existing… ▽ More

    Submitted 1 July, 2024; originally announced August 2024.

  29. arXiv:2408.04805  [pdf

    eess.IV cs.CV cs.LG physics.med-ph

    Improved Robustness for Deep Learning-based Segmentation of Multi-Center Myocardial Perfusion MRI Datasets Using Data Adaptive Uncertainty-guided Space-time Analysis

    Authors: Dilek M. Yalcinkaya, Khalid Youssef, Bobak Heydari, Janet Wei, Noel Bairey Merz, Robert Judd, Rohan Dharmakumar, Orlando P. Simonetti, Jonathan W. Weinsaft, Subha V. Raman, Behzad Sharif

    Abstract: Background. Fully automatic analysis of myocardial perfusion MRI datasets enables rapid and objective reporting of stress/rest studies in patients with suspected ischemic heart disease. Developing deep learning techniques that can analyze multi-center datasets despite limited training data and variations in software and hardware is an ongoing challenge. Methods. Datasets from 3 medical centers a… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: Accepted for publication in JCMR, 2024

  30. arXiv:2407.09294  [pdf, other

    cs.CV

    SS-SfP:Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization

    Authors: Ashish Tiwari, Shanmuganathan Raman

    Abstract: We present a novel inverse rendering-based framework to estimate the 3D shape (per-pixel surface normals and depth) of objects and scenes from single-view polarization images, the problem popularly known as Shape from Polarization (SfP). The existing physics-based and learning-based methods for SfP perform under certain restrictions, i.e., (a) purely diffuse or purely specular reflections, which a… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Published in Pacific Graphics 2023

  31. arXiv:2405.13832  [pdf, other

    cs.CR cs.AI cs.LG

    Federated Learning in Healthcare: Model Misconducts, Security, Challenges, Applications, and Future Research Directions -- A Systematic Review

    Authors: Md Shahin Ali, Md Manjurul Ahsan, Lamia Tasnim, Sadia Afrin, Koushik Biswas, Md Maruf Hossain, Md Mahfuz Ahmed, Ronok Hashan, Md Khairul Islam, Shivakumar Raman

    Abstract: Data privacy has become a major concern in healthcare due to the increasing digitization of medical records and data-driven medical research. Protecting sensitive patient information from breaches and unauthorized access is critical, as such incidents can have severe legal and ethical complications. Federated Learning (FL) addresses this concern by enabling multiple healthcare institutions to coll… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  32. arXiv:2405.10925  [pdf

    stat.ME cs.AI cs.LG

    High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates

    Authors: Janick Weberpals, Pamela A. Shaw, Kueiyu Joshua Lin, Richard Wyss, Joseph M Plasek, Li Zhou, Kerry Ngan, Thomas DeRamus, Sudha R. Raman, Bradley G. Hammill, Hana Lee, Sengwee Toh, John G. Connolly, Kimberly J. Dandreo, Fang Tian, Wei Liu, Jie Li, José J. Hernández-Muñoz, Sebastian Schneeweiss, Rishi J. Desai

    Abstract: Multiple imputation (MI) models can be improved by including auxiliary covariates (AC), but their performance in high-dimensional data is not well understood. We aimed to develop and compare high-dimensional MI (HDMI) approaches using structured and natural language processing (NLP)-derived AC in studies with partially observed confounders. We conducted a plasmode simulation study using data from… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  33. arXiv:2312.06317  [pdf, ps, other

    cs.GR

    Flow Symmetrization for Parameterized Constrained Diffeomorphisms

    Authors: Aalok Gangopadhyay, Dwip Dalal, Progyan Das, Shanmuganathan Raman

    Abstract: Diffeomorphisms play a crucial role while searching for shapes with fixed topological properties, allowing for smooth deformation of template shapes. Several approaches use diffeomorphism for shape search. However, these approaches employ only unconstrained diffeomorphisms. In this work, we develop Flow Symmetrization - a method to represent a parametric family of constrained diffeomorphisms that… ▽ More

    Submitted 21 July, 2025; v1 submitted 11 December, 2023; originally announced December 2023.

  34. arXiv:2311.11988  [pdf, other

    cs.CV cs.AI

    Categorizing the Visual Environment and Analyzing the Visual Attention of Dogs

    Authors: Shreyas Sundara Raman, Madeline H. Pelgrim, Daphna Buchsbaum, Thomas Serre

    Abstract: Dogs have a unique evolutionary relationship with humans and serve many important roles e.g. search and rescue, blind assistance, emotional support. However, few datasets exist to categorize visual features and objects available to dogs, as well as how dogs direct their visual attention within their environment. We collect and study a dataset with over 11,698 gazes to categorize the objects availa… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 13 pages, 11 figures, 1 table, WACV CV4Smalls Workshop

  35. arXiv:2311.04942  [pdf, other

    eess.IV cs.CV

    CSAM: A 2.5D Cross-Slice Attention Module for Anisotropic Volumetric Medical Image Segmentation

    Authors: Alex Ling Yu Hung, Haoxin Zheng, Kai Zhao, Xiaoxi Du, Kaifeng Pang, Qi Miao, Steven S. Raman, Demetri Terzopoulos, Kyunghyun Sung

    Abstract: A large portion of volumetric medical data, especially magnetic resonance imaging (MRI) data, is anisotropic, as the through-plane resolution is typically much lower than the in-plane resolution. Both 3D and purely 2D deep learning-based segmentation methods are deficient in dealing with such volumetric data since the performance of 3D methods suffers when confronting anisotropic data, and 2D meth… ▽ More

    Submitted 26 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

  36. arXiv:2310.16532  [pdf, other

    cs.CV

    Learning Robust Deep Visual Representations from EEG Brain Recordings

    Authors: Prajwal Singh, Dwip Dalal, Gautam Vashishtha, Krishna Miyapuram, Shanmuganathan Raman

    Abstract: Decoding the human brain has been a hallmark of neuroscientists and Artificial Intelligence researchers alike. Reconstruction of visual images from brain Electroencephalography (EEG) signals has garnered a lot of interest due to its applications in brain-computer interfacing. This study proposes a two-stage method where the first step is to obtain EEG-derived features for robust learning of deep r… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted in WACV 2024

  37. arXiv:2310.08645  [pdf, other

    cs.CV cs.LG

    Defect Analysis of 3D Printed Cylinder Object Using Transfer Learning Approaches

    Authors: Md Manjurul Ahsan, Shivakumar Raman, Zahed Siddique

    Abstract: Additive manufacturing (AM) is gaining attention across various industries like healthcare, aerospace, and automotive. However, identifying defects early in the AM process can reduce production costs and improve productivity - a key challenge. This study explored the effectiveness of machine learning (ML) approaches, specifically transfer learning (TL) models, for defect detection in 3D-printed cy… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  38. arXiv:2309.09919  [pdf, other

    cs.RO cs.AI cs.FL

    Plug in the Safety Chip: Enforcing Constraints for LLM-driven Robot Agents

    Authors: Ziyi Yang, Shreyas S. Raman, Ankit Shah, Stefanie Tellex

    Abstract: Recent advancements in large language models (LLMs) have enabled a new research domain, LLM agents, for solving robotics and planning tasks by leveraging the world knowledge and general reasoning abilities of LLMs obtained during pretraining. However, while considerable effort has been made to teach the robot the "dos," the "don'ts" received relatively less attention. We argue that, for any practi… ▽ More

    Submitted 28 November, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  39. arXiv:2308.13488  [pdf, other

    eess.IV cs.AI cs.CV physics.med-ph

    Temporal Uncertainty Localization to Enable Human-in-the-loop Analysis of Dynamic Contrast-enhanced Cardiac MRI Datasets

    Authors: Dilek M. Yalcinkaya, Khalid Youssef, Bobak Heydari, Orlando Simonetti, Rohan Dharmakumar, Subha Raman, Behzad Sharif

    Abstract: Dynamic contrast-enhanced (DCE) cardiac magnetic resonance imaging (CMRI) is a widely used modality for diagnosing myocardial blood flow (perfusion) abnormalities. During a typical free-breathing DCE-CMRI scan, close to 300 time-resolved images of myocardial perfusion are acquired at various contrast "wash in/out" phases. Manual segmentation of myocardial contours in each time-frame of a DCE image… ▽ More

    Submitted 13 November, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in MICCAI 2023

  40. arXiv:2307.08652  [pdf, other

    cs.GR

    Search Me Knot, Render Me Knot: Embedding Search and Differentiable Rendering of Knots in 3D

    Authors: Aalok Gangopadhyay, Paras Gupta, Tarun Sharma, Prajwal Singh, Shanmuganathan Raman

    Abstract: We introduce the problem of knot-based inverse perceptual art. Given multiple target images and their corresponding viewing configurations, the objective is to find a 3D knot-based tubular structure whose appearance resembles the target images when viewed from the specified viewing configurations. To solve this problem, we first design a differentiable rendering algorithm for rendering tubular kno… ▽ More

    Submitted 19 August, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  41. arXiv:2307.02814  [pdf, other

    cs.CV eess.IV

    Single Image LDR to HDR Conversion using Conditional Diffusion

    Authors: Dwip Dalal, Gautam Vashishtha, Prajwal Singh, Shanmuganathan Raman

    Abstract: Digital imaging aims to replicate realistic scenes, but Low Dynamic Range (LDR) cameras cannot represent the wide dynamic range of real scenes, resulting in under-/overexposed images. This paper presents a deep learning-based approach for recovering intricate details from shadows and highlights while reconstructing High Dynamic Range (HDR) images. We formulate the problem as an image-to-image (I2I… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Journal ref: IEEE International Conference on Image Processing 2023

  42. arXiv:2306.13452  [pdf, other

    cs.CV cs.GR

    A Graph Neural Network Approach for Temporal Mesh Blending and Correspondence

    Authors: Aalok Gangopadhyay, Abhinav Narayan Harish, Prajwal Singh, Shanmuganathan Raman

    Abstract: We have proposed a self-supervised deep learning framework for solving the mesh blending problem in scenarios where the meshes are not in correspondence. To solve this problem, we have developed Red-Blue MPNN, a novel graph neural network that processes an augmented graph to estimate the correspondence. We have designed a novel conditional refinement scheme to find the exact correspondence when ce… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  43. arXiv:2305.20077  [pdf, other

    cs.LG cs.DC cs.SE

    Managed Geo-Distributed Feature Store: Architecture and System Design

    Authors: Anya Li, Bhala Ranganathan, Feng Pan, Mickey Zhang, Qianjun Xu, Runhan Li, Sethu Raman, Shail Paragbhai Shah, Vivienne Tang

    Abstract: Companies are using machine learning to solve real-world problems and are developing hundreds to thousands of features in the process. They are building feature engineering pipelines as part of MLOps life cycle to transform data from various data sources and materialize the same for future consumption. Without feature stores, different teams across various business groups would maintain the above… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: All the authors are from the AzureML Feature Store product group and are listed in alphabetical order. Bhala Ranganathan: System architect and tech lead of AzureML Feature Store. Feng Pan, Qianjun Xu: Engineering managers. Sethu Raman: Product Manager of AzureML Feature Store who structured and organized the product vision and specifications

  44. arXiv:2305.09777  [pdf, other

    cs.LG

    BSGAN: A Novel Oversampling Technique for Imbalanced Pattern Recognitions

    Authors: Md Manjurul Ahsan, Shivakumar Raman, Zahed Siddique

    Abstract: Class imbalanced problems (CIP) are one of the potential challenges in developing unbiased Machine Learning (ML) models for predictions. CIP occurs when data samples are not equally distributed between the two or multiple classes. Borderline-Synthetic Minority Oversampling Techniques (SMOTE) is one of the approaches that has been used to balance the imbalance data by oversampling the minor (limite… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  45. CERTainty: Detecting DNS Manipulation at Scale using TLS Certificates

    Authors: Elisa Tsai, Deepak Kumar, Ram Sundara Raman, Gavin Li, Yael Eiger, Roya Ensafi

    Abstract: DNS manipulation is an increasingly common technique used by censors and other network adversaries to prevent users from accessing restricted Internet resources and hijack their connections. Prior work in detecting DNS manipulation relies largely on comparing DNS resolutions with trusted control results to identify inconsistencies. However, the emergence of CDNs and other cloud providers practicin… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: To Appear in: Privacy Enhancing Technologies Symposium (PETS), July 2023

  46. arXiv:2305.04401  [pdf, other

    eess.IV cs.CV

    Few Shot Learning for Medical Imaging: A Comparative Analysis of Methodologies and Formal Mathematical Framework

    Authors: Jannatul Nayem, Sayed Sahriar Hasan, Noshin Amina, Bristy Das, Md Shahin Ali, Md Manjurul Ahsan, Shivakumar Raman

    Abstract: Deep learning becomes an elevated context regarding disposing of many machine learning tasks and has shown a breakthrough upliftment to extract features from unstructured data. Though this flourishing context is developing in the medical image processing sector, scarcity of problem-dependent training data has become a larger issue in the way of easy application of deep learning in the medical sect… ▽ More

    Submitted 31 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: Accepted for a Springer book chapter for a book title "Data-driven approaches to Medical Imaging"

  47. arXiv:2304.10582  [pdf, other

    eess.IV cs.CV

    Invariant Scattering Transform for Medical Imaging

    Authors: Md Manjurul Ahsan, Shivakumar Raman, Zahed Siddique

    Abstract: Over the years, the Invariant Scattering Transform (IST) technique has become popular for medical image analysis, including using wavelet transform computation using Convolutional Neural Networks (CNN) to capture patterns' scale and orientation in the input signal. IST aims to be invariant to transformations that are common in medical images, such as translation, rotation, scaling, and deformation… ▽ More

    Submitted 31 May, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted for Springer book chapter for a book "Data-driven approaches to Medical Imaging"

  48. arXiv:2302.10121  [pdf, other

    cs.HC q-bio.NC

    EEG2IMAGE: Image Reconstruction from EEG Brain Signals

    Authors: Prajwal Singh, Pankaj Pandey, Krishna Miyapuram, Shanmuganathan Raman

    Abstract: Reconstructing images using brain signals of imagined visuals may provide an augmented vision to the disabled, leading to the advancement of Brain-Computer Interface (BCI) technology. The recent progress in deep learning has boosted the study area of synthesizing images from brain signals using Generative Adversarial Networks (GAN). In this work, we have proposed a framework for synthesizing the i… ▽ More

    Submitted 18 March, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted in ICASSP 2023

  49. arXiv:2212.03733  [pdf, other

    cs.LG cs.AI

    Tiered Reward: Designing Rewards for Specification and Fast Learning of Desired Behavior

    Authors: Zhiyuan Zhou, Shreyas Sundara Raman, Henry Sowerby, Michael L. Littman

    Abstract: Reinforcement-learning agents seek to maximize a reward signal through environmental interactions. As humans, our job in the learning process is to design reward functions to express desired behavior and enable the agent to learn such behavior swiftly. However, designing good reward functions to induce the desired behavior is generally hard, let alone the question of which rewards make learning fa… ▽ More

    Submitted 1 August, 2024; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: For code, see https://github.com/zhouzypaul/tiered-reward

    Journal ref: Reinforcement Learning Journal, vol. 1, no. 1, 2024, pp. TBD

  50. arXiv:2211.11040  [pdf, other

    cs.CV

    PointResNet: Residual Network for 3D Point Cloud Segmentation and Classification

    Authors: Aadesh Desai, Saagar Parikh, Seema Kumari, Shanmuganathan Raman

    Abstract: Point cloud segmentation and classification are some of the primary tasks in 3D computer vision with applications ranging from augmented reality to robotics. However, processing point clouds using deep learning-based algorithms is quite challenging due to the irregular point formats. Voxelization or 3D grid-based representation are different ways of applying deep neural networks to this problem. I… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: Paper Under Review at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023