Skip to main content

Showing 1–50 of 140 results for author: Singh, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.16751  [pdf, ps, other

    eess.AS

    H-QuEST: Accelerating Query-by-Example Spoken Term Detection with Hierarchical Indexing

    Authors: Akanksha Singh, Yi-Ping Phoebe Chen, Vipul Arora

    Abstract: Query-by-example spoken term detection (QbE-STD) searches for matching words or phrases in an audio dataset using a sample spoken query. When annotated data is limited or unavailable, QbE-STD is often done using template matching methods like dynamic time warping (DTW), which are computationally expensive and do not scale well. To address this, we propose H-QuEST (Hierarchical Query-by-Example Spo… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Journal ref: Interspeech 2025

  2. arXiv:2506.14059  [pdf, ps, other

    eess.SY

    A Stochastic Differential Equation Framework for Modeling Queue Length Dynamics Inspired by Self-Similarity

    Authors: Shakib Mustavee, Shaurya Agarwal, Arvind Singh

    Abstract: This article develops a stochastic differential equation (SDE) for modeling the temporal evolution of queue length dynamics at signalized intersections. Inspired by the observed quasiperiodic and self-similar characteristics of the queue length dynamics, the proposed model incorporates three properties into the SDE: (i) mean reversion with periodic mean, (ii) multiplicative noise, and (iii) fracti… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  3. arXiv:2506.10832  [pdf, ps, other

    eess.IV physics.flu-dyn

    A novel visual data-based diagnostic approach for estimation of regime transition in pool boiling

    Authors: Pranay Nirapure, Ayushman Singh, Srikanth Rangarajan, Bahgat Sammakia

    Abstract: This study introduces a novel metric, the Index of Visual Similarity (IVS), to qualitatively characterize boiling heat transfer regimes using only visual data. The IVS is constructed by combining morphological similarity, through SIFT-based feature matching, with physical similarity, via vapor area estimation using Mask R-CNN. High-speed images of pool boiling on two distinct surfaces, polished co… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  4. arXiv:2505.19839  [pdf, ps, other

    eess.SY

    Chance-constrained Solar PV Hosting Capacity Assessment for Distribution Grids Using Gaussian Process and Logit Learning

    Authors: Sel Ly, Anshuman Singh, Petr Vorobev, Yeng Chai Soh, Hung Dinh Nguyen

    Abstract: Growing penetration of distributed generation such as solar PV can increase the risk of over-voltage in distribution grids, affecting network security. Therefore, assessment of the so-called, PV hosting capacity (HC) - the maximum amount of PV that a given grid can accommodate becomes an important practical problem. In this paper, we propose a novel chance-constrained HC estimation framework using… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  5. arXiv:2505.08693  [pdf, ps, other

    eess.IV cs.CV

    VIViT: Variable-Input Vision Transformer Framework for 3D MR Image Segmentation

    Authors: Badhan Kumar Das, Ajay Singh, Gengyan Zhao, Han Liu, Thomas J. Re, Dorin Comaniciu, Eli Gibson, Andreas Maier

    Abstract: Self-supervised pretrain techniques have been widely used to improve the downstream tasks' performance. However, real-world magnetic resonance (MR) studies usually consist of different sets of contrasts due to different acquisition protocols, which poses challenges for the current deep learning methods on large-scale pretrain and different downstream tasks with different input requirements, since… ▽ More

    Submitted 14 June, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: 9 pages

  6. arXiv:2505.03695  [pdf, other

    cs.RO eess.SY

    Frenet Corridor Planner: An Optimal Local Path Planning Framework for Autonomous Driving

    Authors: Faizan M. Tariq, Zheng-Hang Yeh, Avinash Singh, David Isele, Sangjae Bae

    Abstract: Motivated by the requirements for effectiveness and efficiency, path-speed decomposition-based trajectory planning methods have widely been adopted for autonomous driving applications. While a global route can be pre-computed offline, real-time generation of adaptive local paths remains crucial. Therefore, we present the Frenet Corridor Planner (FCP), an optimization-based local path planning stra… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 8 pages, 10 figures - Presented at 2025 IEEE 36th Intelligent Vehicles Symposium (IV)

  7. arXiv:2505.02529  [pdf, other

    eess.IV cs.CV

    RobSurv: Vector Quantization-Based Multi-Modal Learning for Robust Cancer Survival Prediction

    Authors: Aiman Farooq, Azad Singh, Deepak Mishra, Santanu Chaudhury

    Abstract: Cancer survival prediction using multi-modal medical imaging presents a critical challenge in oncology, mainly due to the vulnerability of deep learning models to noise and protocol variations across imaging centers. Current approaches struggle to extract consistent features from heterogeneous CT and PET images, limiting their clinical applicability. We address these challenges by introducing RobS… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  8. arXiv:2505.01670  [pdf, other

    eess.IV cs.CV cs.LG

    Efficient Multi Subject Visual Reconstruction from fMRI Using Aligned Representations

    Authors: Christos Zangos, Danish Ebadulla, Thomas Christopher Sprague, Ambuj Singh

    Abstract: This work introduces a novel approach to fMRI-based visual image reconstruction using a subject-agnostic common representation space. We show that the brain signals of the subjects can be aligned in this common space during training to form a semantically aligned common brain. This is leveraged to demonstrate that aligning subject-specific lightweight modules to a reference subject is significantl… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

  9. arXiv:2504.11045  [pdf, other

    cs.RO cs.AI eess.SY

    Neural Control Barrier Functions from Physics Informed Neural Networks

    Authors: Shreenabh Agrawal, Manan Tayal, Aditya Singh, Shishir Kolathaya

    Abstract: As autonomous systems become increasingly prevalent in daily life, ensuring their safety is paramount. Control Barrier Functions (CBFs) have emerged as an effective tool for guaranteeing safety; however, manually designing them for specific applications remains a significant challenge. With the advent of deep learning techniques, recent research has explored synthesizing CBFs using neural networks… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 8 pages, 5 figures

  10. $π$-MPPI: A Projection-based Model Predictive Path Integral Scheme for Smooth Optimal Control of Fixed-Wing Aerial Vehicles

    Authors: Edvin Martin Andrejev, Amith Manoharan, Karl-Eerik Unt, Arun Kumar Singh

    Abstract: Model Predictive Path Integral (MPPI) is a popular sampling-based Model Predictive Control (MPC) algorithm for nonlinear systems. It optimizes trajectories by sampling control sequences and averaging them. However, a key issue with MPPI is the non-smoothness of the optimal control sequence, leading to oscillations in systems like fixed-wing aerial vehicles (FWVs). Existing solutions use post-hoc s… ▽ More

    Submitted 16 April, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

    Comments: 8 pages, 4 figures, submitted to IEEE RA-L

    Journal ref: IEEE ROBOTICS AND AUTOMATION LETTERS, VOL. 10, NO. 6, JUNE 2025

  11. arXiv:2504.04532  [pdf, ps, other

    eess.IV cs.CV

    BrainMRDiff: A Diffusion Model for Anatomically Consistent Brain MRI Synthesis

    Authors: Moinak Bhattacharya, Saumya Gupta, Annie Singh, Chao Chen, Gagandeep Singh, Prateek Prasanna

    Abstract: Accurate brain tumor diagnosis relies on the assessment of multiple Magnetic Resonance Imaging (MRI) sequences. However, in clinical practice, the acquisition of certain sequences may be affected by factors like motion artifacts or contrast agent contraindications, leading to suboptimal outcome, such as poor image quality. This can then affect image interpretation by radiologists. Synthesizing hig… ▽ More

    Submitted 29 May, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

  12. arXiv:2503.17395  [pdf, other

    eess.SY cs.AI cs.RO

    CP-NCBF: A Conformal Prediction-based Approach to Synthesize Verified Neural Control Barrier Functions

    Authors: Manan Tayal, Aditya Singh, Pushpak Jagtap, Shishir Kolathaya

    Abstract: Control Barrier Functions (CBFs) are a practical approach for designing safety-critical controllers, but constructing them for arbitrary nonlinear dynamical systems remains a challenge. Recent efforts have explored learning-based methods, such as neural CBFs (NCBFs), to address this issue. However, ensuring the validity of NCBFs is difficult due to potential learning errors. In this letter, we pro… ▽ More

    Submitted 17 May, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

    Comments: 17 Pages, 10 Figures. First two authors have contributed equally

  13. arXiv:2503.16798  [pdf, other

    eess.IV eess.SP

    A Pathway to Near Tissue Computing through Processing-in-CTIA Pixels for Biomedical Applications

    Authors: Zihan Yin, Subhradip Chakraborty, Ankur Singh, Chengwei Zhou, Gourav Datta, Akhilesh Jaiswal

    Abstract: Near-tissue computing requires sensor-level processing of high-resolution images, essential for real-time biomedical diagnostics and surgical guidance. To address this need, we introduce a novel Capacitive Transimpedance Amplifier-based In-Pixel Computing (CTIA-IPC) architecture. Our design leverages CTIA pixels that are widely used for biomedical imaging owing to the inherent advantages of excell… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  14. arXiv:2502.20636  [pdf, ps, other

    cs.RO eess.SY

    Delayed-Decision Motion Planning in the Presence of Multiple Predictions

    Authors: David Isele, Alexandre Miranda Anon, Faizan M. Tariq, Goro Yeh, Avinash Singh, Sangjae Bae

    Abstract: Reliable automated driving technology is challenged by various sources of uncertainties, in particular, behavioral uncertainties of traffic agents. It is common for traffic agents to have intentions that are unknown to others, leaving an automated driving car to reason over multiple possible behaviors. This paper formalizes a behavior planning scheme in the presence of multiple possible futures wi… ▽ More

    Submitted 6 June, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

  15. arXiv:2502.11057  [pdf, other

    cs.RO cs.AI eess.SY

    A Physics-Informed Machine Learning Framework for Safe and Optimal Control of Autonomous Systems

    Authors: Manan Tayal, Aditya Singh, Shishir Kolathaya, Somil Bansal

    Abstract: As autonomous systems become more ubiquitous in daily life, ensuring high performance with guaranteed safety is crucial. However, safety and performance could be competing objectives, which makes their co-optimization difficult. Learning-based methods, such as Constrained Reinforcement Learning (CRL), achieve strong performance but lack formal safety guarantees due to safety being enforced as soft… ▽ More

    Submitted 28 May, 2025; v1 submitted 16 February, 2025; originally announced February 2025.

    Comments: 22 Pages, 12 Figures. First two authors have contributed equally. Accepted at ICML 2025

  16. arXiv:2501.08058  [pdf, other

    eess.SY

    Range-Only Dynamic Output Feedback Controller for Safe and Secure Target Circumnavigation

    Authors: Anand Singh, Anoop Jain

    Abstract: The safety and security of robotic systems are paramount when navigating around a hostile target. This paper addresses the problem of circumnavigating an unknown target by a unicycle robot while ensuring it maintains a desired safe distance and remains within the sensing region around the target throughout its motion. The proposed control design methodology is based on the construction of a joint… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

  17. arXiv:2501.07197  [pdf

    eess.IV cs.CV cs.LG

    Lung Cancer detection using Deep Learning

    Authors: Aryan Chaudhari, Ankush Singh, Sanchi Gajbhiye, Pratham Agrawal

    Abstract: In this paper we discuss lung cancer detection using hybrid model of Convolutional-Neural-Networks (CNNs) and Support-Vector-Machines-(SVMs) in order to gain early detection of tumors, benign or malignant. The work uses this hybrid model by training upon the Computed Tomography scans (CT scans) as dataset. Using deep learning for detecting lung cancer early is a cutting-edge method.

    Submitted 13 January, 2025; originally announced January 2025.

  18. arXiv:2501.03765  [pdf, other

    cs.CV eess.IV

    Image Segmentation: Inducing graph-based learning

    Authors: Aryan Singh, Pepijn Van de Ven, Ciarán Eising, Patrick Denny

    Abstract: This study explores the potential of graph neural networks (GNNs) to enhance semantic segmentation across diverse image modalities. We evaluate the effectiveness of a novel GNN-based U-Net architecture on three distinct datasets: PascalVOC, a standard benchmark for natural image segmentation, WoodScape, a challenging dataset of fisheye images commonly used in autonomous driving, introducing signif… ▽ More

    Submitted 19 January, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

  19. arXiv:2412.05216  [pdf, other

    eess.IV cs.CV cs.LG

    ColonNet: A Hybrid Of DenseNet121 And U-NET Model For Detection And Segmentation Of GI Bleeding

    Authors: Ayushman Singh, Sharad Prakash, Aniket Das, Nidhi Kushwaha

    Abstract: This study presents an integrated deep learning model for automatic detection and classification of Gastrointestinal bleeding in the frames extracted from Wireless Capsule Endoscopy (WCE) videos. The dataset has been released as part of Auto-WCBleedGen Challenge Version V2 hosted by the MISAHUB team. Our model attained the highest performance among 75 teams that took part in this competition. It a… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

  20. arXiv:2411.14100  [pdf, other

    eess.AS cs.CL cs.IR

    BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection

    Authors: Anup Singh, Kris Demuynck, Vipul Arora

    Abstract: Spoken term detection (STD) is often hindered by reliance on frame-level features and the computationally intensive DTW-based template matching, limiting its practicality. To address these challenges, we propose a novel approach that encodes speech into discrete, speaker-agnostic semantic tokens. This facilitates fast retrieval using text-based search algorithms and effectively handles out-of-voca… ▽ More

    Submitted 21 December, 2024; v1 submitted 21 November, 2024; originally announced November 2024.

    Comments: Accepted at ICASSP 2025

  21. arXiv:2411.12681  [pdf

    eess.IV cs.AI cs.CV

    AI Guided Early Screening of Cervical Cancer

    Authors: Dharanidharan S I, Suhitha Renuka S V, Ajishi Singh, Sheena Christabel Pravin

    Abstract: In order to support the creation of reliable machine learning models for anomaly detection, this project focuses on preprocessing, enhancing, and organizing a medical imaging dataset. There are two classifications in the dataset: normal and abnormal, along with extra noise fluctuations. In order to improve the photographs' quality, undesirable artifacts, including visible medical equipment at the… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

  22. arXiv:2411.09204  [pdf, other

    eess.IV cs.AI physics.med-ph

    RibCageImp: A Deep Learning Framework for 3D Ribcage Implant Generation

    Authors: Gyanendra Chaubey, Aiman Farooq, Azad Singh, Deepak Mishra

    Abstract: The recovery of damaged or resected ribcage structures requires precise, custom-designed implants to restore the integrity and functionality of the thoracic cavity. Traditional implant design methods rely mainly on manual processes, making them time-consuming and susceptible to variability. In this work, we explore the feasibility of automated ribcage implant generation using deep learning. We pre… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

  23. arXiv:2411.01506  [pdf, other

    eess.SY

    Degradation-Infused Energy Portfolio Allocation Framework: Risk-Averse Fair Storage Participation

    Authors: Parikshit Pareek, L. P. Mohasha Isuru Sampath, Anshuman Singh, Lalit Goel, Hoay Beng Gooi, Hung Dinh Nguyen

    Abstract: This work proposes a novel degradation-infused energy portfolio allocation (DI-EPA) framework for enabling the participation of battery energy storage systems in multi-service electricity markets. The proposed framework attempts to address the challenge of including the rainflow algorithm for cycle counting by directly developing a closed-form of marginal degradation as a function of dispatch deci… ▽ More

    Submitted 4 November, 2024; v1 submitted 3 November, 2024; originally announced November 2024.

  24. arXiv:2410.19858  [pdf, other

    cs.LG cs.CE eess.SP physics.geo-ph

    Enhancing Deep Learning based RMT Data Inversion using Gaussian Random Field

    Authors: Koustav Ghosal, Arun Singh, Samir Malakar, Shalivahan Srivastava, Deepak Gupta

    Abstract: Deep learning (DL) methods have emerged as a powerful tool for the inversion of geophysical data. When applied to field data, these models often struggle without additional fine-tuning of the network. This is because they are built on the assumption that the statistical patterns in the training and test datasets are the same. To address this, we propose a DL-based inversion scheme for Radio Magnet… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

  25. arXiv:2410.19151  [pdf, other

    eess.IV cs.CV

    CapsuleNet: A Deep Learning Model To Classify GI Diseases Using EfficientNet-b7

    Authors: Aniket Das, Ayushman Singh, Nishant, Sharad Prakash

    Abstract: Gastrointestinal (GI) diseases represent a significant global health concern, with Capsule Endoscopy (CE) offering a non-invasive method for diagnosis by capturing a large number of GI tract images. However, the sheer volume of video frames necessitates automated analysis to reduce the workload on doctors and increase the diagnostic accuracy. In this paper, we present CapsuleNet, a deep learning m… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

    Comments: Capsule Vision 2024 Challenge

  26. arXiv:2410.15321  [pdf, other

    cs.RO eess.SY

    Integrated Design and Control of a Robotic Arm on a Quadcopter for Enhanced Package Delivery

    Authors: Animesh Singh, Jason Hillyer, Fariba Ariaei, Hossein Jula

    Abstract: This paper presents a comprehensive design process for the integration of a robotic arm into a quadcopter, emphasizing the physical modeling, system integration, and controller development. Utilizing SolidWorks for mechanical design and MATLAB Simscape for simulation and control, this study addresses the challenges encountered in integrating the robotic arm with the drone, encompassing both mechan… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

  27. arXiv:2410.07393  [pdf, other

    eess.SP cs.IT

    How Much Power Must We Extract From a Receiver Antenna to Effect Communications?

    Authors: Thomas L. Marzetta, Brian McMinn, Amritpal Singh, Thorkild B. Hansen

    Abstract: Subject to the laws of classical physics - the science that governs the design of today's wireless communication systems - there is no need to extract power from a receiver antenna in order to effect communications. If we dispense with a transmission line and, instead, make the front-end electronics colocated with the antenna, then a high input-impedance preamplifier can measure the open-circuit v… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 10 pages

  28. arXiv:2409.19015  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Textless NLP -- Zero Resource Challenge with Low Resource Compute

    Authors: Krithiga Ramadass, Abrit Pal Singh, Srihari J, Sheetal Kalyani

    Abstract: This work addresses the persistent challenges of substantial training time and GPU resource requirements even when training lightweight encoder-vocoder models for Textless NLP. We reduce training steps significantly while improving performance by a) leveraging learning rate schedulers for efficient and faster convergence b) optimizing hop length and c) tuning the interpolation scale factors for be… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  29. arXiv:2409.12616  [pdf, other

    cs.RO eess.SY

    Semi-Supervised Safe Visuomotor Policy Synthesis using Barrier Certificates

    Authors: Manan Tayal, Aditya Singh, Pushpak Jagtap, Shishir Kolathaya

    Abstract: In modern robotics, addressing the lack of accurate state space information in real-world scenarios has led to a significant focus on utilizing visuomotor observation to provide safety assurances. Although supervised learning methods, such as imitation learning, have demonstrated potential in synthesizing control policies based on visuomotor observations, they require ground truth safety labels fo… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: First two authors have contributed equally. 8 Pages, 3 figures

  30. arXiv:2409.11262  [pdf, other

    cs.SD cs.AI eess.AS

    The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection

    Authors: Gabriel Bibbó, Thomas Deacon, Arshdeep Singh, Mark D. Plumbley

    Abstract: This paper presents a residential audio dataset to support sound event detection research for smart home applications aimed at promoting wellbeing for older adults. The dataset is constructed by deploying audio recording systems in the homes of 8 participants aged 55-80 years for a 7-day period. Acoustic characteristics are documented through detailed floor plans and construction material informat… ▽ More

    Submitted 4 October, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

  31. arXiv:2409.08384  [pdf, ps, other

    eess.SP cs.LG

    Noisy Low Rank Column-wise Sensing

    Authors: Ankit Pratap Singh, Namrata Vaswani

    Abstract: This letter studies the AltGDmin algorithm for solving the noisy low rank column-wise sensing (LRCS) problem. Our sample complexity guarantee improves upon the best existing one by a factor $\max(r, \log(1/ε))/r$ where $r$ is the rank of the unknown matrix and $ε$ is the final desired accuracy. A second contribution of this work is a detailed comparison of guarantees from all work that studies the… ▽ More

    Submitted 24 March, 2025; v1 submitted 12 September, 2024; originally announced September 2024.

    Comments: 9 pages

  32. arXiv:2407.19229  [pdf, other

    eess.SY

    Impact of Transmission Dynamics and Treatment Uptake, Frequency and Timing on the Cost-effectiveness of Directly Acting Antivirals for Hepatitis C Virus Infection

    Authors: Soham Das, Ajit Sood, Vandana Midha, Arshdeep Singh, Pranjl Sharma, Varun Ramamohan

    Abstract: Cost-effectiveness analyses, based on decision-analytic models of disease progression and treatment, are routinely used to assess the economic value of a new intervention and consequently inform reimbursement decisions for the intervention. Many decision-analytic models developed to assess the economic value of highly effective directly acting antiviral (DAA) treatments for the hepatitis C virus (… ▽ More

    Submitted 17 September, 2024; v1 submitted 27 July, 2024; originally announced July 2024.

  33. arXiv:2407.15423  [pdf, other

    eess.AS cs.AI cs.MM cs.SD

    Integrating IP Broadcasting with Audio Tags: Workflow and Challenges

    Authors: Rhys Burchett-Vass, Arshdeep Singh, Gabriel Bibbó, Mark D. Plumbley

    Abstract: The broadcasting industry is increasingly adopting IP techniques, revolutionising both live and pre-recorded content production, from news gathering to live music events. IP broadcasting allows for the transport of audio and video signals in an easily configurable way, aligning with modern networking techniques. This shift towards an IP workflow allows for much greater flexibility, not only in rou… ▽ More

    Submitted 23 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: Submitted to DCASE 2024 Workshop

  34. arXiv:2406.20005  [pdf, other

    eess.IV cs.CV

    Malaria Cell Detection Using Deep Neural Networks

    Authors: Saurabh Sawant, Anurag Singh

    Abstract: Malaria remains one of the most pressing public health concerns globally, causing significant morbidity and mortality, especially in sub-Saharan Africa. Rapid and accurate diagnosis is crucial for effective treatment and disease management. Traditional diagnostic methods, such as microscopic examination of blood smears, are labor-intensive and require significant expertise, which may not be readil… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  35. arXiv:2406.17339  [pdf, other

    cs.IT eess.SP

    Optimizing Configuration Selection in Reconfigurable-Antenna MIMO Systems: Physics-Inspired Heuristic Solvers

    Authors: I. Krikidis, C. Psomas, A. K. Singh, K. Jamieson

    Abstract: Reconfigurable antenna multiple-input multiple-output (MIMO) is a foundational technology for the continuing evolution of cellular systems, including upcoming 6G communication systems. In this paper, we address the problem of flexible/reconfigurable antenna configuration selection for point-to-point MIMO antenna systems by using physics-inspired heuristics. Firstly, we optimize the antenna configu… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2403.12571

    Journal ref: IEEE Transactions on Communications, 2004

  36. arXiv:2406.09661  [pdf, other

    cs.LO cs.AI eess.SY

    Temporal Planning via Interval Logic Satisfiability for Autonomous Systems

    Authors: Miquel Ramirez, Anubhav Singh, Peter Stuckey, Chris Manzie

    Abstract: Many automated planning methods and formulations rely on suitably designed abstractions or simplifications of the constrained dynamics associated with agents to attain computational scalability. We consider formulations of temporal planning where intervals are associated with both action and fluent atoms, and relations between these are given as sentences in Allen's Interval Logic. We propose a no… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: This publication is an extended version of a manuscript submitted to ICAPS-24 (and rejected). Please contact the first author for queries, comments or discussion of the paper

  37. arXiv:2404.03307  [pdf, other

    cs.RO eess.SY

    Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model

    Authors: Amith Manoharan, Aditya Sharma, Himani Belsare, Kaustab Pal, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Navigation of wheeled vehicles on uneven terrain necessitates going beyond the 2D approaches for trajectory planning. Specifically, it is essential to incorporate the full 6dof variation of vehicle pose and its associated stability cost in the planning process. To this end, most recent works aim to learn a neural network model to predict the vehicle evolution. However, such approaches are data-int… ▽ More

    Submitted 22 November, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 8 pages, 7 figures, submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  38. arXiv:2404.00814  [pdf, other

    cs.RO eess.SY

    Exact Imposition of Safety Boundary Conditions in Neural Reachable Tubes

    Authors: Aditya Singh, Zeyuan Feng, Somil Bansal

    Abstract: Hamilton-Jacobi (HJ) reachability analysis is a widely adopted verification tool to provide safety and performance guarantees for autonomous systems. However, it involves solving a partial differential equation (PDE) to compute a safety value function, whose computational and memory complexity scales exponentially with the state dimension, making its direct application to large-scale systems intra… ▽ More

    Submitted 9 May, 2025; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: First two authors have contributed equally. 7 Pages, 3 figures. Accepted at ICRA 2025

  39. arXiv:2403.12571  [pdf, other

    cs.IT eess.SP

    Optimizing Reconfigurable Antenna MIMO Systems with Coherent Ising Machines

    Authors: Ioannis Krikidis, Abhishek Kumar Singh, Kyle Jamieson

    Abstract: Reconfigurable antenna multiple-input multiple-output (MIMO) is a promising technology for upcoming 6G communication systems. In this paper, we deal with the problem of configuration selection for reconfigurable antenna MIMO by leveraging Coherent Ising Machines (CIMs). By adopting the CIM as a heuristic solver for the Ising problem, the optimal antenna configuration that maximizes the received si… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Journal ref: IEEE International Conference on Communications (ICC), June 2024

  40. arXiv:2403.11504  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    MLVICX: Multi-Level Variance-Covariance Exploration for Chest X-ray Self-Supervised Representation Learning

    Authors: Azad Singh, Vandan Gorade, Deepak Mishra

    Abstract: Self-supervised learning (SSL) is potentially useful in reducing the need for manual annotation and making deep learning models accessible for medical image analysis tasks. By leveraging the representations learned from unlabeled data, self-supervised models perform well on tasks that require little to no fine-tuning. However, for medical images, like chest X-rays, which are characterized by compl… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  41. Impact of the Antenna on the Sub-Terahertz Indoor Channel Characteristics: An Experimental Approach

    Authors: Priyangshu Sen, Sherif Badran, Vitaly Petrov, Arjun Singh, Josep M. Jornet

    Abstract: Terahertz-band (100 GHz-10 THz) communication is a promising radio technology envisioned to enable ultra-high data rate, reliable and low-latency wireless connectivity in next-generation wireless systems. However, the low transmission power of THz transmitters, the need for high gain directional antennas, and the complex interaction of THz radiation with common objects along the propagation path m… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted and to be published in IEEE ICC 2024. Copyright © 2024 by the Institute of Electrical and Electronics Engineers (IEEE). Permission to make digital or hard copies of portions of this work for personal or classroom use is granted without fee provided that the copies are not made or distributed for profit or commercial advantage

    Journal ref: ICC 2024 - IEEE International Conference on Communications, Denver, CO, USA, 2024, pp. 2537-2542

  42. arXiv:2312.00698  [pdf, other

    eess.AS

    SPIRE-SIES: A Spontaneous Indian English Speech Corpus

    Authors: Abhayjeet Singh, Charu Shah, Rajashri Varadaraj, Sonakshi Chauhan, Prasanta Kumar Ghosh

    Abstract: In this paper, we present a 170.83 hour Indian English spontaneous speech dataset. Lack of Indian English speech data is one of the major hindrances in developing robust speech systems which are adapted to the Indian speech style. Moreover this scarcity is even more for spontaneous speech. This corpus is crowd sourced over varied Indian nativities, genders and age groups. Traditional spontaneous s… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 pages, 7 plots, 3 tables, Accepted at O-COCOSDA 2023

  43. arXiv:2311.07068  [pdf, other

    cs.IT eess.SP physics.class-ph

    Shannon Theory for Wireless Communication in a Resonant Chamber

    Authors: Amritpal Singh, Thomas Marzetta

    Abstract: A closed electromagnetic resonant chamber (RC) is a highly favorable artificial environment for wireless communication. A pair of antennas within the chamber constitutes a two-port network described by an impedance matrix. We analyze communication between the two antennas when the RC has perfectly conducting walls and the impedance matrix is imaginary-valued. The transmit antenna is driven by a cu… ▽ More

    Submitted 14 November, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: 10 pages, 13 figures. To be published in IEEE Journal on Selected Areas in Communications Special Issue on Electromagnetic Signal and Information Theory

  44. arXiv:2311.06329  [pdf

    cs.CV cs.AI cs.CL cs.LG eess.IV

    A Survey of AI Text-to-Image and AI Text-to-Video Generators

    Authors: Aditi Singh

    Abstract: Text-to-Image and Text-to-Video AI generation models are revolutionary technologies that use deep learning and natural language processing (NLP) techniques to create images and videos from textual descriptions. This paper investigates cutting-edge approaches in the discipline of Text-to-Image and Text-to-Video AI generations. The survey provides an overview of the existing literature as well as an… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 4 pages, 2 tables, 4th International Conference on Artificial Intelligence, Robotics and Control (AIRC 2023)

  45. arXiv:2310.08846  [pdf, other

    eess.AS

    Speaking rate attention-based duration prediction for speed control TTS

    Authors: Jesuraj Bandekar, Sathvik Udupa, Abhayjeet Singh, Anjali Jayakumar, Deekshitha G, Sandhya Badiger, Saurabh Kumar, Pooja VH, Prasanta Kumar Ghosh

    Abstract: With the advent of high-quality speech synthesis, there is a lot of interest in controlling various prosodic attributes of speech. Speaking rate is an essential attribute towards modelling the expressivity of speech. In this work, we propose a novel approach to control the speaking rate for non-autoregressive TTS. We achieve this by conditioning the speaking rate inside the duration predictor, all… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: \c{opyright} 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  46. arXiv:2310.07727  [pdf, other

    cs.CV eess.IV

    Deep Learning based Systems for Crater Detection: A Review

    Authors: Atal Tewari, K Prateek, Amrita Singh, Nitin Khanna

    Abstract: Craters are one of the most prominent features on planetary surfaces, used in applications such as age estimation, hazard detection, and spacecraft navigation. Crater detection is a challenging problem due to various aspects, including complex crater characteristics such as varying sizes and shapes, data resolution, and planetary data types. Similar to other computer vision tasks, deep learning-ba… ▽ More

    Submitted 28 September, 2023; originally announced October 2023.

  47. Design and Validation of a Metallic Reflectarray for Communications at True Terahertz Frequencies

    Authors: Sherif Badran, Arjun Singh, Arpit Jaiswal, Erik Einarsson, Josep M. Jornet

    Abstract: Wireless communications in the terahertz band (0.1-10 THz) is a promising and key wireless technology enabling ultra-high data rate communication over multi-gigahertz-wide bandwidths, thus fulfilling the demand for denser networks. The complex propagation environment at such high frequencies introduces several challenges, such as high spreading and molecular absorption losses. As such, intelligent… ▽ More

    Submitted 18 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted and to be published in ACM mmNets 2023. Copyright © 2023 by the Association for Computing Machinery, Inc. (ACM). Permission to make digital or hard copies of portions of this work for personal or classroom use is granted without fee provided that the copies are not made or distributed for profit or commercial advantage

    Journal ref: Proceedings of the 7th ACM Workshop on Millimeter-Wave and Terahertz Networks and Sensing Systems, ser. mmNets '23, Madrid, Spain: Association for Computing Machinery, 2024, pp. 19-24

  48. arXiv:2309.04651  [pdf

    eess.IV cs.AI cs.CV

    Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis

    Authors: Nikhil J. Dhinagar, Amit Singh, Saket Ozarkar, Ketaki Buwa, Sophia I. Thomopoulos, Conor Owens-Walton, Emily Laltoo, Yao-Liang Chen, Philip Cook, Corey McMillan, Chih-Chien Tsai, J-J Wang, Yih-Ru Wu, Paul M. Thompson

    Abstract: Transfer learning represents a recent paradigm shift in the way we build artificial intelligence (AI) systems. In contrast to training task-specific models, transfer learning involves pre-training deep learning models on a large corpus of data and minimally fine-tuning them for adaptation to specific tasks. Even so, for 3D medical imaging tasks, we do not know if it is best to pre-train models on… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  49. arXiv:2308.08713  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Decoding Emotions: A comprehensive Multilingual Study of Speech Models for Speech Emotion Recognition

    Authors: Anant Singh, Akshat Gupta

    Abstract: Recent advancements in transformer-based speech representation models have greatly transformed speech processing. However, there has been limited research conducted on evaluating these models for speech emotion recognition (SER) across multiple languages and examining their internal representations. This article addresses these gaps by presenting a comprehensive benchmark for SER with eight speech… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  50. arXiv:2308.08302  [pdf, ps, other

    cs.IT eess.SP

    PSA Based Power Control for Cell-Free Massive MIMO under LoS/NLoS Channels

    Authors: Ashish Pratap Singh, Ribhu Chopra

    Abstract: A primary design goal of the cell-free~(CF) massive MIMO architecture is to provide uniformly good coverage to all the user equipments~(UEs) connected to the network. However, it has been found that this requirement may not be satisfied in case the channels between the access points~(APs) and the UEs are mixed LoS/NLoS. In this paper, we try to address this issue via the use of appropriate power c… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 10 pages, 10 figures