-
eCAV: An Edge-Assisted Evaluation Platform for Connected Autonomous Vehicles
Authors:
Tyler Landle,
Jordan Rapp,
Dean Blank,
Chandramouli Amarnath,
Abhijit Chatterjee,
Alexandros Daglis,
Umakishore Ramachandran
Abstract:
As autonomous vehicles edge closer to widespread adoption, enhancing road safety through collision avoidance and minimization of collateral damage becomes imperative. Vehicle-to-everything (V2X) technologies, which include vehicle-to-vehicle (V2V), vehicle-to-infrastructure (V2I), and vehicle-to-cloud (V2C), are being proposed as mechanisms to achieve this safety improvement.
Simulation-based te…
▽ More
As autonomous vehicles edge closer to widespread adoption, enhancing road safety through collision avoidance and minimization of collateral damage becomes imperative. Vehicle-to-everything (V2X) technologies, which include vehicle-to-vehicle (V2V), vehicle-to-infrastructure (V2I), and vehicle-to-cloud (V2C), are being proposed as mechanisms to achieve this safety improvement.
Simulation-based testing is crucial for early-stage evaluation of Connected Autonomous Vehicle (CAV) control systems, offering a safer and more cost-effective alternative to real-world tests. However, simulating large 3D environments with many complex single- and multi-vehicle sensors and controllers is computationally intensive. There is currently no evaluation framework that can effectively evaluate realistic scenarios involving large numbers of autonomous vehicles.
We propose eCAV -- an efficient, modular, and scalable evaluation platform to facilitate both functional validation of algorithmic approaches to increasing road safety, as well as performance prediction of algorithms of various V2X technologies, including a futuristic Vehicle-to-Edge control plane and correspondingly designed control algorithms. eCAV can model up to 256 vehicles running individual control algorithms without perception enabled, which is $8\times$ more vehicles than what is possible with state-of-the-art alternatives.
△ Less
Submitted 27 June, 2025; v1 submitted 19 June, 2025;
originally announced June 2025.
-
Improving AI-generated music with user-guided training
Authors:
Vishwa Mohan Singh,
Sai Anirudh Aryasomayajula,
Ahan Chatterjee,
Beste Aydemir,
Rifat Mehreen Amin
Abstract:
AI music generation has advanced rapidly, with models like diffusion and autoregressive algorithms enabling high-fidelity outputs. These tools can alter styles, mix instruments, or isolate them. Since sound can be visualized as spectrograms, image-generation algorithms can be applied to generate novel music. However, these algorithms are typically trained on fixed datasets, which makes it challeng…
▽ More
AI music generation has advanced rapidly, with models like diffusion and autoregressive algorithms enabling high-fidelity outputs. These tools can alter styles, mix instruments, or isolate them. Since sound can be visualized as spectrograms, image-generation algorithms can be applied to generate novel music. However, these algorithms are typically trained on fixed datasets, which makes it challenging for them to interpret and respond to user input accurately. This is especially problematic because music is highly subjective and requires a level of personalization that image generation does not provide. In this work, we propose a human-computation approach to gradually improve the performance of these algorithms based on user interactions. The human-computation element involves aggregating and selecting user ratings to use as the loss function for fine-tuning the model. We employ a genetic algorithm that incorporates user feedback to enhance the baseline performance of a model initially trained on a fixed dataset. The effectiveness of this approach is measured by the average increase in user ratings with each iteration. In the pilot test, the first iteration showed an average rating increase of 0.2 compared to the baseline. The second iteration further improved upon this, achieving an additional increase of 0.39 over the first iteration.
△ Less
Submitted 5 June, 2025;
originally announced June 2025.
-
An Addendum to NeBula: Towards Extending TEAM CoSTAR's Solution to Larger Scale Environments
Authors:
Ali Agha,
Kyohei Otsu,
Benjamin Morrell,
David D. Fan,
Sung-Kyun Kim,
Muhammad Fadhil Ginting,
Xianmei Lei,
Jeffrey Edlund,
Seyed Fakoorian,
Amanda Bouman,
Fernando Chavez,
Taeyeon Kim,
Gustavo J. Correa,
Maira Saboia,
Angel Santamaria-Navarro,
Brett Lopez,
Boseong Kim,
Chanyoung Jung,
Mamoru Sobue,
Oriana Claudia Peltzer,
Joshua Ott,
Robert Trybula,
Thomas Touma,
Marcel Kaufmann,
Tiago Stegun Vaquero
, et al. (64 additional authors not shown)
Abstract:
This paper presents an appendix to the original NeBula autonomy solution developed by the TEAM CoSTAR (Collaborative SubTerranean Autonomous Robots), participating in the DARPA Subterranean Challenge. Specifically, this paper presents extensions to NeBula's hardware, software, and algorithmic components that focus on increasing the range and scale of the exploration environment. From the algorithm…
▽ More
This paper presents an appendix to the original NeBula autonomy solution developed by the TEAM CoSTAR (Collaborative SubTerranean Autonomous Robots), participating in the DARPA Subterranean Challenge. Specifically, this paper presents extensions to NeBula's hardware, software, and algorithmic components that focus on increasing the range and scale of the exploration environment. From the algorithmic perspective, we discuss the following extensions to the original NeBula framework: (i) large-scale geometric and semantic environment mapping; (ii) an adaptive positioning system; (iii) probabilistic traversability analysis and local planning; (iv) large-scale POMDP-based global motion planning and exploration behavior; (v) large-scale networking and decentralized reasoning; (vi) communication-aware mission planning; and (vii) multi-modal ground-aerial exploration solutions. We demonstrate the application and deployment of the presented systems and solutions in various large-scale underground environments, including limestone mine exploration scenarios as well as deployment in the DARPA Subterranean challenge.
△ Less
Submitted 18 April, 2025;
originally announced April 2025.
-
Can Domain Experts Rely on AI Appropriately? A Case Study on AI-Assisted Prostate Cancer MRI Diagnosis
Authors:
Chacha Chen,
Han Liu,
Jiamin Yang,
Benjamin M. Mervak,
Bora Kalaycioglu,
Grace Lee,
Emre Cakmakli,
Matteo Bonatti,
Sridhar Pudu,
Osman Kahraman,
Gul Gizem Pamuk,
Aytekin Oto,
Aritrick Chatterjee,
Chenhao Tan
Abstract:
Despite the growing interest in human-AI decision making, experimental studies with domain experts remain rare, largely due to the complexity of working with domain experts and the challenges in setting up realistic experiments. In this work, we conduct an in-depth collaboration with radiologists in prostate cancer diagnosis based on MRI images. Building on existing tools for teaching prostate can…
▽ More
Despite the growing interest in human-AI decision making, experimental studies with domain experts remain rare, largely due to the complexity of working with domain experts and the challenges in setting up realistic experiments. In this work, we conduct an in-depth collaboration with radiologists in prostate cancer diagnosis based on MRI images. Building on existing tools for teaching prostate cancer diagnosis, we develop an interface and conduct two experiments to study how AI assistance and performance feedback shape the decision making of domain experts. In Study 1, clinicians were asked to provide an initial diagnosis (human), then view the AI's prediction, and subsequently finalize their decision (human-AI team). In Study 2 (after a memory wash-out period), the same participants first received aggregated performance statistics from Study 1, specifically their own performance, the AI's performance, and their human-AI team performance, and then directly viewed the AI's prediction before making their diagnosis (i.e., no independent initial diagnosis). These two workflows represent realistic ways that clinical AI tools might be used in practice, where the second study simulates a scenario where doctors can adjust their reliance and trust on AI based on prior performance feedback. Our findings show that, while human-AI teams consistently outperform humans alone, they still underperform the AI due to under-reliance, similar to prior studies with crowdworkers. Providing clinicians with performance feedback did not significantly improve the performance of human-AI teams, although showing AI decisions in advance nudges people to follow AI more. Meanwhile, we observe that the ensemble of human-AI teams can outperform AI alone, suggesting promising directions for human-AI collaboration.
△ Less
Submitted 3 February, 2025;
originally announced February 2025.
-
MCICSAM: Monte Carlo-guided Interpolation Consistency Segment Anything Model for Semi-Supervised Prostate Zone Segmentation
Authors:
Guantian Huang,
Beibei Li,
Xiaobing Fan,
Aritrick Chatterjee,
Cheng Wei,
Shouliang Qi,
Wei Qian,
Dianning He
Abstract:
Accurate segmentation of various regions within the prostate is pivotal for diagnosing and treating prostate-related diseases. However, the scarcity of labeled data, particularly in specialized medical fields like prostate imaging, poses a significant challenge. Segment Anything Model (SAM) is a new large model for natural image segmentation, but there are some challenges in medical imaging. In or…
▽ More
Accurate segmentation of various regions within the prostate is pivotal for diagnosing and treating prostate-related diseases. However, the scarcity of labeled data, particularly in specialized medical fields like prostate imaging, poses a significant challenge. Segment Anything Model (SAM) is a new large model for natural image segmentation, but there are some challenges in medical imaging. In order to better utilize the powerful feature extraction capability of SAM as well as to address the problem of low data volume for medical image annotation, we use Low-Rank Adaptation (LoRA) and semi-supervised learning methods of Monte Carlo guided interpolation consistency (MCIC) to enhance the fine-tuned SAM. We propose Monte Carlo-guided Interpolation Consistency Segment Anything Model (MCICSAM) for application to semi-supervised learning based prostate region segmentation. In the unlabeled data section, MCIC performs two different interpolation transformations on the input data and incorporates Monte Carlo uncertainty analysis in the output, forcing the model to be consistent in its predictions. The consistency constraints imposed on these interpolated samples allow the model to fit the distribution of unlabeled data better, ultimately improving its performance in semi-supervised scenarios. We use Dice and Hausdorff Distance at 95th percentile (HD95) to validate model performance. MCICSAM yieldes Dice with 79.38% and 89.95%, along with improves HD95 values of 3.12 and 2.27 for transition zone and transition zone. At the same time MCICSAM demonstrates strong generalizability. This method is expected to bring new possibilities in the field of prostate image segmentation.
△ Less
Submitted 20 September, 2024;
originally announced September 2024.
-
Pre-Trained Foundation Model representations to uncover Breathing patterns in Speech
Authors:
Vikramjit Mitra,
Anirban Chatterjee,
Ke Zhai,
Helen Weng,
Ayuko Hill,
Nicole Hay,
Christopher Webb,
Jamie Cheng,
Erdrin Azemi
Abstract:
The process of human speech production involves coordinated respiratory action to elicit acoustic speech signals. Typically, speech is produced when air is forced from the lungs and is modulated by the vocal tract, where such actions are interspersed by moments of breathing in air (inhalation) to refill the lungs again. Respiratory rate (RR) is a vital metric that is used to assess the overall hea…
▽ More
The process of human speech production involves coordinated respiratory action to elicit acoustic speech signals. Typically, speech is produced when air is forced from the lungs and is modulated by the vocal tract, where such actions are interspersed by moments of breathing in air (inhalation) to refill the lungs again. Respiratory rate (RR) is a vital metric that is used to assess the overall health, fitness, and general well-being of an individual. Existing approaches to measure RR (number of breaths one takes in a minute) are performed using specialized equipment or training. Studies have demonstrated that machine learning algorithms can be used to estimate RR using bio-sensor signals as input. Speech-based estimation of RR can offer an effective approach to measure the vital metric without requiring any specialized equipment or sensors. This work investigates a machine learning based approach to estimate RR from speech segments obtained from subjects speaking to a close-talking microphone device. Data were collected from N=26 individuals, where the groundtruth RR was obtained through commercial grade chest-belts and then manually corrected for any errors. A convolutional long-short term memory network (Conv-LSTM) is proposed to estimate respiration time-series data from the speech signal. We demonstrate that the use of pre-trained representations obtained from a foundation model, such as Wav2Vec2, can be used to estimate respiration-time-series with low root-mean-squared error and high correlation coefficient, when compared with the baseline. The model-driven time series can be used to estimate $RR$ with a low mean absolute error (MAE) ~ 1.6 breaths/min.
△ Less
Submitted 17 July, 2024;
originally announced July 2024.
-
Learning the Influence Graph of a High-Dimensional Markov Process with Memory
Authors:
Smita Bagewadi,
Avhishek Chatterjee
Abstract:
Motivated by multiple applications in social networks, nervous systems, and financial risk analysis, we consider the problem of learning the underlying (directed) influence graph or causal graph of a high-dimensional multivariate discrete-time Markov process with memory. At any discrete time instant, each observed variable of the multivariate process is a binary string of random length, which is p…
▽ More
Motivated by multiple applications in social networks, nervous systems, and financial risk analysis, we consider the problem of learning the underlying (directed) influence graph or causal graph of a high-dimensional multivariate discrete-time Markov process with memory. At any discrete time instant, each observed variable of the multivariate process is a binary string of random length, which is parameterized by an unobservable or hidden [0,1]-valued scalar. The hidden scalars corresponding to the variables evolve according to discrete-time linear stochastic dynamics dictated by the underlying influence graph whose nodes are the variables. We extend an existing algorithm for learning i.i.d. graphical models to this Markovian setting with memory and prove that it can learn the influence graph based on the binary observations using logarithmic (in number of variables or nodes) samples when the degree of the influence graph is bounded. The crucial analytical contribution of this work is the derivation of the sample complexity result by upper and lower bounding the rate of convergence of the observed Markov process with memory to its stationary distribution in terms of the parameters of the influence graph.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Stable compensators in parallel to stabilize arbitrary proper rational SISO plants
Authors:
Abdul Hannan Faruqi,
Anindya Chatterjee
Abstract:
We consider stabilization of linear time-invariant (LTI) and single input single output (SISO) plants in the frequency domain from a fresh perspective. Compensators that are themselves stable are sometimes preferred because they make starting the system easier. Such starting remains easy if there is a stable compensator in parallel with the plant rather than in a feedback loop. In such an arrangem…
▽ More
We consider stabilization of linear time-invariant (LTI) and single input single output (SISO) plants in the frequency domain from a fresh perspective. Compensators that are themselves stable are sometimes preferred because they make starting the system easier. Such starting remains easy if there is a stable compensator in parallel with the plant rather than in a feedback loop. In such an arrangement, we explain why it is possible to stabilize all plants whose transfer functions are proper rational functions of the Laplace variable $s$. In our proposed architecture we have (i) an optional compensator $C_s(s)$ in series with the plant $P(s)$, (ii) a necessary compensator $C_p(s)$ in parallel with $C_s(s)P(s)$, along with (iii) a feedback gain $K$ for the combined new plant $C_s(s)P(s)+C_p(s)$. We show that stabilization with stable $C_s(s)$ and $C_p(s)$ is always possible. In our proposed solution the closed-loop plant is biproper and has all its zeros in the left half plane, so there is a $K_0$ such that the plant is stable for $K>K_0$. We are not aware of prior use of parallel compensators with such a goal. Our proposed architecture works even for plants that are impossible to stabilize with stable compensators in the usual single-loop feedback architecture. Several examples are provided.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
New method for SISO strong stabilization with advantages over existing methods
Authors:
Abdul Hannan Faruqi,
Anindya Chatterjee
Abstract:
We address stabilization of linear time-invariant (LTI), single-input single-output (SISO) systems in the Laplace domain, with a stable controller in a single feedback loop. Such stabilization is called strong. Plants that satisfy a parity interlacing property are known to be strongly stabilizable. Finding such controllers is a well known difficult problem. Existing general methods are based on ei…
▽ More
We address stabilization of linear time-invariant (LTI), single-input single-output (SISO) systems in the Laplace domain, with a stable controller in a single feedback loop. Such stabilization is called strong. Plants that satisfy a parity interlacing property are known to be strongly stabilizable. Finding such controllers is a well known difficult problem. Existing general methods are based on either manual search or a clever use of Nevanlinna-Pick interpolation with polynomials of possibly high integer order. Here we present a new, simple, and general method for strongly stabilizing systems of relative degree less than 3. We call our method Real to Integers (RTI). Our theoretical contributions constitute proposing the functional form used, which involves a product of several terms of the form $\displaystyle \left ( \frac{s+a}{s+b} \right )^m$, showing that real $m$'s will arise whenever the plant is strongly stabilizable, and proving that integer $m$'s can be obtained by continuously varying free parameters (i.e., the $a$'s and $b$'s). Our practical contributions include demonstrating a simple way, based on a trigonometric trick, to adjust the fractional powers until they take reasonable integer values. We include brief but necessary associated discussion to make the paper accessible to a broad audience. We also present ten numerical examples of successful control design with varying levels of difficulty, including plants whose transfer functions have relative degrees of 0, 1 or 2; and with right half plane zeros of multiplicity possibly exceeding one.
△ Less
Submitted 12 February, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
Automated COVID-19 CT Image Classification using Multi-head Channel Attention in Deep CNN
Authors:
Susmita Ghosh,
Abhiroop Chatterjee
Abstract:
The rapid spread of COVID-19 has necessitated efficient and accurate diagnostic methods. Computed Tomography (CT) scan images have emerged as a valuable tool for detecting the disease. In this article, we present a novel deep learning approach for automated COVID-19 CT scan classification where a modified Xception model is proposed which incorporates a newly designed channel attention mechanism an…
▽ More
The rapid spread of COVID-19 has necessitated efficient and accurate diagnostic methods. Computed Tomography (CT) scan images have emerged as a valuable tool for detecting the disease. In this article, we present a novel deep learning approach for automated COVID-19 CT scan classification where a modified Xception model is proposed which incorporates a newly designed channel attention mechanism and weighted global average pooling to enhance feature extraction thereby improving classification accuracy. The channel attention module selectively focuses on informative regions within each channel, enabling the model to learn discriminative features for COVID-19 detection. Experiments on a widely used COVID-19 CT scan dataset demonstrate a very good accuracy of 96.99% and show its superiority to other state-of-the-art techniques. This research can contribute to the ongoing efforts in using artificial intelligence to combat current and future pandemics and can offer promising and timely solutions for efficient medical image analysis tasks.
△ Less
Submitted 12 August, 2023; v1 submitted 31 July, 2023;
originally announced August 2023.
-
T-Fusion Net: A Novel Deep Neural Network Augmented with Multiple Localizations based Spatial Attention Mechanisms for Covid-19 Detection
Authors:
Susmita Ghosh,
Abhiroop Chatterjee
Abstract:
In recent years, deep neural networks are yielding better performance in image classification tasks. However, the increasing complexity of datasets and the demand for improved performance necessitate the exploration of innovative techniques. The present work proposes a new deep neural network (called as, T-Fusion Net) that augments multiple localizations based spatial attention. This attention mec…
▽ More
In recent years, deep neural networks are yielding better performance in image classification tasks. However, the increasing complexity of datasets and the demand for improved performance necessitate the exploration of innovative techniques. The present work proposes a new deep neural network (called as, T-Fusion Net) that augments multiple localizations based spatial attention. This attention mechanism allows the network to focus on relevant image regions, improving its discriminative power. A homogeneous ensemble of the said network is further used to enhance image classification accuracy. For ensembling, the proposed approach considers multiple instances of individual T-Fusion Net. The model incorporates fuzzy max fusion to merge the outputs of individual nets. The fusion process is optimized through a carefully chosen parameter to strike a balance on the contributions of the individual models. Experimental evaluations on benchmark Covid-19 (SARS-CoV-2 CT scan) dataset demonstrate the effectiveness of the proposed T-Fusion Net as well as its ensemble. The proposed T-Fusion Net and the homogeneous ensemble model exhibit better performance, as compared to other state-of-the-art methods, achieving accuracy of 97.59% and 98.4%, respectively.
△ Less
Submitted 31 July, 2023;
originally announced August 2023.
-
Capacity Achieving Codes for an Erasure Queue-Channel
Authors:
Jaswanthi Mandalapu,
Krishna Jagannathan,
Avhishek Chatterjee,
Andrew Thangaraj
Abstract:
We consider a queue-channel model that captures the waiting time-dependent degradation of information bits as they wait to be transmitted. Such a scenario arises naturally in quantum communications, where quantum bits tend to decohere rapidly. Trailing the capacity results obtained recently for certain queue-channels, this paper aims to construct practical channel codes for the erasure queue-chann…
▽ More
We consider a queue-channel model that captures the waiting time-dependent degradation of information bits as they wait to be transmitted. Such a scenario arises naturally in quantum communications, where quantum bits tend to decohere rapidly. Trailing the capacity results obtained recently for certain queue-channels, this paper aims to construct practical channel codes for the erasure queue-channel (EQC) -- a channel characterized by highly correlated erasures, governed by the underlying queuing dynamics. Our main contributions in this paper are twofold: (i) We propose a generic `wrapper' based on interleaving across renewal blocks of the queue to convert any capacity-achieving block code for a memoryless erasure channel to a capacity-achieving code for the EQC. Next, due to the complexity involved in implementing interleaved systems, (ii) we study the performance of LDPC and Polar codes without any interleaving. We show that standard Arıkan's Polar transform polarizes the EQC for certain restricted class of erasure probability functions. We also highlight some possible approaches and the corresponding challenges involved in proving polarization of a general EQC.
△ Less
Submitted 6 May, 2023;
originally announced May 2023.
-
Adaptive Gravity Compensation Control of a Cable-Driven Upper-Arm Soft Exosuit
Authors:
Joyjit Mukherjee,
Ankit Chatterjee,
Shreeshan Jena,
Nitesh Kumar,
Suriya Prakash Muthukrishnan,
Sitikantha Roy,
Shubhendu Bhasin
Abstract:
This paper proposes an adaptive gravity compensation (AGC) control strategy for a cable-driven upper-limb exosuit intended to assist the wearer with lifting tasks. Unlike most model-based control techniques used for this human-robot interaction task, the proposed control design does not assume knowledge of the anthropometric parameters of the wearer's arm and the payload. Instead, the uncertaintie…
▽ More
This paper proposes an adaptive gravity compensation (AGC) control strategy for a cable-driven upper-limb exosuit intended to assist the wearer with lifting tasks. Unlike most model-based control techniques used for this human-robot interaction task, the proposed control design does not assume knowledge of the anthropometric parameters of the wearer's arm and the payload. Instead, the uncertainties in human arm parameters, such as mass, length, and payload, are estimated online using an indirect adaptive control law that compensates for the gravity moment about the elbow joint. Additionally, the AGC controller is agnostic to the desired joint trajectory followed by the human arm. For the purpose of controller design, the human arm is modeled using a 1-DOF manipulator model. Further, a cable-driven actuator model is proposed that maps the assistive elbow torque to the actuator torque. The performance of the proposed method is verified through a co-simulation, wherein the control input realized in MATLAB is applied to the human bio-mechanical model in OpenSim under varying payload conditions. Significant reductions in human effort in terms of human muscle torque and metabolic cost are observed with the proposed control strategy. Further, simulation results show that the performance of the AGC controller converges to that of the gravity compensation (GC) controller, demonstrating the efficacy of AGC-based online parameter learning.
△ Less
Submitted 28 April, 2023;
originally announced April 2023.
-
Balancing a Stick with Eyes Shut: Inverted Pendulum on a Cart without Angle Measurement
Authors:
Bidhayak Goswami,
Anindya Chatterjee
Abstract:
We consider linear time-invariant dynamic systems in the single-input, single-output (SISO) framework. In particular, we consider stabilization of an inverted pendulum on a cart using a force on the cart. This system is easy to stabilize with pendulum angle feedback. However, with cart position feedback it cannot be stabilized with stable and proper compensators. Here we demonstrate that with an a…
▽ More
We consider linear time-invariant dynamic systems in the single-input, single-output (SISO) framework. In particular, we consider stabilization of an inverted pendulum on a cart using a force on the cart. This system is easy to stabilize with pendulum angle feedback. However, with cart position feedback it cannot be stabilized with stable and proper compensators. Here we demonstrate that with an additional compensator in a parallel feedforward loop, stabilization is possible with such compensators. Sensitivity to noise seems to be about 3 times worse than for the situation with angle feedback. For completeness, discussion is presented of compensator parameter choices, robustness, fragility and comparison with another control approach.
△ Less
Submitted 10 January, 2023;
originally announced January 2023.
-
Gaussian Control Barrier Functions : A Non-Parametric Paradigm to Safety
Authors:
Mouhyemen Khan,
Tatsuya Ibuki,
Abhijit Chatterjee
Abstract:
Inspired by the success of control barrier functions (CBFs) in addressing safety, and the rise of data-driven techniques for modeling functions, we propose a non-parametric approach for online synthesis of CBFs using Gaussian Processes (GPs). Mathematical constructs such as CBFs have achieved safety by designing a candidate function a priori. However, designing such a candidate function can be cha…
▽ More
Inspired by the success of control barrier functions (CBFs) in addressing safety, and the rise of data-driven techniques for modeling functions, we propose a non-parametric approach for online synthesis of CBFs using Gaussian Processes (GPs). Mathematical constructs such as CBFs have achieved safety by designing a candidate function a priori. However, designing such a candidate function can be challenging. A practical example of such a setting would be to design a CBF in a disaster recovery scenario where safe and navigable regions need to be determined. The decision boundary for safety in such an example is unknown and cannot be designed a priori. In our approach, we work with safety samples or observations to construct the CBF online by assuming a flexible GP prior on these samples, and term our formulation as a Gaussian CBF. GPs have favorable properties, in addition to being non-parametric, such as analytical tractability and robust uncertainty estimation. This allows realizing the posterior components with high safety guarantees by incorporating variance estimation, while also computing associated partial derivatives in closed-form to achieve safe control. Moreover, the synthesized safety function from our approach allows changing the corresponding safe set arbitrarily based on the data, thus allowing non-convex safe sets. We validate our approach experimentally on a quadrotor by demonstrating safe control for fixed but arbitrary safe sets and collision avoidance where the safe set is constructed online. Finally, we juxtapose Gaussian CBFs with regular CBFs in the presence of noisy states to highlight its flexibility and robustness to noise. The experiment video can be seen at: https://youtu.be/HX6uokvCiGk.
△ Less
Submitted 1 August, 2022; v1 submitted 29 March, 2022;
originally announced March 2022.
-
Recovery of Missing Sensor Data by Reconstructing Time-varying Graph Signals
Authors:
Anindya Mondal,
Mayukhmali Das,
Aditi Chatterjee,
Palaniandavar Venkateswaran
Abstract:
Wireless sensor networks are among the most promising technologies of the current era because of their small size, lower cost, and ease of deployment. With the increasing number of wireless sensors, the probability of generating missing data also rises. This incomplete data could lead to disastrous consequences if used for decision-making. There is rich literature dealing with this problem. Howeve…
▽ More
Wireless sensor networks are among the most promising technologies of the current era because of their small size, lower cost, and ease of deployment. With the increasing number of wireless sensors, the probability of generating missing data also rises. This incomplete data could lead to disastrous consequences if used for decision-making. There is rich literature dealing with this problem. However, most approaches show performance degradation when a sizable amount of data is lost. Inspired by the emerging field of graph signal processing, this paper performs a new study of a Sobolev reconstruction algorithm in wireless sensor networks. Experimental comparisons on several publicly available datasets demonstrate that the algorithm surpasses multiple state-of-the-art techniques by a maximum margin of 54%. We further show that this algorithm consistently retrieves the missing data even during massive data loss situations.
△ Less
Submitted 23 December, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
An Embarrassingly Simple Consistency Regularization Method for Semi-Supervised Medical Image Segmentation
Authors:
Hritam Basak,
Rajarshi Bhattacharya,
Rukhshanda Hussain,
Agniv Chatterjee
Abstract:
The scarcity of pixel-level annotation is a prevalent problem in medical image segmentation tasks. In this paper, we introduce a novel regularization strategy involving interpolation-based mixing for semi-supervised medical image segmentation. The proposed method is a new consistency regularization strategy that encourages segmentation of interpolation of two unlabelled data to be consistent with…
▽ More
The scarcity of pixel-level annotation is a prevalent problem in medical image segmentation tasks. In this paper, we introduce a novel regularization strategy involving interpolation-based mixing for semi-supervised medical image segmentation. The proposed method is a new consistency regularization strategy that encourages segmentation of interpolation of two unlabelled data to be consistent with the interpolation of segmentation maps of those data. This method represents a specific type of data-adaptive regularization paradigm which aids to minimize the overfitting of labelled data under high confidence values. The proposed method is advantageous over adversarial and generative models as it requires no additional computation. Upon evaluation on two publicly available MRI datasets: ACDC and MMWHS, experimental results demonstrate the superiority of the proposed method in comparison to existing semi-supervised models. Code is available at: https://github.com/hritam-98/ICT-MedSeg
△ Less
Submitted 3 February, 2022; v1 submitted 1 February, 2022;
originally announced February 2022.
-
Transparency of Deep Neural Networks for Medical Image Analysis: A Review of Interpretability Methods
Authors:
Zohaib Salahuddin,
Henry C Woodruff,
Avishek Chatterjee,
Philippe Lambin
Abstract:
Artificial Intelligence has emerged as a useful aid in numerous clinical applications for diagnosis and treatment decisions. Deep neural networks have shown same or better performance than clinicians in many tasks owing to the rapid increase in the available data and computational power. In order to conform to the principles of trustworthy AI, it is essential that the AI system be transparent, rob…
▽ More
Artificial Intelligence has emerged as a useful aid in numerous clinical applications for diagnosis and treatment decisions. Deep neural networks have shown same or better performance than clinicians in many tasks owing to the rapid increase in the available data and computational power. In order to conform to the principles of trustworthy AI, it is essential that the AI system be transparent, robust, fair and ensure accountability. Current deep neural solutions are referred to as black-boxes due to a lack of understanding of the specifics concerning the decision making process. Therefore, there is a need to ensure interpretability of deep neural networks before they can be incorporated in the routine clinical workflow. In this narrative review, we utilized systematic keyword searches and domain expertise to identify nine different types of interpretability methods that have been used for understanding deep learning models for medical image analysis applications based on the type of generated explanations and technical similarities. Furthermore, we report the progress made towards evaluating the explanations produced by various interpretability methods. Finally we discuss limitations, provide guidelines for using interpretability methods and future directions concerning the interpretability of deep neural networks for medical imaging analysis.
△ Less
Submitted 31 October, 2021;
originally announced November 2021.
-
Algorithm To Calculate Pulse from PPG Signal After Eliminating Touch Errors from the Fingertip Video Captured by Smartphone Camera
Authors:
Ayan Chatterjee,
Sundar Gopalakrishnan,
Martin Gerdes,
Santiago Martinez,
Nibedita Pahari,
Pankaj Khatiwada
Abstract:
With the ongoing heart problems of the population worldwide, the medical requirements of the people are expected to increase. Electrocardiogram (ECG) is one of the proven to capture the heart response signal to assess the electrical and muscular functions of the heart. The ECG setup is expensive and needs proper training, and of course, it is not instant. For fast, accurate heart parameter monitor…
▽ More
With the ongoing heart problems of the population worldwide, the medical requirements of the people are expected to increase. Electrocardiogram (ECG) is one of the proven to capture the heart response signal to assess the electrical and muscular functions of the heart. The ECG setup is expensive and needs proper training, and of course, it is not instant. For fast, accurate heart parameter monitoring, scientists pay attention to the photoplethysmogram signal (PPG), based on the light intensity of a particular wavelength. Android smartphone with a good quality camera has come to ordinary people's reach and has become one of the most necessary and rugged devices for today and future generations. We can use its powerful features to solve or assess heart state monitoring by capturing the image's necessary data. The mobile camera has a photo emitting diode and a photodetector. The light source illuminates the tissue. The photodetector calculates the small variation in light intensity associated with blood volume change in the vessels (mainly fingertips, toes, and ears). We have captured unfocused contact video to capture PPG using an Android Smartphone. Then, we removed a certain percent of camera touch errors based on average pixel intensity count in the red plane, and it is a new approach that has been introduced in this research. We used a 2nd order Butterworth (IIR) band pass filter for noise removal, FFT Hann Window for frequency analysis and leakage reduction. We have developed an algorithm using MATLAB as a development platform, for accurate pulse (BPM) measurement. Moreover, we have done a comparative analysis of developed algorithm with other available algorithms for PPG-based pulse calculation. In this study, the fingertip video was captured when the body was at rest
△ Less
Submitted 6 September, 2021; v1 submitted 30 November, 2020;
originally announced December 2020.
-
Automated Human Activity Recognition by Colliding Bodies Optimization-based Optimal Feature Selection with Recurrent Neural Network
Authors:
Pankaj Khatiwada,
Ayan Chatterjee,
Matrika Subedi
Abstract:
In smart healthcare, Human Activity Recognition (HAR) is considered to be an efficient model in pervasive computation from sensor readings. The Ambient Assisted Living (AAL) in the home or community helps the people in providing independent care and enhanced living quality. However, many AAL models were restricted using many factors that include computational cost and system complexity. Moreover,…
▽ More
In smart healthcare, Human Activity Recognition (HAR) is considered to be an efficient model in pervasive computation from sensor readings. The Ambient Assisted Living (AAL) in the home or community helps the people in providing independent care and enhanced living quality. However, many AAL models were restricted using many factors that include computational cost and system complexity. Moreover, the HAR concept has more relevance because of its applications. Hence, this paper tempts to implement the HAR system using deep learning with the data collected from smart sensors that are publicly available in the UC Irvine Machine Learning Repository (UCI). The proposed model involves three processes: (1) Data collection, (b) Optimal feature selection, (c) Recognition. The data gathered from the benchmark repository is initially subjected to optimal feature selection that helps to select the most significant features. The proposed optimal feature selection is based on a new meta-heuristic algorithm called Colliding Bodies Optimization (CBO). An objective function derived by the recognition accuracy is used for accomplishing the optimal feature selection. Here, the deep learning model called Recurrent Neural Network (RNN) is used for activity recognition. The proposed model on the concerned benchmark dataset outperforms existing learning methods, providing high performance compared to the conventional models.
△ Less
Submitted 19 November, 2021; v1 submitted 7 October, 2020;
originally announced October 2020.
-
Non Orthogonal Multiple Access with Orthogonal Time Frequency Space Signal Transmission
Authors:
Aritra Chatterjee,
Vivek Rangamgari,
Shashank Tiwari,
Suvra Sekhar Das
Abstract:
Orthogonal time frequency space (OTFS) is being pursued in recent times as a suitable wireless transmission technology for use in high mobility scenarios. In this work, we propose nonorthogonal multiple acess (NOMA) based OTFS which may be called NOMA-OTFS system and evaluate its performance from system level and link level perspective. The challenge lies in the fact that while OTFS transmission t…
▽ More
Orthogonal time frequency space (OTFS) is being pursued in recent times as a suitable wireless transmission technology for use in high mobility scenarios. In this work, we propose nonorthogonal multiple acess (NOMA) based OTFS which may be called NOMA-OTFS system and evaluate its performance from system level and link level perspective. The challenge lies in the fact that while OTFS transmission technology is known for its resilience to high mobility conditions, while NOMA is known to yield high spectral efficiency in low mobility scenarios in comparison to orthogonal multiple access (OMA). We present a minimum mean square error (MMSE)- successive interference cancellation (SIC) based receiver for NOMA-OTFS, for which we derive expression for symbol-wise post-processing SINR in order to evaluate system sum spectral efficiency (SE). We develop power allocation schemes to maximize the sum SE in the high-mobility version of NOMA. We further design a realizable codeword level SIC (CWIC) receiver using LDPC codes along with MMSE equalization for evaluating link level performance of such practical NOMA-OTFS system. The system level and link level performance of the proposed NOMA-OTFS system are compared against benchmark OMA-OTFS, OMA-orthogonal frequency division multiplexing (OMA-OFDM) and NOMA-OFDM schemes. From system-level performance evaluation, we observe interestingly that NOMA-OTFS provides higher sum SE than OMA-OTFS. When compared to NOMA-OFDM, we find that outage SE of NOMA-OTFS is improved at the cost of decrease in mean SE. Whereas link-level results show that the developed CWIC based NOMA-OTFS receiver performs significantly better than NOMA-OFDM in terms of block error rate (BLER), goodput and throughput.
△ Less
Submitted 31 May, 2020; v1 submitted 13 March, 2020;
originally announced March 2020.
-
Multi-Sparse Gaussian Process: Learning based Semi-Parametric Control
Authors:
Mouhyemen Khan,
Akash Patel,
Abhijit Chatterjee
Abstract:
A key challenge with controlling complex dynamical systems is to accurately model them. However, this requirement is very hard to satisfy in practice. Data-driven approaches such as Gaussian processes (GPs) have proved quite effective by employing regression based methods to capture the unmodeled dynamical effects. However, GPs scale cubically with data, and is often a challenge to perform real-ti…
▽ More
A key challenge with controlling complex dynamical systems is to accurately model them. However, this requirement is very hard to satisfy in practice. Data-driven approaches such as Gaussian processes (GPs) have proved quite effective by employing regression based methods to capture the unmodeled dynamical effects. However, GPs scale cubically with data, and is often a challenge to perform real-time regression. In this paper, we propose a semi-parametric framework exploiting sparsity for learning-based control. We combine the parametric model of the system with multiple sparse GP models to capture any unmodeled dynamics. Multi-Sparse Gaussian Process (MSGP) divides the original dataset into multiple sparse models with unique hyperparameters for each model. Thereby, preserving the richness and uniqueness of each sparse model. For a query point, a weighted sparse posterior prediction is performed based on $N$ neighboring sparse models. Hence, the prediction complexity is significantly reduced from $\mathcal{O}(n^3)$ to $\mathcal{O}(Npu^2)$, where $p$ and $u$ are data points and pseudo-inputs respectively for each sparse model. We validate MSGP's learning performance for a quadrotor using a geometric controller in simulation. Comparison with GP, sparse GP, and local GP shows that MSGP has higher prediction accuracy than sparse and local GP, while significantly lower time complexity than all three. We also validate MSGP on a hardware quadrotor for unmodeled mass, inertia, and disturbances. The experiment video can be seen at: https://youtu.be/zUk1ISux6ao
△ Less
Submitted 3 March, 2020;
originally announced March 2020.
-
Semi-Bagging Based Deep Neural Architecture to Extract Text from High Entropy Images
Authors:
Pranay Dugar,
Anirban Chatterjee,
Rajesh Shreedhar Bhat,
Saswata Sahoo
Abstract:
Extracting texts of various size and shape from images containing multiple objects is an important problem in many contexts, especially, in connection to e-commerce, augmented reality assistance system in natural scene, etc. The existing works (based on only CNN) often perform sub-optimally when the image contains regions of high entropy having multiple objects. This paper presents an end-to-end t…
▽ More
Extracting texts of various size and shape from images containing multiple objects is an important problem in many contexts, especially, in connection to e-commerce, augmented reality assistance system in natural scene, etc. The existing works (based on only CNN) often perform sub-optimally when the image contains regions of high entropy having multiple objects. This paper presents an end-to-end text detection strategy combining a segmentation algorithm and an ensemble of multiple text detectors of different types to detect text in every individual image segments independently. The proposed strategy involves a super-pixel based image segmenter which splits an image into multiple regions. A convolutional deep neural architecture is developed which works on each of the segments and detects texts of multiple shapes, sizes, and structures. It outperforms the competing methods in terms of coverage in detecting texts in images especially the ones where the text of various types and sizes are compacted in a small region along with various other objects. Furthermore, the proposed text detection method along with a text recognizer outperforms the existing state-of-the-art approaches in extracting text from high entropy images. We validate the results on a dataset consisting of product images on an e-commerce website.
△ Less
Submitted 2 July, 2019;
originally announced July 2019.
-
Barrier Functions in Cascaded Controller: Safe Quadrotor Control
Authors:
Mouhyemen Khan,
Munzir Zafar,
Abhijit Chatterjee
Abstract:
Safe control for inherently unstable systems such as quadrotors is crucial. Imposing multiple dynamic constraints simultaneously on the states for safety regulation can be a challenging problem. In this paper, we propose a quadratic programming (QP) based approach on a cascaded control architecture for quadrotors to enforce safety. Safety regions are constructed using control barrier functions (CB…
▽ More
Safe control for inherently unstable systems such as quadrotors is crucial. Imposing multiple dynamic constraints simultaneously on the states for safety regulation can be a challenging problem. In this paper, we propose a quadratic programming (QP) based approach on a cascaded control architecture for quadrotors to enforce safety. Safety regions are constructed using control barrier functions (CBF) while explicitly considering the nonlinear underactuated dynamics of the quadrotor. The safety regions constructed using CBFs establish a non-conservative forward invariant safe region for quadrotor navigation. Barriers imposed across the cascaded architecture allows independent safety regulation in quadrotor's altitude and lateral domains. Despite barriers appearing in a cascaded fashion, we show preservation of safety for quadrotor motion in SE(3). We demonstrate the feasibility of our method on a quadrotor in simulation with static and dynamic constraints enforced on position and velocity spaces simultaneously.
△ Less
Submitted 17 February, 2020; v1 submitted 22 March, 2019;
originally announced March 2019.
-
Generalized Opinion Dynamics from Local Optimization Rules
Authors:
Avhishek Chatterjee,
Anand D. Sarwate,
Sriram Vishwanath
Abstract:
We study generalizations of the Hegselmann-Krause (HK) model for opinion dynamics, incorporating features and parameters that are natural components of observed social systems. The first generalization is one where the strength of influence depends on the distance of the agents' opinions. Under this setup, we identify conditions under which the opinions converge in finite time, and provide a quali…
▽ More
We study generalizations of the Hegselmann-Krause (HK) model for opinion dynamics, incorporating features and parameters that are natural components of observed social systems. The first generalization is one where the strength of influence depends on the distance of the agents' opinions. Under this setup, we identify conditions under which the opinions converge in finite time, and provide a qualitative characterization of the equilibrium. We interpret the HK model opinion update rule as a quadratic cost-minimization rule. This enables a second generalization: a family of update rules which possess different equilibrium properties. Subsequently, we investigate models in which a external force can behave strategically to modulate/influence user updates. We consider cases where this external force can introduce additional agents and cases where they can modify the cost structures for other agents. We describe and analyze some strategies through which such modulation may be possible in an order-optimal manner. Our simulations demonstrate that generalized dynamics differ qualitatively and quantitatively from traditional HK dynamics.
△ Less
Submitted 25 September, 2014;
originally announced September 2014.
-
Two Timescale Convergent Q-learning for Sleep--Scheduling in Wireless Sensor Networks
Authors:
Prashanth L. A.,
Abhranil Chatterjee,
Shalabh Bhatnagar
Abstract:
In this paper, we consider an intrusion detection application for Wireless Sensor Networks (WSNs). We study the problem of scheduling the sleep times of the individual sensors to maximize the network lifetime while keeping the tracking error to a minimum. We formulate this problem as a partially-observable Markov decision process (POMDP) with continuous state-action spaces, in a manner similar to…
▽ More
In this paper, we consider an intrusion detection application for Wireless Sensor Networks (WSNs). We study the problem of scheduling the sleep times of the individual sensors to maximize the network lifetime while keeping the tracking error to a minimum. We formulate this problem as a partially-observable Markov decision process (POMDP) with continuous state-action spaces, in a manner similar to (Fuemmeler and Veeravalli [2008]). However, unlike their formulation, we consider infinite horizon discounted and average cost objectives as performance criteria. For each criterion, we propose a convergent on-policy Q-learning algorithm that operates on two timescales, while employing function approximation to handle the curse of dimensionality associated with the underlying POMDP. Our proposed algorithm incorporates a policy gradient update using a one-simulation simultaneous perturbation stochastic approximation (SPSA) estimate on the faster timescale, while the Q-value parameter (arising from a linear function approximation for the Q-values) is updated in an on-policy temporal difference (TD) algorithm-like fashion on the slower timescale. The feature selection scheme employed in each of our algorithms manages the energy and tracking components in a manner that assists the search for the optimal sleep-scheduling policy. For the sake of comparison, in both discounted and average settings, we also develop a function approximation analogue of the Q-learning algorithm. This algorithm, unlike the two-timescale variant, does not possess theoretical convergence guarantees. Finally, we also adapt our algorithms to include a stochastic iterative estimation scheme for the intruder's mobility model. Our simulation results on a 2-dimensional network setting suggest that our algorithms result in better tracking accuracy at the cost of only a few additional sensors, in comparison to a recent prior work.
△ Less
Submitted 23 March, 2014; v1 submitted 27 December, 2013;
originally announced December 2013.