-
Planar Herding of Multiple Evaders with a Single Herder
Authors:
Rishabh Kumar Singh,
Debraj Chakraborty
Abstract:
A planar herding problem is considered, where a superior pursuer herds a flock of non-cooperative, inferior evaders around a predefined target point. An inverse square law of repulsion is assumed between the pursuer and each evader. Two classes of pursuer trajectories are proposed: (i) a constant angular-velocity spiral, and (ii) a constant angular-velocity circle, both centered around the target…
▽ More
A planar herding problem is considered, where a superior pursuer herds a flock of non-cooperative, inferior evaders around a predefined target point. An inverse square law of repulsion is assumed between the pursuer and each evader. Two classes of pursuer trajectories are proposed: (i) a constant angular-velocity spiral, and (ii) a constant angular-velocity circle, both centered around the target point. For the spiraling pursuer, the radial velocity is dynamically adjusted based on a feedback law that depends on the instantaneous position of the evader, which is located at the farthest distance from the target at the start of the game. It is shown that, under suitable choices of the model parameters, all the evaders are herded into an arbitrarily small limit cycle around the target point. Meanwhile, the pursuer also converges onto a circular trajectory around the target. The conditions for the stability of these limit cycles are derived. For the circling pursuer, similar guarantees are provided along with explicit formulas for the radii of the limit cycles.
△ Less
Submitted 15 May, 2025;
originally announced May 2025.
-
Explaining Control Policies through Predicate Decision Diagrams
Authors:
Debraj Chakraborty,
Clemens Dubslaff,
Sudeep Kanav,
Jan Kretinsky,
Christoph Weinhuber
Abstract:
Safety-critical controllers of complex systems are hard to construct manually. Automated approaches such as controller synthesis or learning provide a tempting alternative but usually lack explainability. To this end, learning decision trees (DTs) have been prevalently used towards an interpretable model of the generated controllers. However, DTs do not exploit shared decision-making, a key concep…
▽ More
Safety-critical controllers of complex systems are hard to construct manually. Automated approaches such as controller synthesis or learning provide a tempting alternative but usually lack explainability. To this end, learning decision trees (DTs) have been prevalently used towards an interpretable model of the generated controllers. However, DTs do not exploit shared decision-making, a key concept exploited in binary decision diagrams (BDDs) to reduce their size and thus improve explainability. In this work, we introduce predicate decision diagrams (PDDs) that extend BDDs with predicates and thus unite the advantages of DTs and BDDs for controller representation. We establish a synthesis pipeline for efficient construction of PDDs from DTs representing controllers, exploiting reduction techniques for BDDs also for PDDs.
△ Less
Submitted 25 March, 2025; v1 submitted 8 March, 2025;
originally announced March 2025.
-
Explainable Finite-Memory Policies for Partially Observable Markov Decision Processes
Authors:
Muqsit Azeem,
Debraj Chakraborty,
Sudeep Kanav,
Jan Kretinsky
Abstract:
Partially Observable Markov Decision Processes (POMDPs) are a fundamental framework for decision-making under uncertainty and partial observability. Since in general optimal policies may require infinite memory, they are hard to implement and often render most problems undecidable. Consequently, finite-memory policies are mostly considered instead. However, the algorithms for computing them are ty…
▽ More
Partially Observable Markov Decision Processes (POMDPs) are a fundamental framework for decision-making under uncertainty and partial observability. Since in general optimal policies may require infinite memory, they are hard to implement and often render most problems undecidable. Consequently, finite-memory policies are mostly considered instead. However, the algorithms for computing them are typically very complex, and so are the resulting policies. Facing the need for their explainability, we provide a representation of such policies, both (i) in an interpretable formalism and (ii) typically of smaller size, together yielding higher explainability. To that end, we combine models of Mealy machines and decision trees; the latter describing simple, stationary parts of the policies and the former describing how to switch among them. We design a translation for policies of the finite-state-controller (FSC) form from standard literature and show how our method smoothly generalizes to other variants of finite-memory policies. Further, we identify specific properties of recently used "attractor-based" policies, which allow us to construct yet simpler and smaller representations. Finally, we illustrate the higher explainability in a few case studies.
△ Less
Submitted 20 November, 2024;
originally announced November 2024.
-
1-2-3-Go! Policy Synthesis for Parameterized Markov Decision Processes via Decision-Tree Learning and Generalization
Authors:
Muqsit Azeem,
Debraj Chakraborty,
Sudeep Kanav,
Jan Kretinsky,
Mohammadsadegh Mohagheghi,
Stefanie Mohr,
Maximilian Weininger
Abstract:
Despite the advances in probabilistic model checking, the scalability of the verification methods remains limited. In particular, the state space often becomes extremely large when instantiating parameterized Markov decision processes (MDPs) even with moderate values. Synthesizing policies for such \emph{huge} MDPs is beyond the reach of available tools. We propose a learning-based approach to obt…
▽ More
Despite the advances in probabilistic model checking, the scalability of the verification methods remains limited. In particular, the state space often becomes extremely large when instantiating parameterized Markov decision processes (MDPs) even with moderate values. Synthesizing policies for such \emph{huge} MDPs is beyond the reach of available tools. We propose a learning-based approach to obtain a reasonable policy for such huge MDPs.
The idea is to generalize optimal policies obtained by model-checking small instances to larger ones using decision-tree learning. Consequently, our method bypasses the need for explicit state-space exploration of large models, providing a practical solution to the state-space explosion problem. We demonstrate the efficacy of our approach by performing extensive experimentation on the relevant models from the quantitative verification benchmark set. The experimental results indicate that our policies perform well, even when the size of the model is orders of magnitude beyond the reach of state-of-the-art analysis tools.
△ Less
Submitted 1 April, 2025; v1 submitted 23 October, 2024;
originally announced October 2024.
-
Multiscale Color Guided Attention Ensemble Classifier for Age-Related Macular Degeneration using Concurrent Fundus and Optical Coherence Tomography Images
Authors:
Pragya Gupta,
Subhamoy Mandal,
Debashree Guha,
Debjani Chakraborty
Abstract:
Automatic diagnosis techniques have evolved to identify age-related macular degeneration (AMD) by employing single modality Fundus images or optical coherence tomography (OCT). To classify ocular diseases, fundus and OCT images are the most crucial imaging modalities used in the clinical setting. Most deep learning-based techniques are established on a single imaging modality, which contemplates t…
▽ More
Automatic diagnosis techniques have evolved to identify age-related macular degeneration (AMD) by employing single modality Fundus images or optical coherence tomography (OCT). To classify ocular diseases, fundus and OCT images are the most crucial imaging modalities used in the clinical setting. Most deep learning-based techniques are established on a single imaging modality, which contemplates the ocular disorders to a specific extent and disregards other modality that comprises exhaustive information among distinct imaging modalities. This paper proposes a modality-specific multiscale color space embedding integrated with the attention mechanism based on transfer learning for classification (MCGAEc), which can efficiently extract the distinct modality information at various scales using the distinct color spaces. In this work, we first introduce the modality-specific multiscale color space encoder model, which includes diverse feature representations by integrating distinct characteristic color spaces on a multiscale into a unified framework. The extracted features from the prior encoder module are incorporated with the attention mechanism to extract the global features representation, which is integrated with the prior extracted features and transferred to the random forest classifier for the classification of AMD. To analyze the performance of the proposed MCGAEc method, a publicly available multi-modality dataset from Project Macula for AMD is utilized and compared with the existing models.
△ Less
Submitted 1 September, 2024;
originally announced September 2024.
-
Learning from similar systems and online data-driven LQR using iterative randomised data compression
Authors:
Vatsal Kedia,
Sneha Susan George,
Debraj Chakraborty
Abstract:
The problem of data-driven recursive computation of receding horizon LQR control through a randomized combination of online/current and historical/recorded data is considered. It is assumed that large amounts of historical input-output data from a system, which is similar but not identical to the current system under consideration, is available. This (possibly large) data set is compressed through…
▽ More
The problem of data-driven recursive computation of receding horizon LQR control through a randomized combination of online/current and historical/recorded data is considered. It is assumed that large amounts of historical input-output data from a system, which is similar but not identical to the current system under consideration, is available. This (possibly large) data set is compressed through a novel randomized subspace algorithm to directly synthesize an initial solution of the standard LQR problem, which however is sub-optimal due to the inaccuracy of the historical model. The first instance of this input is used to actuate the current system and the corresponding instantaneous output is used to iteratively re-solve the LQR problem through a computationally inexpensive randomized rank-one update of the old compressed data. The first instance of the re-computed input is applied to the system at the next instant, output recorded and the entire procedure is repeated at each subsequent instant. As more current data becomes available, the algorithm learns automatically from the new data while simultaneously controlling the system in near optimal manner. The proposed algorithm is computationally inexpensive due to the initial and repeated compression of old and newly available data. Moreover, the simultaneous learning and control makes this algorithm particularly suited for adapting to unknown, poorly modeled and time-varying systems without any explicit exploration stage. Simulations demonstrate the effectiveness of the proposed algorithm vs popular exploration/exploitation approaches to LQR control.
△ Less
Submitted 22 November, 2023;
originally announced November 2023.
-
The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Authors:
Stefan Uhlich,
Giorgio Fabbro,
Masato Hirano,
Shusuke Takahashi,
Gordon Wichern,
Jonathan Le Roux,
Dipam Chakraborty,
Sharada Mohanty,
Kai Li,
Yi Luo,
Jianwei Yu,
Rongzhi Gu,
Roman Solovyev,
Alexander Stempkovskiy,
Tatiana Habruseva,
Mikhail Sukhovei,
Yuki Mitsufuji
Abstract:
This paper summarizes the cinematic demixing (CDX) track of the Sound Demixing Challenge 2023 (SDX'23). We provide a comprehensive summary of the challenge setup, detailing the structure of the competition and the datasets used. Especially, we detail CDXDB23, a new hidden dataset constructed from real movies that was used to rank the submissions. The paper also offers insights into the most succes…
▽ More
This paper summarizes the cinematic demixing (CDX) track of the Sound Demixing Challenge 2023 (SDX'23). We provide a comprehensive summary of the challenge setup, detailing the structure of the competition and the datasets used. Especially, we detail CDXDB23, a new hidden dataset constructed from real movies that was used to rank the submissions. The paper also offers insights into the most successful approaches employed by participants. Compared to the cocktail-fork baseline, the best-performing system trained exclusively on the simulated Divide and Remaster (DnR) dataset achieved an improvement of 1.8 dB in SDR, whereas the top-performing system on the open leaderboard, where any data could be used for training, saw a significant improvement of 5.7 dB. A significant source of this improvement was making the simulated data better match real cinematic audio, which we further investigate in detail.
△ Less
Submitted 18 April, 2024; v1 submitted 14 August, 2023;
originally announced August 2023.
-
The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track
Authors:
Giorgio Fabbro,
Stefan Uhlich,
Chieh-Hsin Lai,
Woosung Choi,
Marco Martínez-Ramírez,
Weihsiang Liao,
Igor Gadelha,
Geraldo Ramos,
Eddie Hsu,
Hugo Rodrigues,
Fabian-Robert Stöter,
Alexandre Défossez,
Yi Luo,
Jianwei Yu,
Dipam Chakraborty,
Sharada Mohanty,
Roman Solovyev,
Alexander Stempkovskiy,
Tatiana Habruseva,
Nabarun Goswami,
Tatsuya Harada,
Minseok Kim,
Jun Hyung Lee,
Yuanliang Dong,
Xinran Zhang
, et al. (2 additional authors not shown)
Abstract:
This paper summarizes the music demixing (MDX) track of the Sound Demixing Challenge (SDX'23). We provide a summary of the challenge setup and introduce the task of robust music source separation (MSS), i.e., training MSS models in the presence of errors in the training data. We propose a formalization of the errors that can occur in the design of a training dataset for MSS systems and introduce t…
▽ More
This paper summarizes the music demixing (MDX) track of the Sound Demixing Challenge (SDX'23). We provide a summary of the challenge setup and introduce the task of robust music source separation (MSS), i.e., training MSS models in the presence of errors in the training data. We propose a formalization of the errors that can occur in the design of a training dataset for MSS systems and introduce two new datasets that simulate such errors: SDXDB23_LabelNoise and SDXDB23_Bleeding. We describe the methods that achieved the highest scores in the competition. Moreover, we present a direct comparison with the previous edition of the challenge (the Music Demixing Challenge 2021): the best performing system achieved an improvement of over 1.6dB in signal-to-distortion ratio over the winner of the previous competition, when evaluated on MDXDB21. Besides relying on the signal-to-distortion ratio as objective metric, we also performed a listening test with renowned producers and musicians to study the perceptual quality of the systems and report here the results. Finally, we provide our insights into the organization of the competition and our prospects for future editions.
△ Less
Submitted 19 April, 2024; v1 submitted 14 August, 2023;
originally announced August 2023.
-
Screening Mammography Breast Cancer Detection
Authors:
Debajyoti Chakraborty
Abstract:
Breast cancer is a leading cause of cancer-related deaths, but current programs are expensive and prone to false positives, leading to unnecessary follow-up and patient anxiety. This paper proposes a solution to automated breast cancer detection, to improve the efficiency and accuracy of screening programs. Different methodologies were tested against the RSNA dataset of radiographic breast images…
▽ More
Breast cancer is a leading cause of cancer-related deaths, but current programs are expensive and prone to false positives, leading to unnecessary follow-up and patient anxiety. This paper proposes a solution to automated breast cancer detection, to improve the efficiency and accuracy of screening programs. Different methodologies were tested against the RSNA dataset of radiographic breast images of roughly 20,000 female patients and yielded an average validation case pF1 score of 0.56 across methods.
△ Less
Submitted 20 July, 2023;
originally announced July 2023.
-
Asynchronous Distributed Consensus with Minimum Communication
Authors:
Vishal Sawant,
Debraj Chakraborty,
Debasattam Pal
Abstract:
In this paper, the communication effort required in a multi-agent system (MAS) is minimized via an explicit optimization formulation. The paper considers a MAS of single-integrator agents with bounded inputs and a time-invariant communication graph. A new model of discrete asynchronous communication and a distributed consensus protocol based on it, are proposed. The goal of the proposed protocol i…
▽ More
In this paper, the communication effort required in a multi-agent system (MAS) is minimized via an explicit optimization formulation. The paper considers a MAS of single-integrator agents with bounded inputs and a time-invariant communication graph. A new model of discrete asynchronous communication and a distributed consensus protocol based on it, are proposed. The goal of the proposed protocol is to minimize the aggregate number of communication instants of all agents, required to steer the state trajectories inside a pres-specified bounded neighbourhood within a pre-specified time. Due to information structure imposed by the underlying communication graph, an individual agent does not know the global parameters in the MAS, which are required for the above-mentioned minimization. To counter this uncertainty, the worst-case realizations of the global parameters are considered, which lead to min-max type optimizations. The control rules in the proposed protocol are obtained as the closed form solutions of these optimization problems. Hence, the proposed protocol does not increase the burden of run-time computation making it suitable for time-critical applications.
△ Less
Submitted 3 May, 2023;
originally announced May 2023.
-
Fast Randomized Subspace System Identification for Large I/O Data
Authors:
Vatsal Kedia,
Debraj Chakraborty
Abstract:
In this article, a novel fast randomized subspace system identification method for estimating combined deterministic-stochastic LTI state-space models, is proposed. The algorithm is especially well-suited to identify high-order and multi-scale systems with both fast and slow dynamics, which typically require a large number of input-output data samples for accurate identification using traditional…
▽ More
In this article, a novel fast randomized subspace system identification method for estimating combined deterministic-stochastic LTI state-space models, is proposed. The algorithm is especially well-suited to identify high-order and multi-scale systems with both fast and slow dynamics, which typically require a large number of input-output data samples for accurate identification using traditional subspace methods. Instead of working with such large matrices, the dataset is compressed using randomized methods, which preserve the range-spaces of these matrices almost surely. A novel identification algorithm using this compressed dataset, is proposed. This method enables the handling of extremely large datasets, which often make conventional algorithms like N4SID, MOESP, etc. run out of computer memory. Moreover the proposed method outperforms these algorithms in terms of memory-cost, data-movement, flop count and computation time for cases where these algorithms still work in-spite of large data sizes. The effectiveness of the proposed algorithm is established by theoretical analysis and various real and simulated case studies.
△ Less
Submitted 11 December, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
Computational Solar Energy -- Ensemble Learning Methods for Prediction of Solar Power Generation based on Meteorological Parameters in Eastern India
Authors:
Debojyoti Chakraborty,
Jayeeta Mondal,
Hrishav Bakul Barua,
Ankur Bhattacharjee
Abstract:
The challenges in applications of solar energy lies in its intermittency and dependency on meteorological parameters such as; solar radiation, ambient temperature, rainfall, wind-speed etc., and many other physical parameters like dust accumulation etc. Hence, it is important to estimate the amount of solar photovoltaic (PV) power generation for a specific geographical location. Machine learning (…
▽ More
The challenges in applications of solar energy lies in its intermittency and dependency on meteorological parameters such as; solar radiation, ambient temperature, rainfall, wind-speed etc., and many other physical parameters like dust accumulation etc. Hence, it is important to estimate the amount of solar photovoltaic (PV) power generation for a specific geographical location. Machine learning (ML) models have gained importance and are widely used for prediction of solar power plant performance. In this paper, the impact of weather parameters on solar PV power generation is estimated by several Ensemble ML (EML) models like Bagging, Boosting, Stacking, and Voting for the first time. The performance of chosen ML algorithms is validated by field dataset of a 10kWp solar PV power plant in Eastern India region. Furthermore, a complete test-bed framework has been designed for data mining as well as to select appropriate learning models. It also supports feature selection and reduction for dataset to reduce space and time complexity of the learning models. The results demonstrate greater prediction accuracy of around 96% for Stacking and Voting EML models. The proposed work is a generalized one and can be very useful for predicting the performance of large-scale solar PV power plants also.
△ Less
Submitted 21 January, 2023;
originally announced January 2023.
-
A time-optimal feedback control for a particular case of the game of two cars
Authors:
Aditya Chaudhari,
Debraj Chakraborty
Abstract:
In this paper, a time-optimal feedback solution to the game of two cars, for the case where the pursuer is faster and more agile than the evader, is presented. The concept of continuous subsets of the reachable set is introduced to characterize the time-optimal pursuit-evasion game under feedback strategies. Using these subsets it is shown that, if initially the pursuer is distant enough from the…
▽ More
In this paper, a time-optimal feedback solution to the game of two cars, for the case where the pursuer is faster and more agile than the evader, is presented. The concept of continuous subsets of the reachable set is introduced to characterize the time-optimal pursuit-evasion game under feedback strategies. Using these subsets it is shown that, if initially the pursuer is distant enough from the evader, then the feedback saddle point strategies for both the pursuer and the evader are coincident with one of the common tangents from the minimum radius turning circles of the pursuer to the minimum radius turning circles of the evader. Using geometry, four feasible tangents are identified and the feedback min-max strategy for the pursuer and the max-min strategy for the evader are derived by solving a $2 \times 2$ matrix game at each instant. Insignificant computational effort is involved in evaluating the pursuer and evader inputs using the proposed feedback control law and hence it is suitable for real-time implementation. The proposed law is validated further by comparing the resulting trajectories with those obtained by solving the differential game using numerical techniques.
△ Less
Submitted 30 May, 2021; v1 submitted 6 January, 2020;
originally announced January 2020.
-
Nonparallel Emotional Speech Conversion
Authors:
Jian Gao,
Deep Chakraborty,
Hamidou Tembine,
Olaitan Olaleye
Abstract:
We propose a nonparallel data-driven emotional speech conversion method. It enables the transfer of emotion-related characteristics of a speech signal while preserving the speaker's identity and linguistic content. Most existing approaches require parallel data and time alignment, which is not available in most real applications. We achieve nonparallel training based on an unsupervised style trans…
▽ More
We propose a nonparallel data-driven emotional speech conversion method. It enables the transfer of emotion-related characteristics of a speech signal while preserving the speaker's identity and linguistic content. Most existing approaches require parallel data and time alignment, which is not available in most real applications. We achieve nonparallel training based on an unsupervised style transfer technique, which learns a translation model between two distributions instead of a deterministic one-to-one mapping between paired examples. The conversion model consists of an encoder and a decoder for each emotion domain. We assume that the speech signal can be decomposed into an emotion-invariant content code and an emotion-related style code in latent space. Emotion conversion is performed by extracting and recombining the content code of the source speech and the style code of the target emotion. We tested our method on a nonparallel corpora with four emotions. Both subjective and objective evaluations show the effectiveness of our approach.
△ Less
Submitted 13 April, 2020; v1 submitted 3 November, 2018;
originally announced November 2018.
-
Stability and Equilibrium Analysis of Laneless Traffic with Local Control Laws
Authors:
Rakesh U. Chavan,
Debraj Chakraborty,
D. Manjunath
Abstract:
In this paper, a new model for traffic on roads with multiple lanes is developed, where the vehicles do not adhere to a lane discipline. Assuming identical vehicles, the dynamics is split along two independent directions: the Y-axis representing the direction of motion and the X-axis representing the lateral or the direction perpendicular to the direction of motion. Different influence graphs are…
▽ More
In this paper, a new model for traffic on roads with multiple lanes is developed, where the vehicles do not adhere to a lane discipline. Assuming identical vehicles, the dynamics is split along two independent directions: the Y-axis representing the direction of motion and the X-axis representing the lateral or the direction perpendicular to the direction of motion. Different influence graphs are used to model the interaction between the vehicles in these two directions. The instantaneous accelerations of each car, in both X and Y directions, are functions of the measurements from the neighbouring cars according to these influence graphs. The stability and equilibrium spacings of the car formation is analyzed for usual traffic situations such as steady flow, obstacles, lane changing and rogue drivers arbitrarily changing positions inside the formation. Conditions are derived under which the formation maintains stability and the desired intercar spacing for each of these traffic events. Simulations for some of these scenarios are included.
△ Less
Submitted 30 July, 2015;
originally announced July 2015.
-
Formation control with pole placement for multi-agent systems
Authors:
Ameer K. Mulla,
Rachel K. Kalaimani,
Debraj Chakraborty,
Madhu N. Belur
Abstract:
The problem of distributed controller synthesis for formation control of multi-agent systems is considered. The agents (single integrators) communicate over a communication graph and a decentralized linear feedback structure is assumed. One of the agents is designated as the leader. If the communication graph contains a directed spanning tree with the leader node as the root, then it is possible t…
▽ More
The problem of distributed controller synthesis for formation control of multi-agent systems is considered. The agents (single integrators) communicate over a communication graph and a decentralized linear feedback structure is assumed. One of the agents is designated as the leader. If the communication graph contains a directed spanning tree with the leader node as the root, then it is possible to place the poles of the ensemble system with purely local feedback controller gains. Given a desired formation, first one of the poles is placed at the origin. Then it is shown that the inter-agent weights can be independently adjusted to assign an eigenvector corresponding to the formation positions, to the zero eigenvalue. Then, only the leader input is enough to bring the agents to the desired formation and keep it there with no further inputs. Moreover, given a formation, the computation of the inter-agent weights that encode the formation information, can be calculated in a decentralized fashion using only local information.
△ Less
Submitted 16 May, 2013;
originally announced May 2013.