Skip to main content

Showing 1–28 of 28 results for author: Baranwal, M

.
  1. arXiv:2502.16079  [pdf, other

    cs.RO cs.AI cs.LG cs.MA eess.SY

    Together We Rise: Optimizing Real-Time Multi-Robot Task Allocation using Coordinated Heterogeneous Plays

    Authors: Aritra Pal, Anandsingh Chauhan, Mayank Baranwal

    Abstract: Efficient task allocation among multiple robots is crucial for optimizing productivity in modern warehouses, particularly in response to the increasing demands of online order fulfillment. This paper addresses the real-time multi-robot task allocation (MRTA) problem in dynamic warehouse environments, where tasks emerge with specified start and end locations. The objective is to minimize both the t… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: Accepted to AAMAS 2025 (AAAI Track)

  2. arXiv:2502.04864  [pdf, other

    cs.MA cs.AI cs.LG cs.RO

    $TAR^2$: Temporal-Agent Reward Redistribution for Optimal Policy Preservation in Multi-Agent Reinforcement Learning

    Authors: Aditya Kapoor, Kale-ab Tessera, Mayank Baranwal, Harshad Khadilkar, Stefano Albrecht, Mingfei Sun

    Abstract: In cooperative multi-agent reinforcement learning (MARL), learning effective policies is challenging when global rewards are sparse and delayed. This difficulty arises from the need to assign credit across both agents and time steps, a problem that existing methods often fail to address in episodic, long-horizon tasks. We propose Temporal-Agent Reward Redistribution $TAR^2$, a novel approach that… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: 23 pages, 5 figures, 4 tables

  3. arXiv:2412.14779  [pdf, other

    cs.MA cs.AI cs.GT cs.LG cs.RO

    Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning

    Authors: Aditya Kapoor, Sushant Swamy, Kale-ab Tessera, Mayank Baranwal, Mingfei Sun, Harshad Khadilkar, Stefano V. Albrecht

    Abstract: In multi-agent environments, agents often struggle to learn optimal policies due to sparse or delayed global rewards, particularly in long-horizon tasks where it is challenging to evaluate actions at intermediate time steps. We introduce Temporal-Agent Reward Redistribution (TAR$^2$), a novel approach designed to address the agent-temporal credit assignment problem by redistributing sparse rewards… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: 12 pages, 1 figure

  4. arXiv:2409.19279  [pdf, other

    math.OC cs.AI cs.LG eess.SY math.DS

    Distributed Optimization via Energy Conservation Laws in Dilated Coordinates

    Authors: Mayank Baranwal, Kushal Chakrabarti

    Abstract: Optimizing problems in a distributed manner is critical for systems involving multiple agents with private data. Despite substantial interest, a unified method for analyzing the convergence rates of distributed optimization algorithms is lacking. This paper introduces an energy conservation approach for analyzing continuous-time dynamical systems in dilated coordinates. Instead of directly analyzi… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

    Comments: 10 pages; (Near) optimal convergence rate

  5. arXiv:2407.12629  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    A Methodology Establishing Linear Convergence of Adaptive Gradient Methods under PL Inequality

    Authors: Kushal Chakrabarti, Mayank Baranwal

    Abstract: Adaptive gradient-descent optimizers are the standard choice for training neural network models. Despite their faster convergence than gradient-descent and remarkable performance in practice, the adaptive optimizers are not as well understood as vanilla gradient-descent. A reason is that the dynamic update of the learning rate that helps in faster convergence of these methods also makes their anal… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Accepted for publication at the main track of 27th European Conference on Artificial Intelligence (ECAI-2024)

  6. arXiv:2407.10090  [pdf, other

    physics.chem-ph cs.AI cs.LG

    ReactAIvate: A Deep Learning Approach to Predicting Reaction Mechanisms and Unmasking Reactivity Hotspots

    Authors: Ajnabiul Hoque, Manajit Das, Mayank Baranwal, Raghavan B. Sunoj

    Abstract: A chemical reaction mechanism (CRM) is a sequence of molecular-level events involving bond-breaking/forming processes, generating transient intermediates along the reaction pathway as reactants transform into products. Understanding such mechanisms is crucial for designing and discovering new reactions. One of the currently available methods to probe CRMs is quantum mechanical (QM) computations. T… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: Accepted to 27th ECAI main track

  7. arXiv:2310.00419  [pdf, other

    math.OC cs.LG eess.SY stat.ML

    On Linear Convergence of PI Consensus Algorithm under the Restricted Secant Inequality

    Authors: Kushal Chakrabarti, Mayank Baranwal

    Abstract: This paper considers solving distributed optimization problems in peer-to-peer multi-agent networks. The network is synchronous and connected. By using the proportional-integral (PI) control strategy, various algorithms with fixed stepsize have been developed. Two notable among them are the PI algorithm and the PI consensus algorithm. Although the PI algorithm has provable linear or exponential co… ▽ More

    Submitted 28 October, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: Accepted for publication at the 2024 Tenth Indian Control Conference (ICC-10)

  8. arXiv:2212.03765  [pdf, other

    cs.LG cs.AI eess.SY math.OC stat.ML

    Generalized Gradient Flows with Provable Fixed-Time Convergence and Fast Evasion of Non-Degenerate Saddle Points

    Authors: Mayank Baranwal, Param Budhraja, Vishal Raj, Ashish R. Hota

    Abstract: Gradient-based first-order convex optimization algorithms find widespread applicability in a variety of domains, including machine learning tasks. Motivated by the recent advances in fixed-time stability theory of continuous-time dynamical systems, we introduce a generalized framework for designing accelerated optimization algorithms with strongest convergence guarantees that further extend to a s… ▽ More

    Submitted 22 October, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: Accepted to Transactions on Automatic Control (TAC)

  9. arXiv:2212.02397  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    PowRL: A Reinforcement Learning Framework for Robust Management of Power Networks

    Authors: Anandsingh Chauhan, Mayank Baranwal, Ansuma Basumatary

    Abstract: Power grids, across the world, play an important societal and economical role by providing uninterrupted, reliable and transient-free power to several industries, businesses and household consumers. With the advent of renewable power resources and EVs resulting into uncertain generation and highly dynamic load demands, it has become ever so important to ensure robust operation of power networks th… ▽ More

    Submitted 20 April, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted at the 37th AAAI Conference on Artificial Intelligence

  10. arXiv:2207.12845  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Fixed-Time Convergence for a Class of Nonconvex-Nonconcave Min-Max Problems

    Authors: Kunal Garg, Mayank Baranwal

    Abstract: This study develops a fixed-time convergent saddle point dynamical system for solving min-max problems under a relaxation of standard convexity-concavity assumption. In particular, it is shown that by leveraging the dynamical systems viewpoint of an optimization algorithm, accelerated convergence to a saddle point can be obtained. Instead of requiring the objective function to be strongly-convex--… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: 6 pages, 2 figures

  11. arXiv:2203.00885  [pdf, other

    cs.LG cs.AI math.OC

    A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management

    Authors: Hardik Meisheri, Somjit Nath, Mayank Baranwal, Harshad Khadilkar

    Abstract: Most existing literature on supply chain and inventory management consider stochastic demand processes with zero or constant lead times. While it is true that in certain niche scenarios, uncertainty in lead times can be ignored, most real-world scenarios exhibit stochasticity in lead times. These random fluctuations can be caused due to uncertainty in arrival of raw materials at the manufacturer's… ▽ More

    Submitted 8 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  12. arXiv:2112.01363  [pdf, other

    math.OC cs.AI cs.LG eess.SY stat.ML

    Breaking the Convergence Barrier: Optimization via Fixed-Time Convergent Flows

    Authors: Param Budhraja, Mayank Baranwal, Kunal Garg, Ashish Hota

    Abstract: Accelerated gradient methods are the cornerstones of large-scale, data-driven optimization problems that arise naturally in machine learning and other fields concerning data analysis. We introduce a gradient-based optimization framework for achieving acceleration, based on the recently introduced notion of fixed-time stability of dynamical systems. The method presents itself as a generalization of… ▽ More

    Submitted 20 March, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted at AAAI Conference on Artificial Intelligence, 2022

  13. arXiv:2108.07555  [pdf, other

    cs.LG cs.AI eess.SY math.OC

    Revisiting State Augmentation methods for Reinforcement Learning with Stochastic Delays

    Authors: Somjit Nath, Mayank Baranwal, Harshad Khadilkar

    Abstract: Several real-world scenarios, such as remote control and sensing, are comprised of action and observation delays. The presence of delays degrades the performance of reinforcement learning (RL) algorithms, often to such an extent that algorithms fail to learn anything substantial. This paper formally describes the notion of Markov Decision Processes (MDPs) with stochastic delays and shows that dela… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: Accepted at CIKM'21

  14. arXiv:2011.00053  [pdf, other

    physics.chem-ph math.OC

    On sparse identification of complex dynamical systems: A study on discovering influential reactions in chemical reaction networks

    Authors: Farshad Harirchi, Doohyun Kim, Omar Khalil, Sijia Liu, Paolo Elvati, Mayank Baranwal, Alfred Hero, Angela Violi

    Abstract: A wide variety of real life complex networks are prohibitively large for modeling, analysis and control. Understanding the structure and dynamics of such networks entails creating a smaller representative network that preserves its relevant topological and dynamical properties. While modern machine learning methods have enabled identification of governing laws for complex dynamical systems, their… ▽ More

    Submitted 8 July, 2020; originally announced November 2020.

    Journal ref: Fuel, Volume 279, 2020, 118204, ISSN 0016-2361

  15. CAPPA: Continuous-time Accelerated Proximal Point Algorithm for Sparse Recovery

    Authors: Kunal Garg, Mayank Baranwal

    Abstract: This paper develops a novel Continuous-time Accelerated Proximal Point Algorithm (CAPPA) for $\ell_1$-minimization problems with provable fixed-time convergence guarantees. The problem of $\ell_1$-minimization appears in several contexts, such as sparse recovery (SR) in Compressed Sensing (CS) theory, and sparse linear and logistic regressions in machine learning to name a few. Most existing algor… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

    Comments: 6 pages, 5 figures

  16. arXiv:2002.05678  [pdf, ps, other

    stat.ML cs.IT cs.LG math.PR

    The Power of Graph Convolutional Networks to Distinguish Random Graph Models: Short Version

    Authors: Abram Magner, Mayank Baranwal, Alfred O. Hero III

    Abstract: Graph convolutional networks (GCNs) are a widely used method for graph representation learning. We investigate the power of GCNs, as a function of their number of layers, to distinguish between different random graph models on the basis of the embeddings of their sample graphs. In particular, the graph models that we consider arise from graphons, which are the most general possible parameterizatio… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: Conference version of arXiv:1910.12954

  17. Robust Distributed Fixed-Time Economic Dispatch under Time-Varying Topology

    Authors: Mayank Baranwal, Kunal Garg, Dimitra Panagou, Alfred O. Hero

    Abstract: The centralized power generation infrastructure that defines the North American electric grid is slowly moving to the distributed architecture due to the explosion in use of renewable generation and distributed energy resources (DERs), such as residential solar, wind turbines and battery storage. Furthermore, variable pricing policies and profusion of flexible loads entail frequent and severe chan… ▽ More

    Submitted 26 August, 2020; v1 submitted 30 October, 2019; originally announced October 2019.

    Comments: 6 pages, 3 figures, to appear in L-CSS

    Journal ref: IEEE Control Systems Letters, vol. 5, no. 4, pp. 1183-1188, Oct. 2021

  18. arXiv:1910.12954  [pdf, other

    stat.ML cs.IT cs.LG math.PR

    Fundamental Limits of Deep Graph Convolutional Networks

    Authors: Abram Magner, Mayank Baranwal, Alfred O. Hero III

    Abstract: Graph convolutional networks (GCNs) are a widely used method for graph representation learning. To elucidate the capabilities and limitations of GCNs, we investigate their power, as a function of their number of layers, to distinguish between different random graph models (corresponding to different class-conditional distributions in a classification problem) on the basis of the embeddings of thei… ▽ More

    Submitted 12 May, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: 19 pages

  19. Fixed-Time Stable Proximal Dynamical System for Solving MVIPs

    Authors: Kunal Garg, Mayank Baranwal, Rohit Gupta, Mouhacine Benosman

    Abstract: In this paper, a novel modified proximal dynamical system is proposed to compute the solution of a mixed variational inequality problem (MVIP) within a fixed time, where the time of convergence is finite and is uniformly bounded for all initial conditions. Under the assumptions of strong monotonicity and Lipschitz continuity, it is shown that a solution of the modified proximal dynamical system ex… ▽ More

    Submitted 19 October, 2022; v1 submitted 9 August, 2019; originally announced August 2019.

    Comments: 12 pages, 2 figures

  20. arXiv:1907.08720  [pdf, other

    math.OC eess.SY

    Multiway k-Cut in Static and Dynamic Graphs: A Maximum Entropy Principle Approach

    Authors: Mayank Baranwal, Amber Srivastava, Srinivasa Salapaka

    Abstract: This work presents a maximum entropy principle based algorithm for solving minimum multiway $k$-cut problem defined over static and dynamic {\em digraphs}. A multiway $k$-cut problem requires partitioning the set of nodes in a graph into $k$ subsets, such that each subset contains one prespecified node, and the corresponding total cut weight is minimized. These problems arise in many applications… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

    Comments: 8 pages, 7 figures

  21. arXiv:1905.10472  [pdf, other

    eess.SY math.OC

    Accelerating Distributed Optimization via Fixed-time Convergent Flows: Extensions to Non-convex Functions and Consistent Discretization

    Authors: Kunal Garg, Mayank Baranwal

    Abstract: Distributed optimization has gained significant attention in recent years, primarily fueled by the availability of a large amount of data and privacy-preserving requirements. This paper presents a fixed-time convergent optimization algorithm for solving a potentially non-convex optimization problem using a first-order multi-agent system. Each agent in the network can access only its private object… ▽ More

    Submitted 27 May, 2022; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: Under review. 10 pages, 1 figure

  22. arXiv:1811.00102  [pdf, other

    cs.LG cs.AI stat.ML

    On the Persistence of Clustering Solutions and True Number of Clusters in a Dataset

    Authors: Amber Srivastava, Mayank Baranwal, Srinivasa Salapaka

    Abstract: Typically clustering algorithms provide clustering solutions with prespecified number of clusters. The lack of a priori knowledge on the true number of underlying clusters in the dataset makes it important to have a metric to compare the clustering solutions with different number of clusters. This article quantifies a notion of persistence of clustering solutions that enables comparing solutions w… ▽ More

    Submitted 16 November, 2018; v1 submitted 31 October, 2018; originally announced November 2018.

  23. arXiv:1701.03065  [pdf, other

    math.OC eess.SY

    Robust Distributed Control of DC Microgrids with Time-Varying Power Sharing

    Authors: Mayank Baranwal, Alireza Askarian, Srinivasa M. Salapaka

    Abstract: This paper addresses the problem of output voltage regulation for multiple DC/DC converters connected to a microgrid, and prescribes a scheme for sharing power among different sources. This architecture is structured in such a way that it admits quantifiable analysis of the closed-loop performance of the network of converters; the analysis simplifies to studying closed-loop performance of an equiv… ▽ More

    Submitted 11 January, 2017; originally announced January 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1604.04154

  24. arXiv:1606.06427  [pdf, other

    math.OC

    Clustering with Capacity and Size Constraints: A Deterministic Approach

    Authors: Mayank Baranwal, Srinivasa M. Salapaka

    Abstract: This paper discusses a deterministic clustering approach to capacitated resource allocation problems. In particular, the Deterministic Annealing (DA) algorithm from the data-compression literature, which bears a distinct analogy to the phase transformation under annealing process in statistical physics, is adapted to address problems pertaining to clustering with several forms of size constraints.… ▽ More

    Submitted 21 June, 2016; originally announced June 2016.

    Comments: 6 pages, 5 figures

  25. arXiv:1604.04169  [pdf, other

    math.OC cs.AI

    A Deterministic Annealing Approach to the Multiple Traveling Salesmen and Related Problems

    Authors: Mayank Baranwal, Brian Roehl, Srinivasa M. Salapaka

    Abstract: This paper presents a novel and efficient heuristic framework for approximating the solutions to the multiple traveling salesmen problem (m-TSP) and other variants on the TSP. The approach adopted in this paper is an extension of the Maximum-Entropy-Principle (MEP) and the Deterministic Annealing (DA) algorithm. The framework is presented as a general tool that can be suitably adapted to a number… ▽ More

    Submitted 14 April, 2016; originally announced April 2016.

  26. arXiv:1604.04154  [pdf, other

    math.OC eess.SY

    Robust Control Framework for Time-Varying Power-Sharing among Distributed Energy Resources

    Authors: Mayank Baranwal, Srinivasa M. Salapaka

    Abstract: One of the most important challenges facing an electric grid is to incorporate renewables and distributed energy resources (DERs) to the grid. Because of the associated uncertainties in power generations and peak power demands, opportunities for improving the functioning and reliability of the grid lie in the design of an efficient, yet pragmatic distributed control framework with guaranteed robus… ▽ More

    Submitted 14 April, 2016; originally announced April 2016.

    Comments: arXiv admin note: text overlap with arXiv:1604.03573

  27. arXiv:1604.03590  [pdf, other

    math.OC

    Vehicle Routing Problem with Time Windows: A Deterministic Annealing approach

    Authors: Mayank Baranwal, Pratik M. Parekh, Lavanya Marla, Srinivasa M. Salapaka, Carolyn L. Beck

    Abstract: The Vehicle Routing Problem with Time-Windows (VRPTW) is an important problem in allocating resources on networks in time and space. We present in this paper a Deterministic Annealing (DA)-based approach to solving the VRPTW with its aspects of routing and scheduling, as well as to model additional constraints of heterogeneous vehicles and shipments. This is the first time, to our knowledge, that… ▽ More

    Submitted 12 April, 2016; originally announced April 2016.

  28. arXiv:1604.03573  [pdf, other

    math.OC eess.SY

    Robust Decentralized Voltage Control of DC-DC Converters with Applications to Power Sharing and Ripple Sharing

    Authors: Mayank Baranwal, Srinivasa M. Salapaka, Murti V. Salapaka

    Abstract: This paper addresses the problem of output voltage regulation for multiple DC-DC converters connected to a grid, and prescribes a robust scheme for sharing power among different sources. Also it develops a method for sharing 120 Hz ripple among DC power sources in a prescribed proportion, which accommodates the different capabilities of DC power sources to sustain the ripple. We present a decentra… ▽ More

    Submitted 12 April, 2016; originally announced April 2016.