Skip to main content

Showing 1–15 of 15 results for author: George, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2304.11460  [pdf, other

    eess.SY cs.LG eess.SP

    Reinforcement Learning with an Abrupt Model Change

    Authors: Wuxia Chen, Taposh Banerjee, Jemin George, Carl Busart

    Abstract: The problem of reinforcement learning is considered where the environment or the model undergoes a change. An algorithm is proposed that an agent can apply in such a problem to achieve the optimal long-time discounted reward. The algorithm is model-free and learns the optimal policy by interacting with the environment. It is shown that the proposed algorithm has strong optimality properties. The e… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

  2. arXiv:2211.01338  [pdf, other

    eess.AS cs.CL cs.MM cs.SD eess.IV

    Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

    Authors: Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya , et al. (2 additional authors not shown)

    Abstract: Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video. This task becomes challenging when the source and target languages… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

  3. arXiv:2201.04962  [pdf, other

    cs.MA cs.AI cs.LG eess.SY math.OC

    Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph

    Authors: Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty, Piyush. K. Sharma

    Abstract: Existing distributed cooperative multi-agent reinforcement learning (MARL) frameworks usually assume undirected coordination graphs and communication graphs while estimating a global reward via consensus algorithms for policy evaluation. Such a framework may induce expensive communication costs and exhibit poor scalability due to requirement of global consensus. In this work, we study MARLs with d… ▽ More

    Submitted 9 January, 2022; originally announced January 2022.

  4. arXiv:2107.12416  [pdf, other

    eess.SY cs.AI cs.LG math.OC

    Asynchronous Distributed Reinforcement Learning for LQR Control via Zeroth-Order Block Coordinate Descent

    Authors: Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty, Piyush K. Sharma

    Abstract: Recently introduced distributed zeroth-order optimization (ZOO) algorithms have shown their utility in distributed reinforcement learning (RL). Unfortunately, in the gradient estimation process, almost all of them require random samples with the same dimension as the global variable and/or require evaluation of the global cost function, which may induce high estimation variance for large-scale net… ▽ More

    Submitted 2 May, 2024; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: The arxiv version contains proofs of Lemma 3 and Lemma 5, which are missing in the published version

  5. arXiv:2103.04480  [pdf, other

    eess.SY math.OC

    Learning Distributed Stabilizing Controllers for Multi-Agent Systems

    Authors: Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty, Piyush K. Sharma

    Abstract: We address the problem of model-free distributed stabilization of heterogeneous multi-agent systems using reinforcement learning (RL). Two algorithms are developed. The first algorithm solves a centralized linear quadratic regulator (LQR) problem without knowing any initial stabilizing gain in advance. The second algorithm builds upon the results of the first algorithm, and extends it to distribut… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: This paper propose model-free RL algorithms for deriving stabilizing gains of continuous-time multi-agent systems

  6. arXiv:2010.08615  [pdf, other

    eess.SY cs.AI math.OC

    Decomposability and Parallel Computation of Multi-Agent LQR

    Authors: Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty

    Abstract: Individual agents in a multi-agent system (MAS) may have decoupled open-loop dynamics, but a cooperative control objective usually results in coupled closed-loop dynamics thereby making the control design computationally expensive. The computation time becomes even higher when a learning strategy such as reinforcement learning (RL) needs to be applied to deal with the situation when the agents dyn… ▽ More

    Submitted 7 March, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: This paper contains proofs of all the theorems in the conference paper "Decomposability and Parallel Computation of Multi-Agent LQR"

  7. arXiv:2008.06604  [pdf, other

    eess.SY cs.MA math.OC

    Model-Free Optimal Control of Linear Multi-Agent Systems via Decomposition and Hierarchical Approximation

    Authors: Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty

    Abstract: Designing the optimal linear quadratic regulator (LQR) for a large-scale multi-agent system (MAS) is time-consuming since it involves solving a large-size matrix Riccati equation. The situation is further exasperated when the design needs to be done in a model-free way using schemes such as reinforcement learning (RL). To reduce this computational complexity, we decompose the large-scale LQR desig… ▽ More

    Submitted 16 March, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: This paper proposes a hierarchical learning and control framework for model-free LQR of heterogeneous linear multi-agent systems

  8. arXiv:2008.05853  [pdf

    eess.IV physics.optics

    Massively Parallel Amplitude-Only Fourier Neural Network

    Authors: Mario Miscuglio, Zibo Hu, Shurui Li, Jonathan George, Roberto Capanna, Philippe M. Bardet, Puneet Gupta, Volker J. Sorger

    Abstract: Machine-intelligence has become a driving factor in modern society. However, its demand outpaces the underlying electronic technology due to limitations given by fundamental physics such as capacitive charging of wires, but also by system architecture of storing and handling data, both driving recent trends towards processor heterogeneity. Here we introduce a novel amplitude-only Fourier-optical p… ▽ More

    Submitted 15 August, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

  9. arXiv:2007.14186  [pdf, ps, other

    eess.SY math.DS

    Hierarchical Control of Multi-Agent Systems using Online Reinforcement Learning

    Authors: He Bai, Jemin George, Aranya Chakrabortty

    Abstract: We propose a new reinforcement learning based approach to designing hierarchical linear quadratic regulator (LQR) controllers for heterogeneous linear multi-agent systems with unknown state-space models and separated control objectives. The separation arises from grouping the agents into multiple non-overlapping groups, and defining the control goal as two distinct objectives. The first objective… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

  10. arXiv:2006.08925  [pdf, other

    cs.NI eess.SP

    Improving the Performance of Deep Learning for Wireless Localization

    Authors: Ramdoot Pydipaty, Johnu George, Krishna Selvaraju, Amit Saha

    Abstract: Indoor localization systems are most commonly based on Received Signal Strength Indicator (RSSI) measurements of either WiFi or Bluetooth-Low-Energy (BLE) beacons. In such systems, the two most common techniques are trilateration and fingerprinting, with the latter providing higher accuracy. In the fingerprinting technique, Deep Learning (DL) algorithms are often used to predict the location of th… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

  11. arXiv:1911.02511  [pdf

    eess.SP physics.optics

    Electronic Bottleneck Suppression in Next-generation Networks with Integrated Photonic Digital-to-analog Converters

    Authors: Jiawei Meng, Mario Miscuglio, Jonathan K. George, Aydin Babakhani, Volker J. Sorger

    Abstract: Digital-to-analog converters (DAC) are indispensable functional units in signal processing instrumentation and wide-band telecommunication links for both civil and military applications. Since photonic systems are capable of high data throughput and low latency, an increasingly found system limitation stems from the required domain-crossing such as digital-to-analog, and electronic-to-optical. A p… ▽ More

    Submitted 22 December, 2019; v1 submitted 3 November, 2019; originally announced November 2019.

    Journal ref: Advanced Photonics Research 2020, 2000033

  12. arXiv:1909.10556  [pdf, ps, other

    eess.SP eess.SY

    Multi-Agent Coordination for Distributed Transmit Beamforming

    Authors: Jemin George, Anjaly Parayil, He Bai

    Abstract: This paper presents the formulation and analysis of a two time-scale optimization algorithm for multi-agent coordination for the purpose of distributed beamforming. Each agent is assumed to be randomly positioned with respect to each other with random phase offsets and amplitudes. Agents are tasked with coordinate among themselves to position themselves and adjust their phase offset and amplitude… ▽ More

    Submitted 23 September, 2019; originally announced September 2019.

  13. arXiv:1908.06693  [pdf, ps, other

    math.OC eess.SY

    Distributed Stochastic Gradient Method for Non-Convex Problems with Applications in Supervised Learning

    Authors: Jemin George, Tao Yang, He Bai, Prudhvi Gurram

    Abstract: We develop a distributed stochastic gradient descent algorithm for solving non-convex optimization problems under the assumption that the local objective functions are twice continuously differentiable with Lipschitz continuous gradients and Hessians. We provide sufficient conditions on step-sizes that guarantee the asymptotic mean-square convergence of the proposed algorithm. We apply the develop… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  14. arXiv:1805.08633  [pdf, ps, other

    eess.SP math.NA

    The right way to teach the FFT

    Authors: Jithin Donny George

    Abstract: The algorithm behind the Fast Fourier Transform has a simple yet beautiful geometric interpretation that is often lost in translation in a classroom. This article provides a visual perspective which aims to capture the essence of it.

    Submitted 22 June, 2018; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: Corrected typos and added more details

  15. arXiv:1711.02500  [pdf

    eess.SP physics.optics

    Integrated All-Optical Fast Fourier Transform: Design and Sensitivity Analysis

    Authors: Hani Nejadriahi, David HillerKuss, Jonathan K. George, Volker J. Sorger

    Abstract: The fast Fourier transform, FFT, is a useful and prevalent algorithm in signal processing. It characterizes the spectral components of a signal, or is used in combination with other operations to perform more complex computations such as filtering, convolution, and correlation. Digital FFTs are limited in speed by the necessity of moving charge within logic gates. An analog temporal FFT in fiber o… ▽ More

    Submitted 31 October, 2017; originally announced November 2017.