Skip to main content

Showing 1–13 of 13 results for author: Lakshminarayanan, C

.
  1. arXiv:2502.15017  [pdf, other

    cs.LG cs.CR

    Interpreting Adversarial Attacks and Defences using Architectures with Enhanced Interpretability

    Authors: Akshay G Rao, Chandrashekhar Lakshminarayanan, Arun Rajkumar

    Abstract: Adversarial attacks in deep learning represent a significant threat to the integrity and reliability of machine learning models. Adversarial training has been a popular defence technique against these adversarial attacks. In this work, we capitalize on a network architecture, namely Deep Linearly Gated Networks (DLGN), which has better interpretation capabilities than regular deep network architec… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: Publication accepted at AAAI Deployable AI conference 2025 (proof - https://sites.google.com/view/dai-2025/accepted-papers?authuser=0) Total 17 pages

  2. arXiv:2411.13264  [pdf, other

    cs.LG

    Transformers with Sparse Attention for Granger Causality

    Authors: Riya Mahesh, Rahul Vashisht, Chandrashekar Lakshminarayanan

    Abstract: Temporal causal analysis means understanding the underlying causes behind observed variables over time. Deep learning based methods such as transformers are increasingly used to capture temporal dynamics and causal relationships beyond mere correlations. Recent works suggest self-attention weights of transformers as a useful indicator of causal links. We leverage this to propose a novel modificati… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

  3. arXiv:2404.04312  [pdf, other

    cs.LG cs.AI cs.NE

    Half-Space Feature Learning in Neural Networks

    Authors: Mahesh Lorik Yadav, Harish Guruprasad Ramaswamy, Chandrashekar Lakshminarayanan

    Abstract: There currently exist two extreme viewpoints for neural network feature learning -- (i) Neural networks simply implement a kernel method (a la NTK) and hence no features are learned (ii) Neural networks can represent (and hence learn) intricate hierarchical features suitable for the data. We argue in this paper neither interpretation is likely to be correct based on a novel viewpoint. Neural netwo… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2311.11789  [pdf, other

    cs.LG cs.MA math.OC

    Approximate Linear Programming for Decentralized Policy Iteration in Cooperative Multi-agent Markov Decision Processes

    Authors: Lakshmi Mandal, Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

    Abstract: In this work, we consider a cooperative multi-agent Markov decision process (MDP) involving m agents. At each decision epoch, all the m agents independently select actions in order to maximize a common long-term objective. In the policy iteration process of multi-agent setup, the number of actions grows exponentially with the number of agents, incurring huge computational costs. Thus, recent works… ▽ More

    Submitted 29 April, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

  5. arXiv:2203.16455  [pdf, other

    cs.LG

    Explicitising The Implicit Intrepretability of Deep Neural Networks Via Duality

    Authors: Chandrashekar Lakshminarayanan, Amit Vikram Singh, Arun Rajkumar

    Abstract: Recent work by Lakshminarayanan and Singh [2020] provided a dual view for fully connected deep neural networks (DNNs) with rectified linear units (ReLU). It was shown that (i) the information in the gates is analytically characterised by a kernel called the neural path kernel (NPK) and (ii) most critical information is learnt in the gates, in that, given the learnt gates, the weights can be retrai… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

    Comments: 18 pages, 9 Figures. arXiv admin note: text overlap with arXiv:2110.03403

  6. arXiv:2110.03403  [pdf, ps, other

    cs.LG

    Disentangling deep neural networks with rectified linear units using duality

    Authors: Chandrashekar Lakshminarayanan, Amit Vikram Singh

    Abstract: Despite their success deep neural networks (DNNs) are still largely considered as black boxes. The main issue is that the linear and non-linear operations are entangled in every layer, making it hard to interpret the hidden layer outputs. In this paper, we look at DNNs with rectified linear units (ReLUs), and focus on the gating property (`on/off' states) of the ReLUs. We extend the recently devel… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

  7. arXiv:2006.10529  [pdf, other

    cs.LG stat.ML

    Neural Path Features and Neural Path Kernel : Understanding the role of gates in deep learning

    Authors: Chandrashekar Lakshminarayanan, Amit Vikram Singh

    Abstract: Rectified linear unit (ReLU) activations can also be thought of as 'gates', which, either pass or stop their pre-activation input when they are 'on' (when the pre-activation input is positive) or 'off' (when the pre-activation input is negative) respectively. A deep neural network (DNN) with ReLU activations has many gates, and the on/off status of each gate changes across input examples as well a… ▽ More

    Submitted 12 June, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Appeared in NeurIPS 2020 (23 pages)

  8. arXiv:2002.03996  [pdf, other

    cs.LG cs.AI stat.ML

    Deep Gated Networks: A framework to understand training and generalisation in deep learning

    Authors: Chandrashekar Lakshminarayanan, Amit Vikram Singh

    Abstract: Understanding the role of (stochastic) gradient descent (SGD) in the training and generalisation of deep neural networks (DNNs) with ReLU activation has been the object study in the recent past. In this paper, we make use of deep gated networks (DGNs) as a framework to obtain insights about DNNs with ReLU activation. In DGNs, a single neuronal unit has two components namely the pre-activation inpu… ▽ More

    Submitted 2 March, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: 18 Pages, submitted to ICML, added convnets

  9. arXiv:1709.04073  [pdf, ps, other

    cs.LG eess.SY stat.ML

    Linear Stochastic Approximation: Constant Step-Size and Iterate Averaging

    Authors: Chandrashekar Lakshminarayanan, Csaba Szepesvári

    Abstract: We consider $d$-dimensional linear stochastic approximation algorithms (LSAs) with a constant step-size and the so called Polyak-Ruppert (PR) averaging of iterates. LSAs are widely applied in machine learning and reinforcement learning (RL), where the aim is to compute an appropriate $θ_{*} \in \mathbb{R}^d$ (that is an optimum or a fixed point) using noisy data and $O(d)$ updates per iteration. I… ▽ More

    Submitted 12 September, 2017; originally announced September 2017.

    Comments: 16 pages, 2 figures, was submitted to NIPS 2017

  10. arXiv:1704.02544  [pdf, other

    eess.SY

    A Linearly Relaxed Approximate Linear Program for Markov Decision Processes

    Authors: Chandrashekar Lakshminarayanan, Shalabh Bhatnagar, Csaba Szepesvari

    Abstract: Approximate linear programming (ALP) and its variants have been widely applied to Markov Decision Processes (MDPs) with a large number of states. A serious limitation of ALP is that it has an intractable number of constraints, as a result of which constraint approximations are of interest. In this paper, we define a linearly relaxed approximation linear program (LRALP) that has a tractable number… ▽ More

    Submitted 8 April, 2017; originally announced April 2017.

    Comments: 23 pages, 2 figures, submitted to IEEE TAC

  11. arXiv:1409.3536  [pdf, ps, other

    eess.SY

    A Generalized Reduced Linear Program for Markov Decision Processes

    Authors: Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

    Abstract: Markov decision processes (MDPs) with large number of states are of high practical interest. However, conventional algorithms to solve MDP are computationally infeasible in this scenario. Approximate dynamic programming (ADP) methods tackle this issue by computing approximate solutions. A widely applied ADP method is approximate linear program (ALP) which makes use of linear function approximation… ▽ More

    Submitted 18 November, 2014; v1 submitted 11 September, 2014; originally announced September 2014.

    Comments: 24 pages, submitted to AAAI on November 19 2014

  12. arXiv:1403.4179  [pdf, ps, other

    eess.SY math.OC

    Approximate dynamic programming with $(\min,+)$ linear function approximation for Markov decision processes

    Authors: Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

    Abstract: Markov Decision Processes (MDP) is an useful framework to cast optimal sequential decision making problems. Given any MDP the aim is to find the optimal action selection mechanism i.e., the optimal policy. Typically, the optimal policy ($u^*$) is obtained by substituting the optimal value-function ($J^*$) in the Bellman equation. Alternately $u^*$ is also obtained by learning the optimal state-act… ▽ More

    Submitted 17 March, 2014; originally announced March 2014.

    Comments: 16 pages, 2 figures

  13. arXiv:1403.4175  [pdf, ps, other

    eess.SY math.OC

    Approximate Dynamic Programming based on Projection onto the (min,+) subsemimodule

    Authors: Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

    Abstract: We develop a new Approximate Dynamic Programming (ADP) method for infinite horizon discounted reward Markov Decision Processes (MDP) based on projection onto a subsemimodule. We approximate the value function in terms of a $(\min,+)$ linear combination of a set of basis functions whose $(\min,+)$ linear span constitutes a subsemimodule. The projection operator is closely related to the Fenchel tra… ▽ More

    Submitted 17 March, 2014; originally announced March 2014.

    Comments: 20 pages, 6 figures (including tables), 1 algorithm, a convergence proof