Showing 1–6 of 6 results for author: Squillante, M S

  1. arXiv:2406.14753  [pdf, other]

    cs.LG stat.ME

    A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithms

    Authors: Weiqin Chen, Mark S. Squillante, Chai Wah Wu, Santiago Paternain

    Abstract: We devise a control-theoretic reinforcement learning approach to support direct learning of the optimal policy. We establish various theoretical properties of our approach, such as convergence and optimality of our analog of the Bellman operator and Q-learning, a new control-policy-variable gradient theorem, and a specific gradient ascent algorithm based on this theorem within the context of a spe…

    Submitted 27 November, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2311.01994  [pdf, other]

    stat.ML cs.AI cs.LG math.OC

    Obtaining Explainable Classification Models using Distributionally Robust Optimization

    Authors: Sanjeeb Dash, Soumyadip Ghosh, Joao Goncalves, Mark S. Squillante

    Abstract: Model explainability is crucial for human users to be able to interpret how a proposed classifier assigns labels to data based on its feature values. We study generalized linear models constructed using sets of feature value rules, which can capture nonlinear dependencies and interactions. An inherent trade-off exists between rule set sparsity and its prediction accuracy. It is computationally exp…

    Submitted 3 November, 2023; originally announced November 2023.

  3. arXiv:2202.11685  [pdf, other]

    cs.LG stat.ME stat.ML

    A Class of Geometric Structures in Transfer Learning: Minimax Bounds and Optimality

    Authors: Xuhui Zhang, Jose Blanchet, Soumyadip Ghosh, Mark S. Squillante

    Abstract: We study the problem of transfer learning, observing that previous efforts to understand its information-theoretic limits do not fully exploit the geometric structure of the source and target domains. In contrast, our study first illustrates the benefits of incorporating a natural geometric structure within a linear regression model, which corresponds to the generalized eigenvalue problem formed b…

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: AISTATS 2022

  4. arXiv:1905.12009  [pdf, other]

    cs.LG math.OC stat.ML

    A General Markov Decision Process Framework for Directly Learning Optimal Control Policies

    Authors: Yingdong Lu, Mark S. Squillante, Chai Wah Wu

    Abstract: We consider a new form of reinforcement learning (RL) that is based on opportunities to directly learn the optimal control policy and a general Markov decision process (MDP) framework devised to support these opportunities. Derivations of general classes of our control-based RL methods are presented, together with forms of exploration and exploitation in learning and applying the optimal control p…

    Submitted 31 March, 2021; v1 submitted 28 May, 2019; originally announced May 2019.

  5. arXiv:1812.08329  [pdf, other]

    cs.LG cs.CR stat.ML

    PROVEN: Certifying Robustness of Neural Networks with a Probabilistic Approach

    Authors: Tsui-Wei Weng, Pin-Yu Chen, Lam M. Nguyen, Mark S. Squillante, Ivan Oseledets, Luca Daniel

    Abstract: With deep neural networks providing state-of-the-art machine learning models for numerous machine learning tasks, quantifying the robustness of these models has become an important area of research. However, most of the research literature merely focuses on the worst-case setting where the input of the neural network is perturbed with noises that are constrained within an $\ell_p$ ball; a…

    Submitted 7 January, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: updated ref [25]

  6. arXiv:1805.08122  [pdf, other]

    stat.ML cs.LG

    A General Family of Robust Stochastic Operators for Reinforcement Learning

    Authors: Yingdong Lu, Mark S. Squillante, Chai Wah Wu

    Abstract: We consider a new family of operators for reinforcement learning with the goal of alleviating the negative effects and becoming more robust to approximation or estimation errors. Various theoretical results are established, which include showing on a sample path basis that our family of operators preserve optimality and increase the action gap. Our empirical results illustrate the strong benefits…

    Submitted 28 May, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: 12 pages