Skip to main content

Showing 1–6 of 6 results for author: Ram, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.15483  [pdf, other

    cs.LG cs.AI cs.CL math.OC stat.ML

    Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning

    Authors: Heshan Fernando, Han Shen, Parikshit Ram, Yi Zhou, Horst Samulowitz, Nathalie Baracaldo, Tianyi Chen

    Abstract: Post-training of pre-trained LLMs, which typically consists of the supervised fine-tuning (SFT) stage and the preference learning (RLHF or DPO) stage, is crucial to effective and safe LLM applications. The widely adopted approach in post-training popular open-source LLMs is to sequentially perform SFT and RLHF/DPO. However, sequential training is sub-optimal in terms of SFT and RLHF/DPO trade-off:… ▽ More

    Submitted 5 February, 2025; v1 submitted 20 October, 2024; originally announced October 2024.

  2. arXiv:2011.12581  [pdf, other

    cs.LG stat.ML

    Overcoming Catastrophic Forgetting via Direction-Constrained Optimization

    Authors: Yunfei Teng, Anna Choromanska, Murray Campbell, Songtao Lu, Parikshit Ram, Lior Horesh

    Abstract: This paper studies a new design of the optimization algorithm for training deep learning models with a fixed architecture of the classification network in a continual learning framework. The training data is non-stationary and the non-stationarity is imposed by a sequence of distinct tasks. We first analyze a deep model trained on only one learning task in isolation and identify a region in networ… ▽ More

    Submitted 1 July, 2022; v1 submitted 25 November, 2020; originally announced November 2020.

  3. arXiv:2009.13714  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations

    Authors: Pu Zhao, Parikshit Ram, Songtao Lu, Yuguang Yao, Djallel Bouneffouf, Xue Lin, Sijia Liu

    Abstract: Adversarial perturbations are critical for certifying the robustness of deep learning models. A universal adversarial perturbation (UAP) can simultaneously attack multiple images, and thus offers a more unified threat model, obviating an image-wise attack algorithm. However, the existing UAP generator is underdeveloped when images are drawn from different image sources (e.g., with different image… ▽ More

    Submitted 17 August, 2022; v1 submitted 28 September, 2020; originally announced September 2020.

  4. arXiv:2008.08685  [pdf, other

    cs.LG stat.ML

    Neural Neighborhood Encoding for Classification

    Authors: Kaushik Sinha, Parikshit Ram

    Abstract: Inspired by the fruit-fly olfactory circuit, the Fly Bloom Filter [Dasgupta et al., 2018] is able to efficiently summarize the data with a single pass and has been used for novelty detection. We propose a new classifier (for binary and multi-class classification) that effectively encodes the different local neighborhoods for each class with a per-class Fly Bloom Filter. The inference on test data… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

  5. arXiv:2006.09635  [pdf, other

    cs.LG math.OC stat.ML

    Solving Constrained CASH Problems with ADMM

    Authors: Parikshit Ram, Sijia Liu, Deepak Vijaykeerthi, Dakuo Wang, Djallel Bouneffouf, Greg Bramble, Horst Samulowitz, Alexander G. Gray

    Abstract: The CASH problem has been widely studied in the context of automated configurations of machine learning (ML) pipelines and various solvers and toolkits are available. However, CASH solvers do not directly handle black-box constraints such as fairness, robustness or other domain-specific custom constraints. We present our recent approach [Liu, et al., 2020] that leverages the ADMM optimization fram… ▽ More

    Submitted 10 July, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 7th ICML Workshop on Automated Machine Learning (2020)

  6. arXiv:1905.00424  [pdf, other

    cs.LG stat.ML

    An ADMM Based Framework for AutoML Pipeline Configuration

    Authors: Sijia Liu, Parikshit Ram, Deepak Vijaykeerthy, Djallel Bouneffouf, Gregory Bramble, Horst Samulowitz, Dakuo Wang, Andrew Conn, Alexander Gray

    Abstract: We study the AutoML problem of automatically configuring machine learning pipelines by jointly selecting algorithms and their appropriate hyper-parameters for all steps in supervised learning pipelines. This black-box (gradient-free) optimization with mixed integer & continuous variables is a challenging problem. We propose a novel AutoML scheme by leveraging the alternating direction method of mu… ▽ More

    Submitted 6 December, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Journal ref: published at AAAI 2020