Showing 1–11 of 11 results for author: Wadhawan, K

Searching in archive cs.
  1. arXiv:2311.11229

    cs.CL

    Causal ATE Mitigates Unintended Bias in Controlled Text Generation

    Authors: Rahul Madhavan, Kahini Wadhawan

    Abstract: We study attribute control in language models through the method of Causal Average Treatment Effect (Causal ATE). Existing methods for the attribute control task in Language Models (LMs) check for the co-occurrence of words in a sentence with the attribute of interest, and control for them. However, spurious correlation of the words with the attribute in the training dataset, can cause models to h…

    Submitted 16 February, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

    Comments: 12 pages, 5 figures

  2. CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation

    Authors: Rahul Madhavan, Rishabh Garg, Kahini Wadhawan, Sameep Mehta

    Abstract: We propose a method to control the attributes of Language Models (LMs) for the text generation task using Causal Average Treatment Effect (ATE) scores and counterfactual augmentation. We explore this method, in the context of LM detoxification, and propose the Causally Fair Language (CFL) architecture for detoxifying pre-trained LMs in a plug-and-play manner. Our architecture is based on a Structu…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 19 pages, 10 figures. Findings of ACL 2023

    Journal ref: Findings of the Association for Computational Linguistics: ACL 2023

  3. arXiv:2207.07174

    cs.LG stat.ML

    Attribute Graphs Underlying Molecular Generative Models: Path to Learning with Limited Data

    Authors: Samuel C. Hoffman, Payel Das, Karthikeyan Shanmugam, Kahini Wadhawan, Prasanna Sattigeri

    Abstract: Training generative models that capture rich semantics of the data and interpreting the latent representations encoded by such models are very important problems in un-/self-supervised learning. In this work, we provide a simple algorithm that relies on perturbation experiments on latent codes of a pre-trained generative autoencoder to uncover an attribute graph that is implied by the generative m…

    Submitted 29 August, 2024; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: New experiments; reframed contributions

  4. arXiv:2108.08077

    q-bio.QM cs.LG

    Towards Interpreting Zoonotic Potential of Betacoronavirus Sequences With Attention

    Authors: Kahini Wadhawan, Payel Das, Barbara A. Han, Ilya R. Fischhoff, Adrian C. Castellanos, Arvind Varsani, Kush R. Varshney

    Abstract: Current methods for viral discovery target evolutionarily conserved proteins that accurately identify virus families but remain unable to distinguish the zoonotic potential of newly discovered viruses. Here, we apply an attention-enhanced long-short-term memory (LSTM) deep neural net classifier to a highly conserved viral protein target to predict zoonotic potential across betacoronaviruses. The c…

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: 11 pages, 8 figures, 1 table, accepted at ICLR 2021 workshop Machine learning for preventing and combating pandemics

  5. Optimizing Molecules using Efficient Queries from Property Evaluations

    Authors: Samuel Hoffman, Vijil Chenthamarakshan, Kahini Wadhawan, Pin-Yu Chen, Payel Das

    Abstract: Machine learning based methods have shown potential for optimizing existing molecules with more desirable properties, a critical step towards accelerating new chemical discovery. Here we propose QMO, a generic query-based molecule optimization framework that exploits latent embeddings from a molecule autoencoder. QMO improves the desired properties of an input molecule based on efficient queries,…

    Submitted 18 October, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: Preprint version to be published at Nature Machine Intelligence; Github: https://github.com/IBM/QMO

    Journal ref: Nat Mach Intell 4, 21-31 (2022)

  6. arXiv:2010.02260

    cs.CL cs.AI

    Effects of Naturalistic Variation in Goal-Oriented Dialog

    Authors: Jatin Ganhotra, Robert Moore, Sachindra Joshi, Kahini Wadhawan

    Abstract: Existing benchmarks used to evaluate the performance of end-to-end neural dialog systems lack a key component: natural variation present in human conversations. Most datasets are constructed through crowdsourcing, where the crowd workers follow a fixed template of instructions while enacting the role of a user/agent. This results in straight-forward, somewhat routine, and mostly trouble-free conve…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020. The updated datasets are available at: https://github.com/IBM/naturalistic-variation-goal-oriented-dialog-datasets

  7. arXiv:2005.11248

    cs.LG q-bio.QM stat.ML

    Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics

    Authors: Payel Das, Tom Sercu, Kahini Wadhawan, Inkit Padhi, Sebastian Gehrmann, Flaviu Cipcigan, Vijil Chenthamarakshan, Hendrik Strobelt, Cicero dos Santos, Pin-Yu Chen, Yi Yan Yang, Jeremy Tan, James Hedrick, Jason Crain, Aleksandra Mojsilovic

    Abstract: De novo therapeutic design is challenged by a vast chemical repertoire and multiple constraints, e.g., high broad-spectrum potency and low toxicity. We propose CLaSS (Controlled Latent attribute Space Sampling) - an efficient computational method for attribute-controlled generation of molecules, which leverages guidance from classifiers trained on an informative latent space of molecules modeled u…

    Submitted 25 February, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

    Journal ref: Nature Biomedical Engineering (2021)

  8. arXiv:1811.05443

    cs.LG stat.ML

    Co-regularized Alignment for Unsupervised Domain Adaptation

    Authors: Abhishek Kumar, Prasanna Sattigeri, Kahini Wadhawan, Leonid Karlinsky, Rogerio Feris, William T. Freeman, Gregory Wornell

    Abstract: Deep neural networks, trained with large amount of labeled data, can fail to generalize well when tested with examples from a target domain whose distribution differs from the training data distribution, referred as the source domain. It can be expensive or even infeasible to obtain required amount of labeled data in all possible domains. Unsupervised domain adaptation sets out to ad…

    Submitted 13 November, 2018; originally announced November 2018.

    Comments: NIPS 2018 accepted version

  9. arXiv:1810.07743

    q-bio.QM cs.LG stat.ML

    PepCVAE: Semi-Supervised Targeted Design of Antimicrobial Peptide Sequences

    Authors: Payel Das, Kahini Wadhawan, Oscar Chang, Tom Sercu, Cicero Dos Santos, Matthew Riemer, Vijil Chenthamarakshan, Inkit Padhi, Aleksandra Mojsilovic

    Abstract: Given the emerging global threat of antimicrobial resistance, new methods for next-generation antimicrobial design are urgently needed. We report a peptide generation framework PepCVAE, based on a semi-supervised variational autoencoder (VAE) model, for designing novel antimicrobial peptide (AMP) sequences. Our model learns a rich latent space of the biological peptide context by taking advantage…

    Submitted 13 November, 2018; v1 submitted 17 October, 2018; originally announced October 2018.

  10. arXiv:1711.09395

    cs.CL cs.AI cs.LG

    Improved Neural Text Attribute Transfer with Non-parallel Data

    Authors: Igor Melnyk, Cicero Nogueira dos Santos, Kahini Wadhawan, Inkit Padhi, Abhishek Kumar

    Abstract: Text attribute transfer using non-parallel data requires methods that can perform disentanglement of content and linguistic attributes. In this work, we propose multiple improvements over the existing approaches that enable the encoder-decoder framework to cope with the text attribute transfer from non-parallel data. We perform experiments on the sentiment transfer task using two datasets. For bot…

    Submitted 4 December, 2017; v1 submitted 26 November, 2017; originally announced November 2017.

    Comments: NIPS 2017 Workshop on Learning Disentangled Representations: from Perception to Control

  11. arXiv:1707.02198

    cs.LG

    Learning Loss Functions for Semi-supervised Learning via Discriminative Adversarial Networks

    Authors: Cicero Nogueira dos Santos, Kahini Wadhawan, Bowen Zhou

    Abstract: We propose discriminative adversarial networks (DAN) for semi-supervised learning and loss function learning. Our DAN approach builds upon generative adversarial networks (GANs) and conditional GANs but includes the key differentiator of using two discriminators instead of a generator and a discriminator. DAN can be seen as a framework to learn loss functions for predictors that also implements se…

    Submitted 7 July, 2017; originally announced July 2017.

    Comments: 11 pages