Skip to main content

Showing 1–10 of 10 results for author: Olivier, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.01207  [pdf, ps, other

    cs.LG cs.CR cs.SD eess.AS

    Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features

    Authors: Francisco Teixeira, Karla Pizzi, Raphael Olivier, Alberto Abad, Bhiksha Raj, Isabel Trancoso

    Abstract: Membership Inference (MI) poses a substantial privacy threat to the training data of Automatic Speech Recognition (ASR) systems, while also offering an opportunity to audit these models with regard to user data. This paper explores the effectiveness of loss-based features in combination with Gaussian and adversarial perturbations to perform MI in ASR models. To the best of our knowledge, this appr… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Trustworthy Speech Processing, Satellite Workshop at ICASSP 2024

  2. arXiv:2310.04445  [pdf, other

    cs.CL cs.AI cs.LG

    LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model

    Authors: Muhammad Ahmed Shah, Roshan Sharma, Hira Dhamyal, Raphael Olivier, Ankit Shah, Joseph Konan, Dareen Alharthi, Hazim T Bukhari, Massa Baali, Soham Deshmukh, Michael Kuhlmann, Bhiksha Raj, Rita Singh

    Abstract: It has been shown that Large Language Model (LLM) alignments can be circumvented by appending specially crafted attack suffixes with harmful queries to elicit harmful responses. To conduct attacks against private target models whose characterization is unknown, public models can be used as proxies to fashion the attack, with successful attacks being transferred from public proxies to private targe… ▽ More

    Submitted 21 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

  3. arXiv:2210.17316  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    There is more than one kind of robustness: Fooling Whisper with adversarial examples

    Authors: Raphael Olivier, Bhiksha Raj

    Abstract: Whisper is a recent Automatic Speech Recognition (ASR) model displaying impressive robustness to both out-of-distribution inputs and random noise. In this work, we show that this robustness does not carry over to adversarial noise. We show that we can degrade Whisper performance dramatically, or even transcribe a target sentence of our choice, by generating very small input perturbations with Sign… ▽ More

    Submitted 10 August, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted at InterSpeech 2023

  4. arXiv:2209.13523  [pdf, other

    cs.LG cs.CL cs.CR

    Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition models

    Authors: Raphael Olivier, Hadi Abdullah, Bhiksha Raj

    Abstract: A targeted adversarial attack produces audio samples that can force an Automatic Speech Recognition (ASR) system to output attacker-chosen text. To exploit ASR models in real-world, black-box settings, an adversary can leverage the transferability property, i.e. that an adversarial sample produced for a proxy ASR can also fool a different remote ASR. However recent work has shown that transferabil… ▽ More

    Submitted 29 September, 2022; v1 submitted 17 September, 2022; originally announced September 2022.

  5. arXiv:2207.04129  [pdf, other

    cs.LG cs.CR stat.ML

    How many perturbations break this model? Evaluating robustness beyond adversarial accuracy

    Authors: Raphael Olivier, Bhiksha Raj

    Abstract: Robustness to adversarial attacks is typically evaluated with adversarial accuracy. While essential, this metric does not capture all aspects of robustness and in particular leaves out the question of how many perturbations can be found for each point. In this work, we introduce an alternative approach, adversarial sparsity, which quantifies how difficult it is to find a successful perturbation gi… ▽ More

    Submitted 10 August, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:26583-26598, 2023

  6. arXiv:2203.16536  [pdf, other

    cs.CR cs.AI cs.LG cs.SD eess.AS

    Recent improvements of ASR models in the face of adversarial attacks

    Authors: Raphael Olivier, Bhiksha Raj

    Abstract: Like many other tasks involving neural networks, Speech Recognition models are vulnerable to adversarial attacks. However recent research has pointed out differences between attacks and defenses on ASR models compared to image models. Improving the robustness of ASR models requires a paradigm shift from evaluating attacks on one or a few models to a systemic approach in evaluation. We lay the grou… ▽ More

    Submitted 4 April, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: Submitted to Interspeech 2022

  7. Sequential Randomized Smoothing for Adversarially Robust Speech Recognition

    Authors: Raphael Olivier, Bhiksha Raj

    Abstract: While Automatic Speech Recognition has been shown to be vulnerable to adversarial attacks, defenses against these attacks are still lagging. Existing, naive defenses can be partially broken with an adaptive attack. In classification tasks, the Randomized Smoothing paradigm has been shown to be effective at defending models. However, it is difficult to apply this paradigm to ASR tasks, due to their… ▽ More

    Submitted 10 January, 2022; v1 submitted 5 November, 2021; originally announced December 2021.

    Comments: This update adds some relevant references to past and concurrent work

    Journal ref: 2021.emnlp-main.514 (2021) 6372-6386

  8. arXiv:2005.14070  [pdf, other

    cs.LG stat.ML

    Exploiting Non-Linear Redundancy for Neural Model Compression

    Authors: Muhammad A. Shah, Raphael Olivier, Bhiksha Raj

    Abstract: Deploying deep learning models, comprising of non-linear combination of millions, even billions, of parameters is challenging given the memory, power and compute constraints of the real world. This situation has led to research into model compression techniques most of which rely on suboptimal heuristics and do not consider the parameter redundancies due to linear dependence between neuron activat… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

  9. arXiv:1910.06393  [pdf, other

    cs.CL cs.LG

    In-training Matrix Factorization for Parameter-frugal Neural Machine Translation

    Authors: Zachary Kaden, Teven Le Scao, Raphael Olivier

    Abstract: In this paper, we propose the use of in-training matrix factorization to reduce the model size for neural machine translation. Using in-training matrix factorization, parameter matrices may be decomposed into the products of smaller matrices, which can compress large machine translation architectures by vastly reducing the number of learnable parameters. We apply in-training matrix factorization t… ▽ More

    Submitted 23 March, 2020; v1 submitted 27 September, 2019; originally announced October 2019.

  10. arXiv:1808.10025  [pdf, other

    cs.CL

    Retrieval-Based Neural Code Generation

    Authors: Shirley Anugrah Hayati, Raphael Olivier, Pravalika Avvaru, Pengcheng Yin, Anthony Tomasic, Graham Neubig

    Abstract: In models to generate program source code from natural language, representing this code in a tree structure has been a common approach. However, existing methods often fail to generate complex code correctly due to a lack of ability to memorize large and complex structures. We introduce ReCode, a method based on subtree retrieval that makes it possible to explicitly reference existing code example… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: This paper is accepted in EMNLP 2018. It has 6 pages