Skip to main content

Showing 1–6 of 6 results for author: Igamberdiev, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.18789  [pdf, other

    cs.CL

    Granularity is crucial when applying differential privacy to text: An investigation for neural machine translation

    Authors: Doan Nam Long Vu, Timour Igamberdiev, Ivan Habernal

    Abstract: Applying differential privacy (DP) by means of the DP-SGD algorithm to protect individual data points during training is becoming increasingly popular in NLP. However, the choice of granularity at which DP is applied is often neglected. For example, neural machine translation (NMT) typically operates on the sentence-level granularity. From the perspective of DP, this setup assumes that each senten… ▽ More

    Submitted 26 September, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: Accepted at EMNLP Findings 2024

  2. arXiv:2311.14465  [pdf, other

    cs.CL

    DP-NMT: Scalable Differentially-Private Machine Translation

    Authors: Timour Igamberdiev, Doan Nam Long Vu, Felix Künnecke, Zhuo Yu, Jannik Holmer, Ivan Habernal

    Abstract: Neural machine translation (NMT) is a widely popular text generation task, yet there is a considerable research gap in the development of privacy-preserving NMT models, despite significant data privacy concerns for NMT systems. Differentially private stochastic gradient descent (DP-SGD) is a popular method for training machine learning models with concrete privacy guarantees; however, the implemen… ▽ More

    Submitted 24 April, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: Accepted at EACL 2024

  3. arXiv:2302.07636  [pdf, other

    cs.CR cs.CL

    DP-BART for Privatized Text Rewriting under Local Differential Privacy

    Authors: Timour Igamberdiev, Ivan Habernal

    Abstract: Privatized text rewriting with local differential privacy (LDP) is a recent approach that enables sharing of sensitive textual documents while formally guaranteeing privacy protection to individuals. However, existing systems face several issues, such as formal mathematical flaws, unrealistic privacy guarantees, privatization of only individual words, as well as a lack of transparency and reproduc… ▽ More

    Submitted 6 June, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Accepted at ACL Findings 2023

  4. arXiv:2208.10400  [pdf, other

    cs.CL cs.CR

    DP-Rewrite: Towards Reproducibility and Transparency in Differentially Private Text Rewriting

    Authors: Timour Igamberdiev, Thomas Arnold, Ivan Habernal

    Abstract: Text rewriting with differential privacy (DP) provides concrete theoretical guarantees for protecting the privacy of individuals in textual documents. In practice, existing systems may lack the means to validate their privacy-preserving claims, leading to problems of transparency and reproducibility. We introduce DP-Rewrite, an open-source framework for differentially private text rewriting which… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted at COLING 2022

  5. arXiv:2112.08159  [pdf, other

    cs.CL

    One size does not fit all: Investigating strategies for differentially-private learning across NLP tasks

    Authors: Manuel Senge, Timour Igamberdiev, Ivan Habernal

    Abstract: Preserving privacy in contemporary NLP models allows us to work with sensitive data, but unfortunately comes at a price. We know that stricter privacy guarantees in differentially-private stochastic gradient descent (DP-SGD) generally degrade model performance. However, previous research on the efficiency of DP-SGD in NLP is inconclusive or even counter-intuitive. In this short paper, we provide a… ▽ More

    Submitted 31 January, 2023; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022 final camera-ready version

  6. arXiv:2102.09604  [pdf, other

    cs.SI cs.CL cs.CR cs.LG

    Privacy-Preserving Graph Convolutional Networks for Text Classification

    Authors: Timour Igamberdiev, Ivan Habernal

    Abstract: Graph convolutional networks (GCNs) are a powerful architecture for representation learning on documents that naturally occur as graphs, e.g., citation or social networks. However, sensitive personal information, such as documents with people's profiles or relationships as edges, are prone to privacy leaks, as the trained model might reveal the original input. Although differential privacy (DP) of… ▽ More

    Submitted 2 May, 2022; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: Accepted at LREC 2022