Skip to main content

Showing 1–18 of 18 results for author: Dolamic, L

.
  1. arXiv:2412.13809  [pdf, ps, other

    cs.LG

    Extreme Multi-label Completion for Semantic Document Labelling with Taxonomy-Aware Parallel Learning

    Authors: Julien Audiffren, Christophe Broillet, Ljiljana Dolamic, Philippe Cudré-Mauroux

    Abstract: In Extreme Multi Label Completion (XMLCo), the objective is to predict the missing labels of a collection of documents. Together with XML Classification, XMLCo is arguably one of the most challenging document classification tasks, as the very high number of labels (at least ten of thousands) is generally very large compared to the number of available labelled documents in the training dataset. Suc… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

  2. arXiv:2411.12473  [pdf, other

    cs.CL

    NMT-Obfuscator Attack: Ignore a sentence in translation with only one word

    Authors: Sahar Sadrizadeh, César Descalzo, Ljiljana Dolamic, Pascal Frossard

    Abstract: Neural Machine Translation systems are used in diverse applications due to their impressive performance. However, recent studies have shown that these systems are vulnerable to carefully crafted small perturbations to their inputs, known as adversarial attacks. In this paper, we propose a new type of adversarial attack against NMT models. In this attack, we find a word to be added between two sent… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

  3. arXiv:2410.04147  [pdf, other

    cs.CL

    Can the Variation of Model Weights be used as a Criterion for Self-Paced Multilingual NMT?

    Authors: Àlex R. Atrio, Alexis Allemann, Ljiljana Dolamic, Andrei Popescu-Belis

    Abstract: Many-to-one neural machine translation systems improve over one-to-one systems when training data is scarce. In this paper, we design and test a novel algorithm for selecting the language of minibatches when training such systems. The algorithm changes the language of the minibatch when the weights of the model do not evolve significantly, as measured by the smoothed KL divergence between all laye… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

  4. arXiv:2409.03291  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    LLM Detectors Still Fall Short of Real World: Case of LLM-Generated Short News-Like Posts

    Authors: Henrique Da Silva Gameiro, Andrei Kucharavy, Ljiljana Dolamic

    Abstract: With the emergence of widely available powerful LLMs, disinformation generated by large Language Models (LLMs) has become a major concern. Historically, LLM detectors have been touted as a solution, but their effectiveness in the real world is still to be proven. In this paper, we focus on an important setting in information operations -- short news-like posts generated by moderately sophisticated… ▽ More

    Submitted 27 September, 2024; v1 submitted 5 September, 2024; originally announced September 2024.

    Comments: 20 pages, 7 tables, 13 figures, under consideration for EMNLP

    ACM Class: I.2.7; K.6.5

  5. arXiv:2407.18251  [pdf, other

    cs.CV cs.CR cs.LG

    Sparse vs Contiguous Adversarial Pixel Perturbations in Multimodal Models: An Empirical Analysis

    Authors: Cristian-Alexandru Botocan, Raphael Meier, Ljiljana Dolamic

    Abstract: Assessing the robustness of multimodal models against adversarial examples is an important aspect for the safety of its users. We craft L0-norm perturbation attacks on the preprocessed input images. We launch them in a black-box setup against four multimodal models and two unimodal DNNs, considering both targeted and untargeted misclassification. Our attacks target less than 0.04% of perturbed ima… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    ACM Class: I.2.0; I.4.0

  6. arXiv:2406.14986  [pdf, ps, other

    cs.AI cs.CL

    Do Large Language Models Exhibit Cognitive Dissonance? Studying the Difference Between Revealed Beliefs and Stated Answers

    Authors: Manuel Mondal, Ljiljana Dolamic, Gérôme Bovet, Philippe Cudré-Mauroux, Julien Audiffren

    Abstract: Multiple Choice Questions (MCQ) have become a commonly used approach to assess the capabilities of Large Language Models (LLMs), due to their ease of manipulation and evaluation. The experimental appraisals of the LLMs' Stated Answer (their answer to MCQ) have pointed to their apparent ability to perform probabilistic reasoning or to grasp uncertainty. In this work, we investigate whether these ap… ▽ More

    Submitted 17 June, 2025; v1 submitted 21 June, 2024; originally announced June 2024.

  7. arXiv:2308.15246  [pdf, other

    cs.CL

    A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation

    Authors: Sahar Sadrizadeh, Ljiljana Dolamic, Pascal Frossard

    Abstract: Neural Machine Translation (NMT) models have been shown to be vulnerable to adversarial attacks, wherein carefully crafted perturbations of the input can mislead the target model. In this paper, we introduce ACT, a novel adversarial attack framework against NMT systems guided by a classifier. In our attack, the adversary aims to craft meaning-preserving adversarial examples whose translations in t… ▽ More

    Submitted 22 February, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

  8. arXiv:2306.09991  [pdf, other

    cs.NE cs.LG q-bio.PE

    Evolutionary Algorithms in the Light of SGD: Limit Equivalence, Minima Flatness, and Transfer Learning

    Authors: Andrei Kucharavy, Rachid Guerraoui, Ljiljana Dolamic

    Abstract: Whenever applicable, the Stochastic Gradient Descent (SGD) has shown itself to be unreasonably effective. Instead of underperforming and getting trapped in local minima due to the batch noise, SGD leverages it to learn to generalize better and find minima that are good enough for the entire dataset. This led to numerous theoretical and experimental investigations, especially in the context of Arti… ▽ More

    Submitted 20 May, 2023; originally announced June 2023.

    Comments: To be published in ALIFE 2023; 16 pages, 10 figures, 1 listing

    ACM Class: I.2.8; G.1.6

  9. arXiv:2306.08492  [pdf, other

    cs.CL

    A Relaxed Optimization Approach for Adversarial Attacks against Neural Machine Translation Models

    Authors: Sahar Sadrizadeh, Clément Barbier, Ljiljana Dolamic, Pascal Frossard

    Abstract: In this paper, we propose an optimization-based adversarial attack against Neural Machine Translation (NMT) models. First, we propose an optimization problem to generate adversarial examples that are semantically similar to the original sentences but destroy the translation generated by the target NMT model. This optimization problem is discrete, and we propose a continuous relaxation to solve it.… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  10. arXiv:2306.01393  [pdf, other

    cs.CL

    Assessing the Importance of Frequency versus Compositionality for Subword-based Tokenization in NMT

    Authors: Benoist Wolleb, Romain Silvestri, Giorgos Vernikos, Ljiljana Dolamic, Andrei Popescu-Belis

    Abstract: Subword tokenization is the de facto standard for tokenization in neural language models and machine translation systems. Three advantages are frequently cited in favor of subwords: shorter encoding of frequent tokens, compositionality of subwords, and ability to deal with unknown words. As their relative importance is not entirely clear yet, we propose a tokenization approach that enables us to s… ▽ More

    Submitted 12 January, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted at EAMT 2023

  11. arXiv:2304.13540  [pdf, ps, other

    cs.DC cs.LG cs.NE

    Byzantine-Resilient Learning Beyond Gradients: Distributing Evolutionary Search

    Authors: Andrei Kucharavy, Matteo Monti, Rachid Guerraoui, Ljiljana Dolamic

    Abstract: Modern machine learning (ML) models are capable of impressive performances. However, their prowess is not due only to the improvements in their architecture and training algorithms but also to a drastic increase in computational power used to train them. Such a drastic increase led to a growing interest in distributed ML, which in turn made worker failures and adversarial attacks an increasingly… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 10 pages, 4 listings, 2 theorems

    ACM Class: I.2.11; D.1.3; F.1.2

  12. arXiv:2303.12132  [pdf, other

    cs.CL cs.CR cs.LG

    Fundamentals of Generative Large Language Models and Perspectives in Cyber-Defense

    Authors: Andrei Kucharavy, Zachary Schillaci, Loïc Maréchal, Maxime Würsch, Ljiljana Dolamic, Remi Sabonnadiere, Dimitri Percia David, Alain Mermoud, Vincent Lenders

    Abstract: Generative Language Models gained significant attention in late 2022 / early 2023, notably with the introduction of models refined to act consistently with users' expectations of interactions with AI (conversational models). Arguably the focal point of public attention has been such a refinement of the GPT3 model -- the ChatGPT and its subsequent integration with auxiliary capabilities, including… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 41 pages (without references), 13 figures; public report of Cyber-Defence Campus

    ACM Class: I.2.7; I.2.1; K.6.5; K.4.2; J.7

  13. arXiv:2303.01068  [pdf, other

    cs.CL cs.CR cs.LG

    Targeted Adversarial Attacks against Neural Machine Translation

    Authors: Sahar Sadrizadeh, AmirHossein Dabiri Aghdam, Ljiljana Dolamic, Pascal Frossard

    Abstract: Neural Machine Translation (NMT) systems are used in various applications. However, it has been shown that they are vulnerable to very small perturbations of their inputs, known as adversarial attacks. In this paper, we propose a new targeted adversarial attack against NMT models. In particular, our goal is to insert a predefined target keyword into the translation of the adversarial sentence whil… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: ICASSP 2023, Code available at: http://github.com/sssadrizadeh/NMT-targeted-attack

  14. arXiv:2302.00944  [pdf, other

    cs.CL

    TransFool: An Adversarial Attack against Neural Machine Translation Models

    Authors: Sahar Sadrizadeh, Ljiljana Dolamic, Pascal Frossard

    Abstract: Deep neural networks have been shown to be vulnerable to small perturbations of their inputs, known as adversarial attacks. In this paper, we investigate the vulnerability of Neural Machine Translation (NMT) models to adversarial attacks and propose a new attack algorithm called TransFool. To fool NMT models, TransFool builds on a multi-term optimization problem and a gradient projection step. By… ▽ More

    Submitted 16 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  15. arXiv:2209.10224  [pdf, other

    cs.CR cs.SI

    Identifying Emerging Technologies and Leading Companies using Network Dynamics of Patent Clusters: a Cybersecurity Case Study

    Authors: Michael Tsesmelis, Ljiljana Dolamic, Marcus Matthias Keupp, Dimitri Percia David, Alain Mermoud

    Abstract: Strategic decisions rely heavily on non-scientific instrumentation to forecast emerging technologies and leading companies. Instead, we build a fast quantitative system with a small computational footprint to discover the most important technologies and companies in a given field, using generalisable methods applicable to any industry. With the help of patent data from the US Patent and Trademark… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 24 pages, 8 figures

  16. arXiv:2206.00282  [pdf, other

    cs.CV cs.PF

    Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale

    Authors: Cyril Vallez, Andrei Kucharavy, Ljiljana Dolamic

    Abstract: The advent of the internet, followed shortly by the social media made it ubiquitous in consuming and sharing information between anyone with access to it. The evolution in the consumption of media driven by this change, led to the emergence of images as means to express oneself, convey information and convince others efficiently. With computer vision algorithms progressing radically over the last… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 26 pages, 10 figures

    ACM Class: H.3.1; I.4.10; I.4.7; I.5.5; I.5.4; K.4

  17. arXiv:2203.05948  [pdf, other

    cs.CL cs.LG

    Block-Sparse Adversarial Attack to Fool Transformer-Based Text Classifiers

    Authors: Sahar Sadrizadeh, Ljiljana Dolamic, Pascal Frossard

    Abstract: Recently, it has been shown that, in spite of the significant performance of deep neural networks in different fields, those are vulnerable to adversarial examples. In this paper, we propose a gradient-based adversarial attack against transformer-based text classifiers. The adversarial perturbation in our method is imposed to be block-sparse so that the resultant adversarial example differs from t… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: ICASSP 2022, Code available at: https://github.com/sssadrizadeh/transformer-text-classifier-attack

  18. arXiv:2112.04810  [pdf, ps, other

    cs.IR

    From Scattered Sources to Comprehensive Technology Landscape: A Recommendation-based Retrieval Approach

    Authors: Chi Thang Duong, Dimitri Percia David, Ljiljana Dolamic, Alain Mermoud, Vincent Lenders, Karl Aberer

    Abstract: Mapping the technology landscape is crucial for market actors to take informed investment decisions. However, given the large amount of data on the Web and its subsequent information overload, manually retrieving information is a seemingly ineffective and incomplete approach. In this work, we propose an end-to-end recommendation based retrieval approach to support automatic retrieval of technologi… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.