Skip to main content

Showing 1–11 of 11 results for author: Kucharavy, A

.
  1. arXiv:2409.03291  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    LLM Detectors Still Fall Short of Real World: Case of LLM-Generated Short News-Like Posts

    Authors: Henrique Da Silva Gameiro, Andrei Kucharavy, Ljiljana Dolamic

    Abstract: With the emergence of widely available powerful LLMs, disinformation generated by large Language Models (LLMs) has become a major concern. Historically, LLM detectors have been touted as a solution, but their effectiveness in the real world is still to be proven. In this paper, we focus on an important setting in information operations -- short news-like posts generated by moderately sophisticated… ▽ More

    Submitted 27 September, 2024; v1 submitted 5 September, 2024; originally announced September 2024.

    Comments: 20 pages, 7 tables, 13 figures, under consideration for EMNLP

    ACM Class: I.2.7; K.6.5

  2. arXiv:2312.07110  [pdf, other

    cs.CL cs.CR cs.LG

    LLMs Perform Poorly at Concept Extraction in Cyber-security Research Literature

    Authors: Maxime Würsch, Andrei Kucharavy, Dimitri Percia David, Alain Mermoud

    Abstract: The cybersecurity landscape evolves rapidly and poses threats to organizations. To enhance resilience, one needs to track the latest developments and trends in the domain. It has been demonstrated that standard bibliometrics approaches show their limits in such a fast-evolving domain. For this purpose, we use large language models (LLMs) to extract relevant knowledge entities from cybersecurity-re… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 24 pages, 9 figures

  3. arXiv:2306.09991  [pdf, other

    cs.NE cs.LG q-bio.PE

    Evolutionary Algorithms in the Light of SGD: Limit Equivalence, Minima Flatness, and Transfer Learning

    Authors: Andrei Kucharavy, Rachid Guerraoui, Ljiljana Dolamic

    Abstract: Whenever applicable, the Stochastic Gradient Descent (SGD) has shown itself to be unreasonably effective. Instead of underperforming and getting trapped in local minima due to the batch noise, SGD leverages it to learn to generalize better and find minima that are good enough for the entire dataset. This led to numerous theoretical and experimental investigations, especially in the context of Arti… ▽ More

    Submitted 20 May, 2023; originally announced June 2023.

    Comments: To be published in ALIFE 2023; 16 pages, 10 figures, 1 listing

    ACM Class: I.2.8; G.1.6

  4. arXiv:2304.13540  [pdf, ps, other

    cs.DC cs.LG cs.NE

    Byzantine-Resilient Learning Beyond Gradients: Distributing Evolutionary Search

    Authors: Andrei Kucharavy, Matteo Monti, Rachid Guerraoui, Ljiljana Dolamic

    Abstract: Modern machine learning (ML) models are capable of impressive performances. However, their prowess is not due only to the improvements in their architecture and training algorithms but also to a drastic increase in computational power used to train them. Such a drastic increase led to a growing interest in distributed ML, which in turn made worker failures and adversarial attacks an increasingly… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 10 pages, 4 listings, 2 theorems

    ACM Class: I.2.11; D.1.3; F.1.2

  5. arXiv:2304.08968  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Stochastic Parrots Looking for Stochastic Parrots: LLMs are Easy to Fine-Tune and Hard to Detect with other LLMs

    Authors: Da Silva Gameiro Henrique, Andrei Kucharavy, Rachid Guerraoui

    Abstract: The self-attention revolution allowed generative language models to scale and achieve increasingly impressive abilities. Such models - commonly referred to as Large Language Models (LLMs) - have recently gained prominence with the general public, thanks to conversational fine-tuning, putting their behavior in line with public expectations regarding AI. This prominence amplified prior concerns rega… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 15 pages, 6 figures; 10 pages, 7 figures Supplementary Materials; under review at ECML 2023

    ACM Class: I.2.7; K.6.5

  6. arXiv:2303.12132  [pdf, other

    cs.CL cs.CR cs.LG

    Fundamentals of Generative Large Language Models and Perspectives in Cyber-Defense

    Authors: Andrei Kucharavy, Zachary Schillaci, Loïc Maréchal, Maxime Würsch, Ljiljana Dolamic, Remi Sabonnadiere, Dimitri Percia David, Alain Mermoud, Vincent Lenders

    Abstract: Generative Language Models gained significant attention in late 2022 / early 2023, notably with the introduction of models refined to act consistently with users' expectations of interactions with AI (conversational models). Arguably the focal point of public attention has been such a refinement of the GPT3 model -- the ChatGPT and its subsequent integration with auxiliary capabilities, including… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 41 pages (without references), 13 figures; public report of Cyber-Defence Campus

    ACM Class: I.2.7; I.2.1; K.6.5; K.4.2; J.7

  7. arXiv:2206.00282  [pdf, other

    cs.CV cs.PF

    Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale

    Authors: Cyril Vallez, Andrei Kucharavy, Ljiljana Dolamic

    Abstract: The advent of the internet, followed shortly by the social media made it ubiquitous in consuming and sharing information between anyone with access to it. The evolution in the consumption of media driven by this change, led to the emergence of images as means to express oneself, convey information and convince others efficiently. With computer vision algorithms progressing radically over the last… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 26 pages, 10 figures

    ACM Class: H.3.1; I.4.10; I.4.7; I.5.5; I.5.4; K.4

  8. arXiv:2108.12275  [pdf, other

    cs.LG cs.CL

    Can the Transformer Be Used as a Drop-in Replacement for RNNs in Text-Generating GANs?

    Authors: Kevin Blin, Andrei Kucharavy

    Abstract: In this paper we address the problem of fine-tuned text generation with a limited computational budget. For that, we use a well-performing text generative adversarial network (GAN) architecture - Diversity-Promoting GAN (DPGAN), and attempted a drop-in replacement of the LSTM layer with a self-attention-based Transformer layer in order to leverage their efficiency. The resulting Self-Attention DPG… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: accepted to RANLP 2021

    MSC Class: 68T50; 68T05 ACM Class: I.2.7

  9. arXiv:2007.14585  [pdf

    q-bio.GN

    On the Transcriptomic Signature and General Stress State Associated with Aneuploidy

    Authors: Hung-Ji Tsai, Anjali R. Nelliat, Andrei Kucharavy, Mohammad Ikbal Choudhury, Sean X. Sun, Michael C. Schatz, Rong Li

    Abstract: Whether aneuploid cells with diverse karyotypes have any properties in common has a been a subject of intense interest. A recent study by Terhorst et al. (1) reinvestigated the common aneuploidy gene expression (CAGE), disputing the conclusion of our recent work (2). In this short article, which has been submitted to PNAS as a Letter to the Editor, we explain our major concerns about Terhorst et a… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 1 page, no figure, with new analyses (a letter to PNAS Editor)

  10. arXiv:2006.04720  [pdf, other

    cs.NE cs.LG q-bio.PE stat.ML

    Host-Pathongen Co-evolution Inspired Algorithm Enables Robust GAN Training

    Authors: Andrei Kucharavy, El Mahdi El Mhamdi, Rachid Guerraoui

    Abstract: Generative adversarial networks (GANs) are pairs of artificial neural networks that are trained one against each other. The outputs from a generator are mixed with the real-world inputs to the discriminator and both networks are trained until an equilibrium is reached, where the discriminator cannot distinguish generated inputs from real ones. Since their introduction, GANs have allowed for the ge… ▽ More

    Submitted 9 June, 2020; v1 submitted 22 May, 2020; originally announced June 2020.

    Comments: 8 pages, 10 figures

    MSC Class: 92B20; 68T05; ACM Class: I.5.2

  11. arXiv:1902.01686  [pdf, other

    stat.ML cs.DC cs.LG cs.NE

    The Probabilistic Fault Tolerance of Neural Networks in the Continuous Limit

    Authors: El-Mahdi El-Mhamdi, Rachid Guerraoui, Andrei Kucharavy, Sergei Volodin

    Abstract: The loss of a few neurons in a brain rarely results in any visible loss of function. However, the insight into what "few" means in this context is unclear. How many random neuron failures will it take to lead to a visible loss of function? In this paper, we address the fundamental question of the impact of the crash of a random subset of neurons on the overall computation of a neural network and t… ▽ More

    Submitted 25 September, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: 10 pages (without references), 2 figures, 2 tables, 1 algorithm, 26 pages of supplementary material