Showing 1–8 of 8 results for author: Lukács, A

Searching in archive cs.
  1. arXiv:2506.02048

    cs.CR cs.AI

    Improving LLM Agents with Reinforcement Learning on Cryptographic CTF Challenges

    Authors: Lajos Muzsai, David Imolai, András Lukács

    Abstract: Large Language Models (LLMs) still struggle with the structured reasoning and tool-assisted computation needed for problem solving in cybersecurity applications. In this work, we introduce "random-crypto", a cryptographic Capture-the-Flag (CTF) challenge generator framework that we use to fine-tune a tool-augmented Llama-3.1-8B with Guided Reinforcement Prompt Optimisation (GRPO), allowing the age…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: 11 pages, 1 figure

    MSC Class: 68M25 ACM Class: I.2.1; K.6.5

  2. arXiv:2412.01778

    cs.CR cs.AI

    HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing

    Authors: Lajos Muzsai, David Imolai, András Lukács

    Abstract: We introduce HackSynth, a novel Large Language Model (LLM)-based agent capable of autonomous penetration testing. HackSynth's dual-module architecture includes a Planner and a Summarizer, which enable it to generate commands and process feedback iteratively. To benchmark HackSynth, we propose two new Capture The Flag (CTF)-based benchmark sets utilizing the popular platforms PicoCTF and OverTheWir…

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 16 pages, 9 figures

    MSC Class: 68M25 ACM Class: I.2.1; K.6.5

  3. arXiv:2410.18677   

    cs.CV cs.LG eess.IV

    Enhancing pretraining efficiency for medical image segmentation via transferability metrics

    Authors: Gábor Hidy, Bence Bakos, András Lukács

    Abstract: In medical image segmentation tasks, the scarcity of labeled training data poses a significant challenge when training deep neural networks. When using U-Net-style architectures, it is common practice to address this problem by pretraining the encoder part on a large general-purpose dataset like ImageNet. However, these methods are resource-intensive and do not guarantee improved performance on th…

    Submitted 6 June, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: An error was discovered in the aggregation process of our results, particularly affecting the experiments involving the advanced pretraining method. This impacts the main conclusions of the paper, and we are therefore withdrawing the submission.

    ACM Class: I.4.6

  4. arXiv:2410.03776

    cs.LG stat.ML

    Parameter Estimation of Long Memory Stochastic Processes with Deep Neural Networks

    Authors: Bálint Csanády, Lóránt Nagy, Dániel Boros, Iván Ivkovic, Dávid Kovács, Dalma Tóth-Lakits, László Márkus, András Lukács

    Abstract: We present a purely deep neural network-based approach for estimating long memory parameters of time series models that incorporate the phenomenon of long-range dependence. Parameters, such as the Hurst exponent, are critical in characterizing the long-range dependence, roughness, and self-similarity of stochastic processes. The accurate and fast estimation of these parameters holds significant im…

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: 14 pages, 16 figures, https://github.com/aielte-research/LMSParEst

    MSC Class: 68T07; 62M45; 60G22 ACM Class: I.2.m; G.3

  5. arXiv:2403.15938

    cs.CL cs.AI cs.LG

    LlamBERT: Large-scale low-cost data annotation in NLP

    Authors: Bálint Csanády, Lajos Muzsai, Péter Vedres, Zoltán Nádasdy, András Lukács

    Abstract: Large Language Models (LLMs), such as GPT-4 and Llama 2, show remarkable proficiency in a wide range of natural language processing (NLP) tasks. Despite their effectiveness, the high costs associated with their use pose a challenge. We present LlamBERT, a hybrid approach that leverages LLMs to annotate a small subset of large, unlabeled databases and uses the results for fine-tuning transformer en…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 11 pages, 1 figure

    ACM Class: I.2.7; F.1.1

  6. arXiv:2401.01789

    stat.ML cs.AI cs.LG

    Deep learning the Hurst parameter of linear fractional processes and assessing its reliability

    Authors: Dániel Boros, Bálint Csanády, Iván Ivkovic, Lóránt Nagy, András Lukács, László Márkus

    Abstract: This research explores the reliability of deep learning, specifically Long Short-Term Memory (LSTM) networks, for estimating the Hurst parameter in fractional stochastic processes. The study focuses on three types of processes: fractional Brownian motion (fBm), fractional Ornstein-Uhlenbeck (fOU) process, and linear fractional stable motions (lfsm). The work involves a fast generation of extensive…

    Submitted 3 January, 2024; originally announced January 2024.

    MSC Class: 68T07

  7. arXiv:2201.06757

    cs.CL cs.AI cs.LG

    Dilated Convolutional Neural Networks for Lightweight Diacritics Restoration

    Authors: Bálint Csanády, András Lukács

    Abstract: Diacritics restoration has become a ubiquitous task in the Latin-alphabet-based English-dominated Internet language environment. In this paper, we describe a small footprint 1D dilated convolution-based approach which operates on a character-level. We find that solutions based on 1D dilated convolutional neural networks are competitive alternatives to models based on recursive neural networks or l…

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: 7 pages, 2 figures

  8. arXiv:2003.10304

    eess.IV cs.CV cs.LG

    Attention U-Net Based Adversarial Architectures for Chest X-ray Lung Segmentation

    Authors: Gusztáv Gaál, Balázs Maga, András Lukács

    Abstract: Chest X-ray is the most common test among medical imaging modalities. It is applied for detection and differentiation of, among others, lung cancer, tuberculosis, and pneumonia, the last with importance due to the COVID-19 disease. Integrating computer-aided detection methods into the radiologist diagnostic pipeline greatly reduces the doctors' workload, increasing reliability and quantitative an…

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: 7 pages, 4 figures