Skip to main content

Showing 1–3 of 3 results for author: Lomshakov, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.04406  [pdf, ps, other

    cs.CL cs.AI cs.SE

    YABLoCo: Yet Another Benchmark for Long Context Code Generation

    Authors: Aidar Valeev, Roman Garaev, Vadim Lomshakov, Irina Piontkovskaya, Vladimir Ivanov, Israel Adewuyi

    Abstract: Large Language Models demonstrate the ability to solve various programming tasks, including code generation. Typically, the performance of LLMs is measured on benchmarks with small or medium-sized context windows of thousands of lines of code. At the same time, in real-world software projects, repositories can span up to millions of LoC. This paper closes this gap by contributing to the long conte… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: Presented at LLM4Code 2025 Workshop co-located wtih ICSE 2025

  2. arXiv:2505.02376  [pdf, other

    cs.SE

    LAMeD: LLM-generated Annotations for Memory Leak Detection

    Authors: Ekaterina Shemetova, Ilya Shenbin, Ivan Smirnov, Anton Alekseev, Alexey Rukhovich, Sergey Nikolenko, Vadim Lomshakov, Irina Piontkovskaya

    Abstract: Static analysis tools are widely used to detect software bugs and vulnerabilities but often struggle with scalability and efficiency in complex codebases. Traditional approaches rely on manually crafted annotations -- labeling functions as sources or sinks -- to track data flows, e.g., ensuring that allocated memory is eventually freed, and code analysis tools such as CodeQL, Infer, or Cooddy can… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  3. arXiv:2205.10692  [pdf, other

    cs.SE cs.LG

    All You Need Is Logs: Improving Code Completion by Learning from Anonymous IDE Usage Logs

    Authors: Vitaliy Bibaev, Alexey Kalina, Vadim Lomshakov, Yaroslav Golubev, Alexander Bezzubov, Nikita Povarov, Timofey Bryksin

    Abstract: In this work, we propose an approach for collecting completion usage logs from the users in an IDE and using them to train a machine learning based model for ranking completion candidates. We developed a set of features that describe completion candidates and their context, and deployed their anonymized collection in the Early Access Program of IntelliJ-based IDEs. We used the logs to collect a da… ▽ More

    Submitted 3 September, 2022; v1 submitted 21 May, 2022; originally announced May 2022.

    Comments: 11 pages, 4 figures