Skip to main content

Showing 1–4 of 4 results for author: Shapkin, A

.
  1. arXiv:2406.11612  [pdf, ps, other

    cs.LG cs.AI cs.IR cs.SE

    Long Code Arena: a Set of Benchmarks for Long-Context Code Models

    Authors: Egor Bogomolov, Aleksandra Eliseeva, Timur Galimzyanov, Evgeniy Glukhov, Anton Shapkin, Maria Tigina, Yaroslav Golubev, Alexander Kovrigin, Arie van Deursen, Maliheh Izadi, Timofey Bryksin

    Abstract: Nowadays, the fields of code and natural language processing are evolving rapidly. In particular, models become better at processing long context windows - supported context sizes have increased by orders of magnitude over the last few years. However, there is a shortage of benchmarks for code processing that go beyond a single file of context, while the most popular ones are limited to a single m… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 54 pages, 4 figures, 22 tables

  2. arXiv:2405.19250  [pdf, ps, other

    cs.SE cs.AI cs.PL

    Kotlin ML Pack: Technical Report

    Authors: Sergey Titov, Mikhail Evtikhiev, Anton Shapkin, Oleg Smirnov, Sergei Boytsov, Sergei Boytsov, Dariia Karaeva, Maksim Sheptyakov, Mikhail Arkhipov, Timofey Bryksin, Egor Bogomolov

    Abstract: In this technical report, we present three novel datasets of Kotlin code: KStack, KStack-clean, and KExercises. We also describe the results of fine-tuning CodeLlama and DeepSeek models on this data. Additionally, we present a version of the HumanEval benchmark rewritten by human experts into Kotlin - both the solutions and the tests. Our results demonstrate that small, high-quality datasets (KSta… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2312.08976  [pdf, other

    cs.SE cs.LG

    Dynamic Retrieval-Augmented Generation

    Authors: Anton Shapkin, Denis Litvinov, Yaroslav Zharov, Egor Bogomolov, Timur Galimzyanov, Timofey Bryksin

    Abstract: Current state-of-the-art large language models are effective in generating high-quality text and encapsulating a broad spectrum of world knowledge. These models, however, often hallucinate and lack locally relevant factual data. Retrieval-augmented approaches were introduced to overcome these problems and provide more accurate responses. Typically, the retrieved information is simply appended to t… ▽ More

    Submitted 20 February, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 10 pages

  4. arXiv:1011.4902  [pdf

    q-bio.NC

    Recording and Reproduction of Pattern Memory Trace in EEG by Direct Electrical Stimulation of Brain Cortex

    Authors: A. G. Shapkin, M. V. Taborov, Yu. G. Shapkin

    Abstract: This study demonstrates the capability of external signal recording into memory and the reproduction of memory trace of this pattern in EEG by direct AC electrical stimulation of rat cerebral cortex. Additionally, we examine shifts of the DC potential level related to these phenomena. We show that in the course of memory trace reproduction, consecutive phases of engram activation and relaxation ar… ▽ More

    Submitted 22 November, 2011; v1 submitted 22 November, 2010; originally announced November 2010.

    Comments: Article: 9 pages, 3 figures

    Journal ref: Bulletin of ESCC SB RAMS, 2011, No.4(80), part 1, p. 289-294 (in Russian)