Showing 1–9 of 9 results for author: Konovalov, V

Searching in archive cs.
  1. arXiv:2505.21115

    cs.CL

    Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

    Authors: Sergey Pletenev, Maria Marina, Nikolay Ivanov, Daria Galimzianova, Nikita Krayko, Mikhail Salnikov, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii

    Abstract: Large Language Models (LLMs) often hallucinate in question answering (QA) tasks. A key yet underexplored factor contributing to this is the temporality of questions -- whether they are evergreen (answers remain stable over time) or mutable (answers change). In this work, we introduce EverGreenQA, the first multilingual QA dataset with evergreen labels, supporting both evaluation and training. Usin…

    Submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2505.07704

    cs.CV cs.CL

    Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images

    Authors: Elisei Rykov, Kseniia Petrushina, Kseniia Titova, Anton Razzhigaev, Alexander Panchenko, Vasily Konovalov

    Abstract: Measuring how real images look is a complex task in artificial intelligence research. For example, an image of a boy with a vacuum cleaner in a desert violates common sense. We introduce a novel method, which we call Through the Looking Glass (TLG), to assess image common sense consistency using Large Vision-Language Models (LVLMs) and a Transformer-based encoder. By leveraging LVLMs to extract atom…

    Submitted 12 May, 2025; originally announced May 2025.

    Journal ref: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 4: Student Research Workshop)

  3. arXiv:2505.04253

    cs.CL cs.LG

    LLM-Independent Adaptive RAG: Let the Question Speak for Itself

    Authors: Maria Marina, Nikolay Ivanov, Sergey Pletenev, Mikhail Salnikov, Daria Galimzianova, Nikita Krayko, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii

    Abstract: Large Language Models (LLMs) are prone to hallucinations, and Retrieval-Augmented Generation (RAG) helps mitigate this, but at a high computational cost while risking misinformation. Adaptive retrieval aims to retrieve only when necessary, but existing approaches rely on LLM-based uncertainty estimation, which remains inefficient and impractical. In this study, we introduce lightweight LLM-independ…

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 11 pages, 5 figures, 2 tables

  4. arXiv:2503.15948

    cs.CV cs.AI cs.CL

    Don't Fight Hallucinations, Use Them: Estimating Image Realism using NLI over Atomic Facts

    Authors: Elisei Rykov, Kseniia Petrushina, Kseniia Titova, Alexander Panchenko, Vasily Konovalov

    Abstract: Quantifying the realism of images remains a challenging problem in the field of artificial intelligence. For example, an image of Albert Einstein holding a smartphone violates common sense because modern smartphones were invented after Einstein's death. We introduce a novel method for assessing image realism using Large Vision-Language Models (LVLMs) and Natural Language Inference (NLI). Our approa…

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: Proceedings of De-Factify 4: 4th Workshop on Multimodal Fact Checking and Hate Speech Detection, co-located with AAAI-2025

  5. arXiv:2502.14502

    cs.CL

    How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

    Authors: Sergey Pletenev, Maria Marina, Daniil Moskovskiy, Vasily Konovalov, Pavel Braslavski, Alexander Panchenko, Mikhail Salnikov

    Abstract: The performance of Large Language Models (LLMs) on many tasks is greatly limited by the knowledge learned during pre-training and stored in the model's parameters. Low-rank adaptation (LoRA) is a popular and efficient training technique for updating LLMs or adapting them to specific domains. In this study, we investigate how new facts can be incorporated into the LLM using LoRA without compromising th…

    Submitted 24 March, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  6. arXiv:2501.12835

    cs.CL cs.LG

    Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home

    Authors: Viktor Moskvoretskii, Maria Lysyuk, Mikhail Salnikov, Nikolay Ivanov, Sergey Pletenev, Daria Galimzianova, Nikita Krayko, Vasily Konovalov, Irina Nikishina, Alexander Panchenko

    Abstract: Retrieval Augmented Generation (RAG) improves the correctness of Question Answering (QA) and addresses hallucinations in Large Language Models (LLMs), yet greatly increases computational costs. Besides, RAG is not always needed, as it may introduce irrelevant information. Recent adaptive retrieval methods integrate LLMs' intrinsic knowledge with external information appealing to LLM self-knowledge, but the…

    Submitted 21 February, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

    Comments: The code and data are at https://github.com/s-nlp/AdaRAGUE

  7. arXiv:2405.10629

    cs.CL cs.AI

    DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for Detecting Boundaries of Machine-Generated Texts

    Authors: Anastasia Voznyuk, Vasily Konovalov

    Abstract: The Multigenerator, Multidomain, and Multilingual Black-Box Machine-Generated Text Detection shared task in the SemEval-2024 competition aims to tackle the problem of misusing collaborative human-AI writing. Although there are many existing detectors of AI content, they are often designed to give a binary answer and thus may not be suitable for the more nuanced problem of finding the boundaries be…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: New best score from the leaderboard, to appear in SemEval-2024 Workshop proceedings

  8. arXiv:2205.02340

    cs.CL cs.LG

    Knowledge Distillation of Russian Language Models with Reduction of Vocabulary

    Authors: Alina Kolesnikova, Yuri Kuratov, Vasily Konovalov, Mikhail Burtsev

    Abstract: Today, transformer language models serve as a core component for the majority of natural language processing tasks. Industrial application of such models requires minimization of computation time and memory footprint. Knowledge distillation is one of the approaches to address this goal. Existing methods in this field are mainly focused on reducing the number of layers or dimension of embeddings/hidden rep…

    Submitted 4 May, 2022; originally announced May 2022.

  9. arXiv:2002.02450

    cs.CL cs.LG stat.ML

    Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker

    Authors: Pavel Gulyaev, Eugenia Elistratova, Vasily Konovalov, Yuri Kuratov, Leonid Pugachev, Mikhail Burtsev

    Abstract: Dialogue State Tracking (DST) is a core component of virtual assistants such as Alexa or Siri. To accomplish various tasks, these assistants need to support an increasing number of services and APIs. The Schema-Guided State Tracking track of the 8th Dialogue System Technology Challenge highlighted the DST problem for unseen services. The organizers introduced the Schema-Guided Dialogue (SGD) datas…

    Submitted 5 February, 2020; originally announced February 2020.