Skip to main content

Showing 1–6 of 6 results for author: Rykov, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.07704  [pdf, ps, other

    cs.CV cs.CL

    Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images

    Authors: Elisei Rykov, Kseniia Petrushina, Kseniia Titova, Anton Razzhigaev, Alexander Panchenko, Vasily Konovalov

    Abstract: Measuring how real images look is a complex task in artificial intelligence research. For example, an image of a boy with a vacuum cleaner in a desert violates common sense. We introduce a novel method, which we call Through the Looking Glass (TLG), to assess image common sense consistency using Large Vision-Language Models (LVLMs) and Transformer-based encoder. By leveraging LVLMs to extract atom… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Journal ref: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 4: Student Research Workshop)

  2. arXiv:2503.15948  [pdf, other

    cs.CV cs.AI cs.CL

    Don't Fight Hallucinations, Use Them: Estimating Image Realism using NLI over Atomic Facts

    Authors: Elisei Rykov, Kseniia Petrushina, Kseniia Titova, Alexander Panchenko, Vasily Konovalov

    Abstract: Quantifying the realism of images remains a challenging problem in the field of artificial intelligence. For example, an image of Albert Einstein holding a smartphone violates common-sense because modern smartphone were invented after Einstein's death. We introduce a novel method for assessing image realism using Large Vision-Language Models (LVLMs) and Natural Language Inference (NLI). Our approa… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: Proceedings of De-Factify 4: 4nd Workshop on Multimodal Fact Checking and Hate Speech Detection, co-located with AAAI-2025

  3. arXiv:2407.05449  [pdf, other

    cs.CL cs.AI

    SmurfCat at PAN 2024 TextDetox: Alignment of Multilingual Transformers for Text Detoxification

    Authors: Elisei Rykov, Konstantin Zaytsev, Ivan Anisimov, Alexandr Voronin

    Abstract: This paper presents a solution for the Multilingual Text Detoxification task in the PAN-2024 competition of the SmurfCat team. Using data augmentation through machine translation and a special filtering procedure, we collected an additional multilingual parallel dataset for text detoxification. Using the obtained data, we fine-tuned several multilingual sequence-to-sequence models, such as mT0 and… ▽ More

    Submitted 10 July, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

  4. arXiv:2406.18305  [pdf, other

    cs.CL cs.AI

    S3: A Simple Strong Sample-effective Multimodal Dialog System

    Authors: Elisei Rykov, Egor Malkershin, Alexander Panchenko

    Abstract: In this work, we present a conceptually simple yet powerful baseline for the multimodal dialog task, an S3 model, that achieves near state-of-the-art results on two compelling leaderboards: MMMU and AI Journey Contest 2023. The system is based on a pre-trained large language model, pre-trained modality encoders for image and audio, and a trainable modality projector. The proposed effective data mi… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  5. arXiv:2404.06137  [pdf, other

    cs.CL cs.AI

    SmurfCat at SemEval-2024 Task 6: Leveraging Synthetic Data for Hallucination Detection

    Authors: Elisei Rykov, Yana Shishkina, Kseniia Petrushina, Kseniia Titova, Sergey Petrakov, Alexander Panchenko

    Abstract: In this paper, we present our novel systems developed for the SemEval-2024 hallucination detection task. Our investigation spans a range of strategies to compare model predictions with reference standards, encompassing diverse baselines, the refinement of pre-trained encoders through supervised learning, and an ensemble approaches utilizing several high-performing models. Through these exploration… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 12 pages, 10 tables, 3 figures

  6. arXiv:2209.13750  [pdf, other

    cs.CL

    RuDSI: graph-based word sense induction dataset for Russian

    Authors: Anna Aksenova, Ekaterina Gavrishina, Elisey Rykov, Andrey Kutuzov

    Abstract: We present RuDSI, a new benchmark for word sense induction (WSI) in Russian. The dataset was created using manual annotation and semi-automatic clustering of Word Usage Graphs (WUGs). Unlike prior WSI datasets for Russian, RuDSI is completely data-driven (based on texts from Russian National Corpus), with no external word senses imposed on annotators. Depending on the parameters of graph clusterin… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: TextGraphs-16 workshop at the CoLING-2022 conference