Skip to main content

Showing 1–7 of 7 results for author: Chertok, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.15552  [pdf, other

    cs.LG cs.CE q-fin.CP

    Startup success prediction and VC portfolio simulation using CrunchBase data

    Authors: Mark Potanin, Andrey Chertok, Konstantin Zorin, Cyril Shtabtsovsky

    Abstract: Predicting startup success presents a formidable challenge due to the inherently volatile landscape of the entrepreneurial ecosystem. The advent of extensive databases like Crunchbase jointly with available open data enables the application of machine learning and artificial intelligence for more accurate predictive analytics. This paper focuses on startups at their Series B and Series C investmen… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 13 pages, preprint

    ACM Class: I.2.1; J.4

  2. arXiv:2206.12514  [pdf, other

    cs.CL

    DetIE: Multilingual Open Information Extraction Inspired by Object Detection

    Authors: Michael Vasilkovsky, Anton Alekseev, Valentin Malykh, Ilya Shenbin, Elena Tutubalina, Dmitriy Salikhov, Mikhail Stepnov, Andrey Chertok, Sergey Nikolenko

    Abstract: State of the art neural methods for open information extraction (OpenIE) usually extract triplets (or tuples) iteratively in an autoregressive or predicate-based manner in order not to produce duplicates. In this work, we propose a different approach to the problem that can be equally or more successful. Namely, we present a novel single-pass method for OpenIE inspired by object detection algorith… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted to the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

  3. arXiv:2202.10784  [pdf, other

    cs.CV cs.AI

    RuCLIP -- new models and experiments: a technical report

    Authors: Alex Shonenkov, Andrey Kuznetsov, Denis Dimitrov, Tatyana Shavrina, Daniil Chesakov, Anastasia Maltseva, Alena Fenogenova, Igor Pavlov, Anton Emelyanov, Sergey Markov, Daria Bakshandaeva, Vera Shybaeva, Andrey Chertok

    Abstract: In the report we propose six new implementations of ruCLIP model trained on our 240M pairs. The accuracy results are compared with original CLIP model with Ru-En translation (OPUS-MT) on 16 datasets from different domains. Our best implementations outperform CLIP + OPUS-MT solution on most of the datasets in few-show and zero-shot tasks. In the report we briefly describe the implementations and co… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  4. arXiv:2112.07395  [pdf, other

    cs.CV

    Handwritten text generation and strikethrough characters augmentation

    Authors: Alex Shonenkov, Denis Karachev, Max Novopoltsev, Mark Potanin, Denis Dimitrov, Andrey Chertok

    Abstract: We introduce two data augmentation techniques, which, used with a Resnet-BiLSTM-CTC network, significantly reduce Word Error Rate (WER) and Character Error Rate (CER) beyond best-reported results on handwriting text recognition (HTR) tasks. We apply a novel augmentation that simulates strikethrough text (HandWritten Blots) and a handwritten text generation method based on printed text (StackMix),… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 16 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2108.11667

    MSC Class: 68-04 ACM Class: I.7.5; I.4.6

  5. arXiv:2110.04228  [pdf, ps, other

    cs.LG

    Hybrid Graph Embedding Techniques in Estimated Time of Arrival Task

    Authors: Vadim Porvatov, Natalia Semenova, Andrey Chertok

    Abstract: Recently, deep learning has achieved promising results in the calculation of Estimated Time of Arrival (ETA), which is considered as predicting the travel time from the start point to a certain place along a given path. ETA plays an essential role in intelligent taxi services or automotive navigation systems. A common practice is to use embedding vectors to represent the elements of a road network… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

    Comments: Accepted in ICCNA 2021

  6. RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark

    Authors: Tatiana Shavrina, Alena Fenogenova, Anton Emelyanov, Denis Shevelev, Ekaterina Artemova, Valentin Malykh, Vladislav Mikhailov, Maria Tikhonova, Andrey Chertok, Andrey Evlampiev

    Abstract: In this paper, we introduce an advanced Russian general language understanding evaluation benchmark -- RussianGLUE. Recent advances in the field of universal language models and transformers require the development of a methodology for their broad diagnostics and testing for general intellectual skills - detection of natural language inference, commonsense reasoning, ability to perform simple logi… ▽ More

    Submitted 2 November, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: to appear in EMNLP 2020

  7. SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis

    Authors: Pavel Efimov, Andrey Chertok, Leonid Boytsov, Pavel Braslavski

    Abstract: SberQuAD -- a large scale analog of Stanford SQuAD in the Russian language - is a valuable resource that has not been properly presented to the scientific community. We fill this gap by providing a description, a thorough analysis, and baseline experimental results.

    Submitted 2 May, 2020; v1 submitted 20 December, 2019; originally announced December 2019.