Showing 1–15 of 15 results for author: Nikolenko, S I

Searching in archive cs.
  1. arXiv:2412.13146  [pdf, ps, other]

    cs.CL

    Syntactic Transfer to Kyrgyz Using the Treebank Translation Method

    Authors: Anton Alekseev, Alina Tillabaeva, Gulnara Dzh. Kabaeva, Sergey I. Nikolenko

    Abstract: The Kyrgyz language, as a low-resource language, requires significant effort to create high-quality syntactic corpora. This study proposes an approach to simplify the development process of a syntactic corpus for Kyrgyz. We present a tool for transferring syntactic annotations from Turkish to Kyrgyz based on a treebank translation method. The effectiveness of the proposed tool was evaluated using…

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: To be published in the Journal of Math. Sciences. Zapiski version (in Russian): http://www.pdmi.ras.ru/znsl/2024/v540/abs252.html

  2. arXiv:2308.15952  [pdf, ps, other]

    cs.CL

    Benchmarking Multilabel Topic Classification in the Kyrgyz Language

    Authors: Anton Alekseev, Sergey I. Nikolenko, Gulnara Kabaeva

    Abstract: Kyrgyz is a very underrepresented language in terms of modern natural language processing resources. In this work, we present a new public benchmark for topic classification in Kyrgyz, introducing a dataset based on collected and annotated data from the news site 24.KG and presenting several baseline models for news classification in the multilabel setting. We train and evaluate both classical sta…

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted to AIST 2023

  3. RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback

    Authors: Ilya Shenbin, Anton Alekseev, Elena Tutubalina, Valentin Malykh, Sergey I. Nikolenko

    Abstract: Recent research has shown the advantages of using autoencoders based on deep neural networks for collaborative filtering. In particular, the recently proposed Mult-VAE model, which used the multinomial likelihood variational autoencoders, has shown excellent results for top-N recommendations. In this work, we propose the Recommender VAE (RecVAE) model that originates from our research on regulariz…

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: In The Thirteenth ACM International Conference on Web Search and Data Mining (WSDM '20), February 3-7, 2020, Houston, TX, USA. ACM, New York, NY, USA, 9 pages

  4. arXiv:1909.11512  [pdf, other]

    cs.LG cs.CR cs.CV

    Synthetic Data for Deep Learning

    Authors: Sergey I. Nikolenko

    Abstract: Synthetic data is an increasingly popular tool for training deep learning models, especially in computer vision but also in other areas. In this work, we attempt to provide a comprehensive survey of the various directions in the development and application of synthetic data. First, we discuss synthetic datasets for basic computer vision problems, both low-level (e.g., optical flow estimation) and…

    Submitted 25 September, 2019; originally announced September 2019.

    Comments: 156 pages, 24 figures, 719 references

  5. arXiv:1907.04399  [pdf, other]

    cs.NI cs.DS

    New Competitiveness Bounds for the Shared Memory Switch

    Authors: Ivan Bochkov, Alex Davydow, Nikita Gaevoy, Sergey I. Nikolenko

    Abstract: We consider one of the simplest and best known buffer management architectures: the shared memory switch with multiple output queues and uniform packets. It was one of the first models studied by competitive analysis, with the Longest Queue Drop (LQD) buffer management policy shown to be at least $\sqrt{2}$- and at most $2$-competitive; a general lower bound of $4/3$ has been proven for all determ…

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: 23 pages, 8 figures

    MSC Class: 68W27

  6. arXiv:1901.07829  [pdf, other]

    cs.CL cs.AI

    AspeRa: Aspect-based Rating Prediction Model

    Authors: Sergey I. Nikolenko, Elena Tutubalina, Valentin Malykh, Ilya Shenbin, Anton Alekseev

    Abstract: We propose a novel end-to-end Aspect-based Rating Prediction model (AspeRa) that estimates user rating based on review texts for the items and at the same time discovers coherent aspects of reviews that can be used to explain predictions or profile users. The AspeRa model uses max-margin losses for joint item and user embedding learning and a dual-headed architecture; it significantly outperforms…

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: accepted to ECIR 2019

  7. arXiv:1901.06345  [pdf, ps, other]

    cs.CV

    Adapting Convolutional Neural Networks for Geographical Domain Shift

    Authors: Pavel Ostyakov, Sergey I. Nikolenko

    Abstract: We present the winning solution for the Inclusive Images Competition organized as part of the Conference on Neural Information Processing Systems (NeurIPS 2018) Competition Track. The competition was organized to study ways to cope with domain shift in image processing, specifically geographical shift: the training and two test sets in the competition had different geographical distributions. Our…

    Submitted 18 January, 2019; originally announced January 2019.

  8. arXiv:1811.11067  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning State Representations in Complex Systems with Multimodal Data

    Authors: Pavel Solovev, Vladimir Aliev, Pavel Ostyakov, Gleb Sterkin, Elizaveta Logacheva, Stepan Troeshestov, Roman Suvorov, Anton Mashikhin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Representation learning becomes especially important for complex systems with multimodal data sources such as cameras or sensors. Recent advances in reinforcement learning and optimal control make it possible to design control algorithms on these latent representations, but the field still lacks a large-scale standard dataset for unified comparison. In this work, we present a large-scale dataset a…

    Submitted 15 January, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: Fixed references

  9. arXiv:1811.07630  [pdf, other]

    cs.CV cs.LG cs.NE

    SEIGAN: Towards Compositional Image Generation by Simultaneously Learning to Segment, Enhance, and Inpaint

    Authors: Pavel Ostyakov, Roman Suvorov, Elizaveta Logacheva, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: We present a novel approach to image manipulation and understanding by simultaneously learning to segment object masks, paste objects to another background image, and remove them from original images. For this purpose, we develop a novel generative model for compositional image generation, SEIGAN (Segment-Enhance-Inpaint Generative Adversarial Network), which learns these three operations together…

    Submitted 15 January, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

  10. arXiv:1809.04403  [pdf, other]

    cs.CV cs.LG

    Label Denoising with Large Ensembles of Heterogeneous Neural Networks

    Authors: Pavel Ostyakov, Elizaveta Logacheva, Roman Suvorov, Vladimir Aliev, Gleb Sterkin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Despite recent advances in computer vision based on various convolutional architectures, video understanding remains an important challenge. In this work, we present and discuss a top solution for the large-scale video classification (labeling) problem introduced as a Kaggle competition based on the YouTube-8M dataset. We show and compare different approaches to preprocessing, data augmentation, m…

    Submitted 15 January, 2019; v1 submitted 12 September, 2018; originally announced September 2018.

  11. arXiv:1211.2756  [pdf, other]

    q-bio.QM cs.CE cs.DS q-bio.GN

    BayesHammer: Bayesian clustering for error correction in single-cell sequencing

    Authors: Sergey I. Nikolenko, Anton I. Korobeynikov, Max A. Alekseyev

    Abstract: Error correction of sequenced reads remains a difficult task, especially in single-cell sequencing projects with extremely non-uniform coverage. While existing error correction tools designed for standard (multi-cell) sequencing data usually come up short in single-cell sequencing projects, algorithms actually used for single-cell error correction have been so far very simplistic. We introduce s…

    Submitted 12 November, 2012; originally announced November 2012.

    Journal ref: BMC Genomics 14(Suppl 1) (2013), pp. S7

  12. arXiv:1204.5443  [pdf, other]

    cs.NI

    FIFO Queueing Policies for Packets with Heterogeneous Processing

    Authors: Kirill Kogan, Alejandro López-Ortiz, Sergey I. Nikolenko, Alexander V. Sirotkin, Denis Tugaryov

    Abstract: We consider the problem of managing a bounded size First-In-First-Out (FIFO) queue buffer, where each incoming unit-sized packet requires several rounds of processing before it can be transmitted out. Our objective is to maximize the total number of successfully transmitted packets. We consider both push-out (when the policy is permitted to drop already admitted packets) and non-push-out cases. In…

    Submitted 24 April, 2012; originally announced April 2012.

    Comments: 15 pages

  13. arXiv:1202.5755  [pdf, other]

    cs.NI cs.PF

    Balancing Work and Size with Bounded Buffers

    Authors: Kirill Kogan, Alejandro Lopez-Ortiz, Sergey I. Nikolenko, Gabriel Scalosub, Michael Segal

    Abstract: We consider the fundamental problem of managing a bounded size queue buffer where traffic consists of packets of varying size, where each packet requires several rounds of processing before it can be transmitted from the queue buffer. The goal in such an environment is to maximize the overall size of packets that are successfully transmitted. This model is motivated by the ever-growing ubiquity of…

    Submitted 5 September, 2013; v1 submitted 26 February, 2012; originally announced February 2012.

    Comments: 22 pages, 7 figures

  14. arXiv:0802.2863  [pdf, ps, other]

    cs.CC cs.CR

    New Combinatorial Complete One-Way Functions

    Authors: Arist Kojevnikov, Sergey I. Nikolenko

    Abstract: In 2003, Leonid A. Levin presented the idea of a combinatorial complete one-way function and a sketch of the proof that Tiling represents such a function. In this paper, we present two new one-way functions based on semi-Thue string rewriting systems and a version of the Post Correspondence Problem and prove their completeness. Besides, we present an alternative proof of Levin's result. We also…

    Submitted 20 February, 2008; originally announced February 2008.

    Journal ref: In Proceedings of the 25th Annual Symposium on the Theoretical Aspects of Computer Science - STACS 2008, Bordeaux, France (2008)

  15. arXiv:cs/0301012  [pdf, ps, other]

    cs.CC

    Hard satisfiable formulas for DPLL-type algorithms

    Authors: Sergey I. Nikolenko

    Abstract: We address lower bounds on the time complexity of algorithms solving the propositional satisfiability problem. Namely, we consider two DPLL-type algorithms, enhanced with the unit clause and pure literal heuristics. Exponential lower bounds for solving satisfiability on provably satisfiable formulas are proven.

    Submitted 15 January, 2003; originally announced January 2003.

    Comments: 9 pages

    ACM Class: F.2.2