Showing 1–39 of 39 results for author: Nikolaev, D

Searching in archive cs.
  1. arXiv:2503.07179  [pdf, other]

    cs.CL

    Strategies for political-statement segmentation and labelling in unstructured text

    Authors: Dmitry Nikolaev, Sean Papay

    Abstract: Analysis of parliamentary speeches and political-party manifestos has become an integral area of computational study of political texts. While speeches have been overwhelmingly analysed using unsupervised methods, a large corpus of manifestos with by-statement political-stance labels has been created by the participants of the MARPOR project. It has been recently shown that these labels can be pre…

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Accepted to NLP4DH 2025 @ NAACL 2025

  2. arXiv:2411.07351  [pdf, other]

    cs.CV

    Generalization of Brady-Yong Algorithm for Fast Hough Transform to Arbitrary Image Size

    Authors: Danil Kazimirov, Dmitry Nikolaev, Ekaterina Rybakova, Arseniy Terekhin

    Abstract: Nowadays, the Hough (discrete Radon) transform (HT/DRT) has proved to be an extremely powerful and widespread tool harnessed in a number of application areas, ranging from general image processing to X-ray computed tomography. Efficient utilization of the HT to solve applied problems demands its acceleration and increased accuracy. Along with this, most fast algorithms for computing the HT, especi…

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: 6 pages, 2 figures. Accepted to Symposium on Pattern Recognition and Applications 2024 (SPRA 2024)

  3. arXiv:2407.00475  [pdf, other]

    cs.CL

    Classifier identification in Ancient Egyptian as a low-resource sequence-labelling task

    Authors: Dmitry Nikolaev, Jorke Grotenhuis, Haleli Harel, Orly Goldwasser

    Abstract: The complex Ancient Egyptian (AE) writing system was characterised by widespread use of graphemic classifiers (determinatives): silent (unpronounced) hieroglyphic signs clarifying the meaning or indicating the pronunciation of the host word. The study of classifiers has intensified in recent years with the launch and quick growth of the iClassifier project, a web-based platform for annotation and…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: Accepted to ML4AL 2024 (First Machine Learning for Ancient Languages Workshop)

  4. arXiv:2402.17649  [pdf, other]

    cs.CL cs.CY

    Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs

    Authors: Tanise Ceron, Neele Falk, Ana Barić, Dmitry Nikolaev, Sebastian Padó

    Abstract: Due to the widespread use of large language models (LLMs), we need to understand whether they embed a specific "worldview" and what these views reflect. Recent studies report that, prompted with political questionnaires, LLMs show left-liberal leanings (Feng et al., 2023; Motoki et al., 2024). However, it is as yet unclear whether these leanings are reliable (robust to prompt variations) and wheth…

    Submitted 8 August, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 12 pages, TACL

  5. arXiv:2402.02883  [pdf, other]

    cs.CL cs.LG

    Approximate Attributions for Off-the-Shelf Siamese Transformers

    Authors: Lucas Möller, Dmitry Nikolaev, Sebastian Padó

    Abstract: Siamese encoders such as sentence transformers are among the least understood deep models. Established attribution methods cannot tackle this model class since it compares two inputs rather than processing a single one. To address this gap, we have recently proposed an attribution method specifically for Siamese encoders (Möller et al., 2023). However, it requires models to be adjusted and fine-tu…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted for EACL 2024, St. Julian's, Malta

  6. Unfolder: Fast localization and image rectification of a document with a crease from folding in half

    Authors: A. M. Ershov, D. V. Tropin, E. E. Limonova, D. P. Nikolaev, V. V. Arlazarov

    Abstract: Presentation of folded documents is not an uncommon case in modern society. Digitizing such documents by capturing them with a smartphone camera can be tricky since a crease can divide the document contents into separate planes. To unfold the document, one could hold the edges potentially obscuring it in a captured image. While there are many geometrical rectification methods, they were usually de…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: This is a preprint of the article accepted for publication in the journal "Computer Optics"

  7. arXiv:2310.13583  [pdf, other]

    cs.CL cs.LG

    Improving Cross-Lingual Transfer through Subtree-Aware Word Reordering

    Authors: Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend

    Abstract: Despite the impressive growth of the abilities of multilingual language models, such as XLM-R and mT5, it has been shown that they still face difficulties when tackling typologically-distant languages, particularly in the low-resource setting. One obstacle for effective cross-lingual transfer is variability in word-order patterns. It can be potentially mitigated via source- or target-side word reo…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP Findings 2023

  8. arXiv:2310.12575  [pdf, other]

    cs.CL

    Multilingual estimation of political-party positioning: From label aggregation to long-input Transformers

    Authors: Dmitry Nikolaev, Tanise Ceron, Sebastian Padó

    Abstract: Scaling analysis is a technique in computational political science that assigns a political actor (e.g. politician or party) a score on a predefined scale based on a (typically long) body of text (e.g. a parliamentary speech or an election manifesto). For example, political scientists have often used the left–right scale to systematically analyse political landscapes of different countries. NLP m…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  9. arXiv:2310.11923  [pdf, other]

    cs.CL

    Investigating semantic subspaces of Transformer sentence embeddings through linear structural probing

    Authors: Dmitry Nikolaev, Sebastian Padó

    Abstract: The question of what kinds of linguistic information are encoded in different layers of Transformer-based language models is of considerable interest for the NLP community. Existing work, however, has overwhelmingly focused on word-level representations and encoder-only language models with the masked-token training objective. In this paper, we present experiments with semantic structural probing,…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted to BlackboxNLP 2023

  10. arXiv:2310.05703  [pdf, other]

    cs.CL cs.AI cs.LG

    An Attribution Method for Siamese Encoders

    Authors: Lucas Möller, Dmitry Nikolaev, Sebastian Padó

    Abstract: Despite the success of Siamese encoder models such as sentence transformers (ST), little is known about the aspects of inputs they pay attention to. A barrier is that their predictions cannot be attributed to individual features, as they compare two inputs rather than processing a single one. This paper derives a local attribution method for Siamese encoders by generalizing the principle of integr…

    Submitted 29 November, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP'23

  11. arXiv:2305.19650  [pdf, other]

    cs.CL

    Adverbs, Surprisingly

    Authors: Dmitry Nikolaev, Collin F. Baker, Miriam R. L. Petruck, Sebastian Padó

    Abstract: This paper begins with the premise that adverbs are neglected in computational linguistics. This view derives from two analyses: a literature review and a novel adverb dataset to probe a state-of-the-art language model, thereby uncovering systematic gaps in accounts for adverb meaning. We suggest that using Frame Semantics for characterizing word meaning, as in FrameNet, provides a promising appro…

    Submitted 31 May, 2023; originally announced May 2023.

  12. arXiv:2305.10136  [pdf, other]

    cs.CL cs.CY

    Additive manifesto decomposition: A policy domain aware method for understanding party positioning

    Authors: Tanise Ceron, Dmitry Nikolaev, Sebastian Padó

    Abstract: Automatic extraction of party (dis)similarities from texts such as party election manifestos or parliamentary speeches plays an increasing role in computational political science. However, existing approaches are fundamentally limited to targeting only global party (dis)similarity: they condense the relationship between a pair of parties into a single figure, their similarity. In aggregating over…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  13. arXiv:2301.13039  [pdf, other]

    cs.CL

    Representation biases in sentence transformers

    Authors: Dmitry Nikolaev, Sebastian Padó

    Abstract: Variants of the BERT architecture specialised for producing full-sentence representations often achieve better performance on downstream tasks than sentence embeddings extracted from vanilla BERT. However, there is still little understanding of what properties of inputs determine the properties of such representations. In this study, we construct several sets of sentences with pre-defined lexical…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted to EACL 2023

  14. arXiv:2205.11987  [pdf, other]

    cs.CL

    Word-order typology in Multilingual BERT: A case study in subordinate-clause detection

    Authors: Dmitry Nikolaev, Sebastian Padó

    Abstract: The capabilities and limitations of BERT and similar models are still unclear when it comes to learning syntactic abstractions, in particular across languages. In this paper, we use the task of subordinate-clause detection within and across languages to probe these properties. We show that this task is deceptively simple, with easy gains offset by a long tail of harder cases, and that BERT's zero-…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in the proceedings of SIGTYP workshop 2022

  15. arXiv:2205.09120  [pdf, other]

    cs.LG

    Fast matrix multiplication for binary and ternary CNNs on ARM CPU

    Authors: Anton Trusov, Elena Limonova, Dmitry Nikolaev, Vladimir V. Arlazarov

    Abstract: Low-bit quantized neural networks are of great interest in practical applications because they significantly reduce the consumption of both memory and computational resources. Binary neural networks are memory and computationally efficient as they require only one bit per weight and activation and can be computed using Boolean logic and bit count operations. QNNs with ternary weights and activatio…

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted to 26th International Conference on Pattern Recognition (ICPR 2022)

  16. On the properties of some low-parameter models for color reproduction in terms of spectrum transformations and coverage of a color triangle

    Authors: Alexey Kroshnin, Viacheslav Vasilev, Egor Ershov, Denis Shepelev, Dmitry Nikolaev, Mikhail Tchobanou

    Abstract: One of the classical approaches to solving color reproduction problems, such as color adaptation or color space transform, is the use of low-parameter spectral models. The strength of this approach is the ability to choose a set of properties that the model should have, be it a large coverage area of a color triangle, an accurate description of the addition or multiplication of spectra, knowing on…

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 23 pages, 2 figures

  17. arXiv:2110.04644  [pdf, other]

    cs.CL cs.LG

    On the Relation between Syntactic Divergence and Zero-Shot Performance

    Authors: Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend

    Abstract: We explore the link between the extent to which syntactic relations are preserved in translation and the ease of correctly constructing a parse tree in a zero-shot setting. While previous work suggests such a relation, it tends to focus on the macro level and not on the level of individual edges, a gap we aim to address. As a test case, we take the transfer of Universal Dependencies (UD) parsing fr…

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Accepted to EMNLP 2021

  18. Advanced Hough-based method for on-device document localization

    Authors: D. V. Tropin, A. M. Ershov, D. P. Nikolaev, V. V. Arlazarov

    Abstract: The demand for on-device document recognition systems increases in conjunction with the emergence of stricter privacy and security requirements. In such systems, there is no data transfer from the end device to third-party information processing servers. The response time is vital to the user experience of on-device document recognition. Combined with the unavailability of discrete GPUs, powe…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: This is a preprint of the article submitted for publication in the journal "Computer Optics"

  19. arXiv:2106.00745  [pdf]

    cs.CL cs.AI cs.LG

    Part of Speech and Universal Dependency effects on English Arabic Machine Translation

    Authors: Ofek Rafaeli, Omri Abend, Leshem Choshen, Dmitry Nikolaev

    Abstract: In this research paper, I will elaborate on a method to evaluate machine translation models based on their performance on underlying syntactical phenomena between English and Arabic languages. This method is especially important as such "neural" and "machine learning" are hard to fine-tune and change. Thus, finding a way to evaluate them easily and diversely would greatly help the task of betterin…

    Submitted 3 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: 19 pages

  20. arXiv:2104.02310  [pdf, ps, other]

    cs.CL

    SERRANT: a syntactic classifier for English Grammatical Error Types

    Authors: Leshem Choshen, Matanel Oren, Dmitry Nikolaev, Omri Abend

    Abstract: SERRANT is a system and code for automatic classification of English grammatical errors that combines SErCl and ERRANT. SERRANT uses ERRANT's annotations when they are informative and those provided by SErCl otherwise.

    Submitted 7 April, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: Code library in: https://github.com/matanel-oren/serrant

  21. arXiv:2012.15779  [pdf, other]

    cs.CV

    Illumination Estimation Challenge: experience of past two years

    Authors: Egor Ershov, Alex Savchik, Ilya Semenkov, Nikola Banić, Karlo Koscević, Marko Subašić, Alexander Belokopytov, Zhihao Li, Arseniy Terekhin, Daria Senshina, Artem Nikonorov, Yanlin Qian, Marco Buzzelli, Riccardo Riva, Simone Bianco, Raimondo Schettini, Sven Lončarić, Dmitry Nikolaev

    Abstract: Illumination estimation is the essential step of computational color constancy, one of the core parts of various image processing pipelines of modern digital cameras. Having an accurate and reliable illumination estimation is important for reducing the illumination influence on the image colors. To motivate the generation of new ideas and the development of new algorithms in this field, the 2nd Il…

    Submitted 31 December, 2020; originally announced December 2020.

  22. arXiv:2012.07653  [pdf, other]

    cs.CV

    ProLab: perceptually uniform projective colour coordinate system

    Authors: Ivan A. Konovalenko, Anna A. Smagina, Dmitry P. Nikolaev, Petr P. Nikolaev

    Abstract: In this work, we propose proLab: a new colour coordinate system derived as a 3D projective transformation of CIE XYZ. We show that proLab is far ahead of the widely used CIELAB coordinate system (though inferior to the modern CAM16-UCS) according to perceptual uniformity evaluated by the STRESS metric in reference to the CIEDE2000 colour difference formula. At the same time, angular errors of chro…

    Submitted 11 January, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

  23. arXiv:2010.11032  [pdf, other]

    cs.CL

    Classifying Syntactic Errors in Learner Language

    Authors: Leshem Choshen, Dmitry Nikolaev, Yevgeni Berzak, Omri Abend

    Abstract: We present a method for classifying syntactic errors in learner language, namely errors whose correction alters the morphosyntactic structure of a sentence. The methodology builds on the established Universal Dependencies syntactic representation scheme, and provides complementary information to other error-classification systems. Unlike existing error classification methods, our method is app…

    Submitted 27 October, 2020; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: CoNLL 2020

  24. arXiv:2009.07190  [pdf, other]

    cs.CV

    ResNet-like Architecture with Low Hardware Requirements

    Authors: Elena Limonova, Daniil Alfonso, Dmitry Nikolaev, Vladimir V. Arlazarov

    Abstract: One of the most computationally intensive parts in modern recognition systems is an inference of deep neural networks that are used for image classification, segmentation, enhancement, and recognition. The growing popularity of edge computing makes us look for ways to reduce its time for mobile and embedded devices. One way to decrease the neural network inference time is to modify a neuron model…

    Submitted 21 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: Accepted to ICPR 2020. Corrected typos and bibliography formatting

  25. arXiv:2009.06488  [pdf, other]

    cs.CV

    Fast Implementation of 4-bit Convolutional Neural Networks for Mobile Devices

    Authors: Anton Trusov, Elena Limonova, Dmitry Slugin, Dmitry Nikolaev, Vladimir V. Arlazarov

    Abstract: Quantized low-precision neural networks are very popular because they require less computational resources for inference and can provide high performance, which is vital for real-time and embedded recognition systems. However, their advantages are apparent for FPGA and ASIC devices, while general-purpose processor architectures are not always able to perform low-bit integer computations efficientl…

    Submitted 20 October, 2020; v1 submitted 14 September, 2020; originally announced September 2020.

  26. arXiv:2008.08884  [pdf]

    cs.CV

    Line detection via a lightweight CNN with a Hough Layer

    Authors: Lev Teplyakov, Kirill Kaymakov, Evgeny Shvets, Dmitry Nikolaev

    Abstract: Line detection is an important computer vision task traditionally solved by Hough Transform. With the advance of deep learning, however, trainable approaches to line detection became popular. In this paper we propose a lightweight CNN for line detection with an embedded parameter-free Hough layer, which allows the network neurons to have global strip-like receptive fields. We argue that traditiona…

    Submitted 16 October, 2020; v1 submitted 20 August, 2020; originally announced August 2020.

  27. Approach for Document Detection by Contours and Contrasts

    Authors: Daniil V. Tropin, Sergey A. Ilyuhin, Dmitry P. Nikolaev, Vladimir V. Arlazarov

    Abstract: This paper considers arbitrary document detection performed on a mobile device. The classical contour-based approach often fails in cases featuring occlusion, complex background, or blur. The region-based approach, which relies on the contrast between object and background, does not have application limitations; however, its known implementations are highly resource-consuming. We propose a modific…

    Submitted 19 October, 2020; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: This paper has been accepted to the ICPR 2020 conference in Milan which will be held on the 10-15 January 2021. Therefore this work has not yet been presented

    Journal ref: 2020 25th International Conference on Pattern Recognition (ICPR), (2021) 9689-9695

  28. arXiv:2007.06289  [pdf, other]

    eess.IV cs.CV

    Accelerated FBP for computed tomography image reconstruction

    Authors: Anastasiya Dolmatova, Marina Chukalina, Dmitry Nikolaev

    Abstract: Filtered back projection (FBP) is a commonly used technique in tomographic image reconstruction demonstrating acceptable quality. The classical direct implementations of this algorithm require the execution of $\Theta(N^3)$ operations, where $N$ is the linear size of the 2D slice. Recent approaches including reconstruction via the Fourier slice theorem require $\Theta(N^2\log N)$ multiplication operations.…

    Submitted 13 July, 2020; originally announced July 2020.

  29. arXiv:2005.03436  [pdf, other]

    cs.CL

    Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences

    Authors: Dmitry Nikolaev, Ofir Arviv, Taelin Karidi, Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe, Omri Abend

    Abstract: The patterns in which the syntax of different languages converges and diverges are often used to inform work on cross-lingual transfer. Nevertheless, little empirical work has been done on quantifying the prevalence of different syntactic divergences across language pairs. We propose a framework for extracting divergence patterns for any language pair from a parallel corpus, building on Universal…

    Submitted 13 July, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

  30. arXiv:2002.09474  [pdf]

    cs.DC cs.CV eess.IV

    Fast Implementation of Morphological Filtering Using ARM NEON Extension

    Authors: Elena Limonova, Arseny Terekhin, Dmitry Nikolaev, Vladimir Arlazarov

    Abstract: In this paper we consider speedup potential of morphological image filtering on ARM processors. Morphological operations are widely used in image analysis and recognition and their speedup in some cases can significantly reduce overall execution time of recognition. More specifically, we propose fast implementation of erosion and dilation using ARM SIMD extension NEON. These operations with the re…

    Submitted 19 February, 2020; originally announced February 2020.

    Comments: 6 pages, 4 figures

    MSC Class: 68T45

    Journal ref: International Journal of Applied Engineering Research (ISSN 0973-4562), Volume 11, Number 24 (2016), pp. 11675-11680

  31. arXiv:2002.07754  [pdf]

    cs.CV eess.IV

    Computational optimization of convolutional neural networks using separated filters architecture

    Authors: Elena Limonova, Alexander Sheshkus, Dmitry Nikolaev

    Abstract: This paper considers a convolutional neural network transformation that reduces computational complexity and thus speeds up neural network processing. Usage of convolutional neural networks (CNN) is the standard approach to image recognition despite the fact they can be too computationally demanding, for example for recognition on mobile platforms or in embedded systems. In this paper we propose CNN…

    Submitted 18 February, 2020; originally announced February 2020.

    Comments: 4 pages, 3 figures

    MSC Class: 68T10

    Journal ref: International Journal of Applied Engineering Research (ISSN 0973-4562), Volume 11, Number 11 (2016), pp 7491-7494

  32. Vanishing Point Detection with Direct and Transposed Fast Hough Transform inside the neural network

    Authors: A. Sheshkus, A. Chirvonaya, D. Matveev, D. Nikolaev, V. L. Arlazarov

    Abstract: In this paper, we suggest a new neural network architecture for vanishing point detection in images. The key element is the use of the direct and transposed Fast Hough Transforms separated by convolutional layer blocks with standard activation functions. It allows us to get the answer in the coordinates of the input image at the output of the network and thus to calculate the coordinates of the va…

    Submitted 7 July, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

    Comments: 9 pages, 9 figures, submitted to "Computer Optics"; extra experiment added, new theorem proof added, references added; typos corrected

    Journal ref: Computer Optics 2020; 44(5): 737-745

  33. arXiv:1912.02504  [pdf, other]

    cs.CV

    A Document Skew Detection Method Using Fast Hough Transform

    Authors: Pavel Bezmaternykh, Dmitry Nikolaev

    Abstract: The majority of document image analysis systems use a document skew detection algorithm to simplify all their further processing stages. A huge number of such algorithms based on Hough transform (HT) analysis has already been proposed. Despite this, we managed to find only one work where the Fast Hough Transform (FHT) usage was suggested to solve the indicated problem. Unfortunately, no study of tha…

    Submitted 5 December, 2019; originally announced December 2019.

  34. arXiv:1912.01923  [pdf]

    cs.CV

    A Low Computational Approach for Price Tag Recognition

    Authors: M. A. Aliev, D. A. Bocharov, I. A. Kunina, D. P. Nikolaev

    Abstract: In this work we discuss the task of search, localization and recognition of price zone within a photograph of the price tag. The task is being addressed for the case when image is acquired by small-scale digital camera and calculation device has significant resource constraints. The proposed approach is based on Niblack binarization algorithm, analysis and clusterization of connected components in…

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: 9 pages, 7 figures, 3 tables

  35. arXiv:1912.01884  [pdf]

    cs.CV

    A Method of Detecting End-To-End Curves of Limited Curvature

    Authors: Ekaterina Panfilova, Mikhail Aliev, Irina Kunina, Vasiliy Postnikov, Dmitry Nikolaev

    Abstract: In this paper we consider a method for detecting end-to-end curves of limited curvature like the k-link polylines with bending angle between adjacent segments in a given range. The approximation accuracy is achieved by maximization of the quality function in the image matrix. The method is based on a dynamic programming scheme constructed over Fast Hough Transform calculation results for image ban…

    Submitted 4 December, 2019; originally announced December 2019.

  36. arXiv:1911.01971  [pdf, other]

    cs.NE cs.CV cs.LG eess.IV

    Bipolar Morphological Neural Networks: Convolution Without Multiplication

    Authors: Elena Limonova, Daniil Matveev, Dmitry Nikolaev, Vladimir V. Arlazarov

    Abstract: In the paper we introduce a novel bipolar morphological neuron and bipolar morphological layer models. The models use only such operations as addition, subtraction and maximum inside the neuron and exponent and logarithm as activation functions for the layer. The proposed models unlike previously introduced morphological neural networks approximate the classical computations and show better recogn…

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: Submitted to International Conference on Machine Vision 2019

  37. HoughNet: neural network architecture for vanishing points detection

    Authors: Alexander Sheshkus, Anastasia Ingacheva, Vladimir Arlazarov, Dmitry Nikolaev

    Abstract: In this paper we introduce a novel neural network architecture based on Fast Hough Transform layer. The layer of this type allows our neural network to accumulate features from linear areas across the entire image instead of local areas. We demonstrate its potential by solving the problem of vanishing points detection in the images of documents. Such a problem occurs when dealing with camera shots o…

    Submitted 6 October, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: 6 pages, 6 figures, 2 tables, 28 references, conference

    Journal ref: 15th International Conference on Document Analysis and Recognition (ICDAR 2019)

  38. Linear colour segmentation revisited

    Authors: Anna Smagina, Valentina Bozhkova, Sergey Gladilin, Dmitry Nikolaev

    Abstract: In this work we discuss the known algorithms for linear colour segmentation based on a physical approach and propose a new modification of segmentation algorithm. This algorithm is based on a region adjacency graph framework without a pre-segmentation stage. Proposed edge weight functions are defined from linear image model with normal noise. The colour space projective transform is introduced as…

    Submitted 2 January, 2019; originally announced January 2019.

    Journal ref: Proc. SPIE 11041, Eleventh International Conference on Machine Vision (ICMV 2018)

  39. arXiv:1811.06378  [pdf]

    eess.IV cs.CV

    On the use of FHT, its modification for practical applications and the structure of Hough image

    Authors: M. Aliev, E. I. Ershov, D. P. Nikolaev

    Abstract: This work focuses on the Fast Hough Transform (FHT) algorithm proposed by M.L. Brady. We propose how to modify the standard FHT to calculate sums along lines within any given range of their inclination angles. We also describe a new way to visualise Hough-image based on regrouping of accumulator space around its center. Finally, we prove that using Brady parameterization transforms any line into a…

    Submitted 14 November, 2018; originally announced November 2018.

    Comments: 8 pages, 8 figures. Submitted and presented at ICMV 2018