
Showing 1–10 of 10 results for author: Coquenet, D

  1. arXiv:2506.17051

    cs.CV

    Relaxed syntax modeling in Transformers for future-proof license plate recognition

    Authors: Florent Meyer, Laurent Guichard, Denis Coquenet, Guillaume Gravier, Yann Soullard, Bertrand Coüasnon

    Abstract: Effective license plate recognition systems are required to be resilient to constant change, as new license plates are released into traffic daily. While Transformer-based networks excel in their recognition at first sight, we observe significant performance drop over time which proves them unsuitable for tense production environments. Indeed, such systems obtain state-of-the-art results on plates…

    Submitted 20 June, 2025; originally announced June 2025.

  2. arXiv:2504.03349

    cs.CV

    Meta-DAN: towards an efficient prediction strategy for page-level handwritten text recognition

    Authors: Denis Coquenet

    Abstract: Recent advances in text recognition led to a paradigm shift for page-level recognition, from multi-step segmentation-based approaches to end-to-end attention-based ones. However, the naïve character-level autoregressive decoding process results in long prediction times: it requires several seconds to process a single page image on a modern GPU. We propose the Meta Document Attention Network (Meta-…

    Submitted 4 April, 2025; originally announced April 2025.

  3. arXiv:2307.06795

    cs.CV

    Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks

    Authors: Denis Coquenet, Clément Rambour, Emanuele Dalsasso, Nicolas Thome

    Abstract: Vision-language foundation models such as CLIP have shown impressive zero-shot performance on many tasks and datasets, especially thanks to their free-text inputs. However, they struggle to handle some downstream tasks, such as fine-grained attribute detection and localization. In this paper, we propose a multitask fine-tuning strategy based on a positive/negative prompt formulation to further lev…

    Submitted 13 July, 2023; originally announced July 2023.

  4. Faster DAN: Multi-target Queries with Document Positional Encoding for End-to-end Handwritten Document Recognition

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Recent advances in handwritten text recognition enabled to recognize whole documents in an end-to-end way: the Document Attention Network (DAN) recognizes the characters one after the other through an attention-based prediction process until reaching the end of the document. However, this autoregressive process leads to inference that cannot benefit from any parallelization optimization. In this p…

    Submitted 25 January, 2023; originally announced January 2023.

    Journal ref: International Conference on Document Analysis and Recognition - ICDAR 2023

  5. arXiv:2209.15362

    cs.CV

    Towards End-to-end Handwritten Document Recognition

    Authors: Denis Coquenet

    Abstract: Handwritten text recognition has been widely studied in the last decades for its numerous applications. Nowadays, the state-of-the-art approach consists in a three-step process. The document is segmented into text lines, which are then ordered and recognized. However, this three-step approach has many drawbacks. The three steps are treated independently whereas they are closely related. Errors acc…

    Submitted 20 October, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Ph.D Thesis

  6. DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwritten text recognition is a challenging computer vision task. It is traditionally handled by a two-step approach, combining line segmentation followed by text line recognition. For the first time, we propose an end-to-end segmentation-free architecture for the task of handwritten document recognition: the Document Attention Network. In addition to text recognition, the model is…

    Submitted 13 December, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 2023

  7. SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwriting recognition is an essential task in document analysis. It is usually carried out in two steps. First, the document is segmented into text lines. Second, an Optical Character Recognition model is applied on these line images. We propose the Simple Predict & Align Network: an end-to-end recurrence-free Fully Convolutional Network performing OCR at paragraph level without an…

    Submitted 17 February, 2021; originally announced February 2021.

    Journal ref: Document Analysis and Recognition - ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science, vol 12823

  8. Recurrence-free unconstrained handwritten text recognition using gated fully convolutional network

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwritten text recognition is a major step in most document analysis tasks. This is generally processed by deep recurrent neural networks and more specifically with the use of Long Short-Term Memory cells. The main drawbacks of these components are the large number of parameters involved and their sequential execution during training and prediction. One alternative solution to usin…

    Submitted 9 December, 2020; originally announced December 2020.

    Journal ref: 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR)

  9. Have convolutions already made recurrence obsolete for unconstrained handwritten text recognition?

    Authors: Denis Coquenet, Yann Soullard, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwritten text recognition remains an important challenge for deep neural networks. These last years, recurrent networks and more specifically Long Short-Term Memory networks have achieved state-of-the-art performance in this field. Nevertheless, they are made of a large number of trainable parameters and training recurrent neural networks does not support parallelism. This has a d…

    Submitted 9 December, 2020; originally announced December 2020.

    Journal ref: 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW)

  10. End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwritten text recognition remains challenging for computer vision systems. Paragraph text recognition is traditionally achieved by two models: the first one for line segmentation and the second one for text line recognition. We propose a unified end-to-end model using hybrid attention to tackle this task. This model is designed to iteratively process a paragraph image line by line…

    Submitted 3 December, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 2022