Showing 1–25 of 25 results for author: Fornés, A

  1. arXiv:2504.08616  [pdf, other]

    cs.CV

    Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition

    Authors: Lei Kang, Xuanshuo Fu, Lluis Gomez, Alicia Fornés, Ernest Valveny, Dimosthenis Karatzas

    Abstract: Handwritten Text Recognition (HTR) is essential for document analysis and digitization. However, handwritten data often contains user-identifiable information, such as unique handwriting styles and personal lexicon choices, which can compromise privacy and erode trust in AI services. Legislation like the "right to be forgotten" underscores the necessity for methods that can expunge sensitive inf…

    Submitted 11 April, 2025; originally announced April 2025.

  2. arXiv:2410.21913  [pdf, other]

    cs.CV cs.DL

    Structured Analysis and Comparison of Alphabets in Historical Handwritten Ciphers

    Authors: Martín Méndez, Pau Torras, Adrià Molina, Jialuo Chen, Oriol Ramos-Terrades, Alicia Fornés

    Abstract: Historical ciphered manuscripts are documents that were typically used in sensitive communications within military and diplomatic contexts or among members of secret societies. These secret messages were concealed by inventing a method of writing employing symbols from diverse sources such as digits, alchemy signs and Latin or Greek characters. When studying a new, unseen cipher, the automatic sea…

    Submitted 29 October, 2024; originally announced October 2024.

    Comments: Accepted at ECCV24 Workshop AI4DH

  3. arXiv:2408.07259  [pdf, other]

    cs.CV cs.AI

    GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models

    Authors: Lei Kang, Fei Yang, Kai Wang, Mohamed Ali Souibgui, Lluis Gomez, Alicia Fornés, Ernest Valveny, Dimosthenis Karatzas

    Abstract: Fonts are integral to creative endeavors, design processes, and artistic productions. The appropriate selection of a font can significantly enhance artwork and endow advertisements with a higher level of expressivity. Despite the availability of numerous diverse font designs online, traditional retrieval-based methods for font selection are increasingly being supplanted by generation-based approac…

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: Accepted to ECAI2024

  4. A Unified Representation Framework for the Evaluation of Optical Music Recognition Systems

    Authors: Pau Torras, Sanket Biswas, Alicia Fornés

    Abstract: Modern-day Optical Music Recognition (OMR) is a fairly fragmented field. Most OMR approaches use datasets that are independent and incompatible between each other, making it difficult to both combine them and compare recognition systems built upon them. In this paper we identify the need of a common music representation language and propose the Music Tree Notation (MTN) format, with the idea to co…

    Submitted 6 September, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 18 pages, 4 figures, 3 tables, submitted (under review) for the International Journal on Document Analysis and Recognition

    ACM Class: I.4.9; J.5

    Journal ref: International Journal on Document Analysis and Recognition (IJDAR), Volume 27, 2024, pp. 379-393

  5. I Can't Believe It's Not Better: In-air Movement For Alzheimer Handwriting Synthetic Generation

    Authors: Asma Bensalah, Antonio Parziale, Giuseppe De Gregorio, Angelo Marcelli, Alicia Fornés, Josep Lladós

    Abstract: During recent years, there has been a boom in the use of deep learning for handwriting analysis and recognition. One main application for handwriting analysis is early detection and diagnosis in the health field. Unfortunately, most real case problems still suffer a scarcity of data, which makes the use of deep learning-based models difficult. To alleviate this problem, some works resort to…

    Submitted 8 December, 2023; originally announced December 2023.

  6. arXiv:2303.09347   

    cs.CV

    CSSL-MHTR: Continual Self-Supervised Learning for Scalable Multi-script Handwritten Text Recognition

    Authors: Marwa Dhiaf, Mohamed Ali Souibgui, Kai Wang, Yuyang Liu, Yousri Kessentini, Alicia Fornés, Ahmed Cheikh Rouhou

    Abstract: Self-supervised learning has recently emerged as a strong alternative in document analysis. These approaches are now capable of learning high-quality image representations and overcoming the limitations of supervised methods, which require a large amount of labeled data. However, these methods are unable to capture new knowledge in an incremental fashion, where data is presented to the model seque…

    Submitted 26 April, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Due to current company policy constraints, we are compelled to withdraw our paper. The organization's guidelines prohibit us from proceeding with the publication of this work at this time. We apologize for any inconvenience this may cause and appreciate your understanding in this matter

  7. Easing Automatic Neurorehabilitation via Classification and Smoothness Analysis

    Authors: Asma Bensalah, Alicia Fornés, Cristina Carmona-Duarte, Josep Lladós

    Abstract: Assessing the quality of movements for post-stroke patients during the rehabilitation phase is vital given that there is no standard stroke rehabilitation plan for all the patients. In fact, it depends basically on the patient's functional independence and its progress along the rehabilitation sessions. To tackle this challenge and make neurorehabilitation more agile, we propose an automatic asses…

    Submitted 9 December, 2022; originally announced December 2022.

  8. The RPM3D project: 3D Kinematics for Remote Patient Monitoring

    Authors: Alicia Fornés, Asma Bensalah, Cristina Carmona-Duarte, Jialuo Chen, Miguel A. Ferrer, Andreas Fischer, Josep Lladós, Cristina Martín, Eloy Opisso, Réjean Plamondon, Anna Scius-Bertrand, Josep Maria Tormos

    Abstract: This project explores the feasibility of remote patient monitoring based on the analysis of 3D movements captured with smartwatches. We base our analysis on the Kinematic Theory of Rapid Human Movement. We have validated our research in a real case scenario for stroke rehabilitation at the Guttmann Institute (neurorehabilitation hospital), showing promising results. Our work could have a great im…

    Submitted 9 December, 2022; originally announced December 2022.

  9. Towards Stroke Patients' Upper-limb Automatic Motor Assessment Using Smartwatches

    Authors: Asma Bensalah, Jialuo Chen, Alicia Fornés, Cristina Carmona-Duarte, Josep Lladós, Miguel A. Ferrer

    Abstract: Assessing the physical condition in rehabilitation scenarios is a challenging problem, since it involves Human Activity Recognition (HAR) and kinematic analysis methods. In addition, the difficulties increase in unconstrained rehabilitation scenarios, which are much closer to the real use cases. In particular, our aim is to design an upper-limb assessment pipeline for stroke patients using smartwa…

    Submitted 9 December, 2022; originally announced December 2022.

  10. arXiv:2209.10441  [pdf, other]

    cs.CV

    A Few Shot Multi-Representation Approach for N-gram Spotting in Historical Manuscripts

    Authors: Giuseppe De Gregorio, Sanket Biswas, Mohamed Ali Souibgui, Asma Bensalah, Josep Lladós, Alicia Fornés, Angelo Marcelli

    Abstract: Despite recent advances in automatic text recognition, the performance remains moderate when it comes to historical manuscripts. This is mainly because of the scarcity of available labelled data to train the data-hungry Handwritten Text Recognition (HTR) models. The Keyword Spotting System (KWS) provides a valid alternative to HTR due to the reduction in error rate, but it is usually limited to a…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: Accepted in ICFHR 2022

  11. Content and Style Aware Generation of Text-line Images for Handwriting Recognition

    Authors: Lei Kang, Pau Riba, Marçal Rusiñol, Alicia Fornés, Mauricio Villegas

    Abstract: Handwritten Text Recognition has achieved an impressive performance in public benchmarks. However, due to the high inter- and intra-class variability between handwriting styles, such recognizers need to be trained using huge volumes of manually labeled training data. To alleviate this labor-consuming problem, synthetic data produced with TrueType fonts has been often used in the training loop to g…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: Accepted to TPAMI

  12. arXiv:2203.04814  [pdf, other]

    cs.CV

    Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

    Authors: Mohamed Ali Souibgui, Sanket Biswas, Andres Mafla, Ali Furkan Biten, Alicia Fornés, Yousri Kessentini, Josep Lladós, Lluis Gomez, Dimosthenis Karatzas

    Abstract: In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) and document image enhancement. We start by employing a transformer-based architecture that incorporates three pretext tasks as learning objectives to be optimized during pre-training without the usage of labeled data. E…

    Submitted 18 August, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Preprint

  13. arXiv:2201.10252  [pdf, other]

    cs.CV

    DocEnTr: An End-to-End Document Image Enhancement Transformer

    Authors: Mohamed Ali Souibgui, Sanket Biswas, Sana Khamekhem Jemni, Yousri Kessentini, Alicia Fornés, Josep Lladós, Umapada Pal

    Abstract: Document images can be affected by many degradation scenarios, which cause recognition and processing difficulties. In this age of digitization, it is important to denoise them for proper usage. To address this challenge, we present a new encoder-decoder architecture based on vision transformers to enhance both machine-printed and handwritten document images, in an end-to-end fashion. The encoder…

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: submitted to ICPR 2022

  14. Few Shots Are All You Need: A Progressive Few Shot Learning Approach for Low Resource Handwritten Text Recognition

    Authors: Mohamed Ali Souibgui, Alicia Fornés, Yousri Kessentini, Beáta Megyesi

    Abstract: Handwritten text recognition in low resource scenarios, such as manuscripts with rare alphabets, is a challenging problem. The main difficulty comes from the very few annotated data and the limited linguistic information (e.g. dictionaries and language models). Thus, we propose a few-shot learning-based handwriting recognition approach that significantly reduces the human labor annotation process,…

    Submitted 13 June, 2022; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: Accepted in Pattern Recognition Letters

  15. Enhance to Read Better: A Multi-Task Adversarial Network for Handwritten Document Image Enhancement

    Authors: Sana Khamekhem Jemni, Mohamed Ali Souibgui, Yousri Kessentini, Alicia Fornés

    Abstract: Handwritten document images can be highly affected by degradation for different reasons: Paper ageing, daily-life scenarios (wrinkles, dust, etc.), bad scanning process and so on. These artifacts raise many readability issues for current Handwritten Text Recognition (HTR) algorithms and severely devalue their efficiency. In this paper, we propose an end to end architecture based on Generative Adve…

    Submitted 22 October, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted in Pattern Recognition

  16. arXiv:2105.05300  [pdf, other]

    cs.CV

    One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

    Authors: Mohamed Ali Souibgui, Ali Furkan Biten, Sounak Dey, Alicia Fornés, Yousri Kessentini, Lluis Gomez, Dimosthenis Karatzas, Josep Lladós

    Abstract: Low resource Handwritten Text Recognition (HTR) is a hard problem due to the scarce annotated data and the very limited linguistic information (dictionaries and language models). For example, in the case of historical ciphered manuscripts, which are usually written with invented alphabets to hide the message contents. Thus, in this paper we address this problem through a data generation technique…

    Submitted 5 October, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: Accepted in WACV 2022

  17. arXiv:2009.12577  [pdf, other]

    cs.CV

    A Few-shot Learning Approach for Historical Ciphered Manuscript Recognition

    Authors: Mohamed Ali Souibgui, Alicia Fornés, Yousri Kessentini, Crina Tudor

    Abstract: Encoded (or ciphered) manuscripts are a special type of historical documents that contain encrypted text. The automatic recognition of this kind of documents is challenging because: 1) the cipher alphabet changes from one document to another, 2) there is a lack of annotated corpus for training and 3) touching symbols make the symbol segmentation difficult and complex. To overcome these difficultie…

    Submitted 26 September, 2020; originally announced September 2020.

    Comments: Accepted in the 25th International Conference on Pattern Recognition (ICPR2020), Milan, Italy 10 - 15 January 2021 (Camera Ready Version)

  18. arXiv:2008.07641  [pdf, other]

    cs.CV cs.LG

    Learning Graph Edit Distance by Graph Neural Networks

    Authors: Pau Riba, Andreas Fischer, Josep Lladós, Alicia Fornés

    Abstract: The emergence of geometric deep learning as a novel framework to deal with graph-based representations has faded away traditional approaches in favor of completely new methodologies. In this paper, we propose a new framework able to combine the advances on deep metric learning with traditional approximations of the graph edit distance. Hence, we propose an efficient graph distance based on the nov…

    Submitted 17 August, 2020; originally announced August 2020.

  19. arXiv:2005.13044  [pdf, other]

    cs.CV

    Pay Attention to What You Read: Non-recurrent Handwritten Text-Line Recognition

    Authors: Lei Kang, Pau Riba, Marçal Rusiñol, Alicia Fornés, Mauricio Villegas

    Abstract: The advent of recurrent neural networks for handwriting recognition marked an important milestone reaching impressive recognition accuracies despite the great variability that we observe across different writing styles. Sequential architectures are a perfect fit to model text lines, not only because of the inherent temporal aspect of text, but also to learn probability distributions over sequences…

    Submitted 26 May, 2020; originally announced May 2020.

  20. arXiv:2003.02567  [pdf, other]

    cs.CV

    GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images

    Authors: Lei Kang, Pau Riba, Yaxing Wang, Marçal Rusiñol, Alicia Fornés, Mauricio Villegas

    Abstract: Although current image generation methods have reached impressive quality levels, they are still unable to produce plausible yet diverse images of handwritten words. On the contrary, when writing by hand, a great variability is observed across different writers, and even when analyzing words scribbled by the same individual, involuntary variations are conspicuous. In this work, we take a step clos…

    Submitted 21 July, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted to ECCV2020

  21. arXiv:1912.10308  [pdf, other]

    cs.CV cs.CL

    Candidate Fusion: Integrating Language Modelling into a Sequence-to-Sequence Handwritten Word Recognition Architecture

    Authors: Lei Kang, Pau Riba, Mauricio Villegas, Alicia Fornés, Marçal Rusiñol

    Abstract: Sequence-to-sequence models have recently become very popular for tackling handwritten word recognition problems. However, how to effectively integrate an external language model into such recognizer is still a challenging problem. The main challenge faced when training a language model is to deal with the language model corpus which is usually different to the one used for training the handwritte…

    Submitted 21 December, 2019; originally announced December 2019.

  22. arXiv:1912.10016  [pdf, other]

    cs.CV

    A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages

    Authors: Manuel Carbonell, Alicia Fornés, Mauricio Villegas, Josep Lladós

    Abstract: In the last years, the consolidation of deep neural network architectures for information extraction in document images has brought big improvements in the performance of each of the tasks involved in this process, consisting of text localization, transcription, and named entity recognition. However, this process is traditionally performed with separate methods for each task. In this work we propo…

    Submitted 4 May, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

    Comments: To be published in Pattern Recognition Letters

  23. Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition

    Authors: Lei Kang, Marçal Rusiñol, Alicia Fornés, Pau Riba, Mauricio Villegas

    Abstract: Handwritten Text Recognition (HTR) is still a challenging problem because it must deal with two important difficulties: the variability among writing styles, and the scarcity of labelled data. To alleviate such problems, synthetic data generation and data augmentation are typically used to train HTR systems. However, training with such data produces encouraging but still inaccurate transcriptions…

    Submitted 26 May, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

    Comments: Accepted to WACV 2020

  24. arXiv:1807.02839  [pdf, other]

    cs.CV cs.LG stat.ML

    Hierarchical stochastic graphlet embedding for graph-based pattern recognition

    Authors: Anjan Dutta, Pau Riba, Josep Lladós, Alicia Fornés

    Abstract: Despite being very successful within the pattern recognition and machine learning community, graph-based methods are often unusable because of the lack of mathematical operations defined in graph domain. Graph embedding, which maps graphs to a vectorial space, has been proposed as a way to tackle these difficulties enabling the use of standard machine learning techniques. However, it is well known…

    Submitted 4 January, 2020; v1 submitted 8 July, 2018; originally announced July 2018.

    Comments: In Neural Computing and Applications (17 pages, 5 figures, 6 tables)

  25. arXiv:1803.06252  [pdf, other]

    cs.CV cs.CL

    Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model

    Authors: Manuel Carbonell, Mauricio Villegas, Alicia Fornés, Josep Lladós

    Abstract: When extracting information from handwritten documents, text transcription and named entity recognition are usually faced as separate subsequent tasks. This has the disadvantage that errors in the first module affect heavily the performance of the second module. In this work we propose to do both tasks jointly, using a single neural network with a common architecture used for plain text recognitio…

    Submitted 22 March, 2018; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: To appear in IAPR International Workshop on Document Analysis Systems 2018 (DAS 2018)