Understanding attention-based encoder-decoder networks: a case study with chess scoresheet recognition

Hayashi, Sergio Y.; Hirata, Nina S. T.

doi:10.1109/ICPR56361.2022.9956133

arXiv:2406.06538 (cs)

[Submitted on 23 Apr 2024]

Abstract: Deep neural networks are widely used for complex prediction tasks. There is plenty of empirical evidence of their successful end-to-end training for a diversity of tasks. Success is often measured based solely on the final performance of the trained network, and explanations of when, why, and how they work are less emphasized. In this paper we study encoder-decoder recurrent neural networks with attention mechanisms for the task of reading handwritten chess scoresheets. Rather than prediction performance, our concern is to better understand how learning occurs in this type of network. We characterize the task in terms of three subtasks, namely input-output alignment, sequential pattern recognition, and handwriting recognition, and experimentally investigate which factors affect their learning. We identify competition, collaboration, and dependence relations between the subtasks, and argue that such knowledge might help one to better balance factors to properly train a network.

Comments:	This work was accepted and published in the 2022 26th International Conference on Pattern Recognition (ICPR)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2406.06538 [cs.CV]
	(or arXiv:2406.06538v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.06538
Journal reference:	2022 26th International Conference on Pattern Recognition (ICPR)
Related DOI:	https://doi.org/10.1109/ICPR56361.2022.9956133

Submission history

From: Sergio Hayashi Y
[v1] Tue, 23 Apr 2024 16:23:18 UTC (11,183 KB)
