-
Co-creation for Sign Language Processing and Machine Translation
Authors:
Lisa Lepp,
Dimitar Shterionov,
Mirella De Sisto,
Grzegorz Chrupała
Abstract:
Sign language machine translation (SLMT) -- the task of automatically translating between sign and spoken languages or between sign languages -- is a complex task within the field of NLP. Its multi-modal and non-linear nature requires the joint efforts of sign language (SL) linguists, technical experts and SL users. Effective user involvement is a challenge that can be addressed through co-creation. Co-creation has been formally defined in many fields, e.g. business, marketing and education; however, in NLP, and in particular in SLMT, there is no formal, widely accepted definition. Starting from the inception and evolution of co-creation across various fields over time, we develop a relationship typology to address the collaboration between deaf, Hard of Hearing and hearing researchers and the co-creation with SL users. We compare this new typology to the guiding principles of participatory design for NLP. We then assess 110 articles from the perspective of involvement of SL users and highlight the lack of involvement of the sign language community or users in decision-making processes required for effective co-creation. Finally, we derive formal guidelines for co-creation for SLMT which take the dynamic nature of co-creation throughout the life cycle of a research project into account.
Submitted 3 March, 2025;
originally announced March 2025.
-
AI in Support of Diversity and Inclusion
Authors:
Çiçek Güven,
Afra Alishahi,
Henry Brighton,
Gonzalo Nápoles,
Juan Sebastian Olier,
Marie Šafář,
Eric Postma,
Dimitar Shterionov,
Mirella De Sisto,
Eva Vanmassenhove
Abstract:
In this paper, we elaborate on how AI can support diversity and inclusion and exemplify research projects conducted in that direction. We start by looking at the challenges and progress in making large language models (LLMs) more transparent, inclusive, and aware of social biases. Even though LLMs like ChatGPT have impressive abilities, they struggle to understand different cultural contexts and engage in meaningful, human-like conversations. A key issue is that biases in language processing, especially in machine translation, can reinforce inequality. Tackling these biases requires a multidisciplinary approach to ensure AI promotes diversity, fairness, and inclusion. We also highlight AI's role in identifying biased content in media, which is important for improving representation. By detecting unequal portrayals of social groups, AI can help challenge stereotypes and create more inclusive technologies. Transparent AI algorithms, which clearly explain their decisions, are essential for building trust and reducing bias in AI systems. We also stress that AI systems need diverse and inclusive training data. Projects like the Child Growth Monitor show how using a wide range of data can help address real-world problems like malnutrition and poverty. We present a project that demonstrates how AI can be applied to monitor the role of search engines in spreading disinformation about the LGBTQ+ community. Moreover, we discuss the SignON project as an example of how technology can bridge communication gaps between hearing and deaf people, emphasizing the importance of collaboration and mutual trust in developing inclusive AI. Overall, with this paper, we advocate for AI systems that are not only effective but also socially responsible, promoting fair and inclusive interactions between humans and machines.
Submitted 16 January, 2025;
originally announced January 2025.
-
Metronome: tracing variation in poetic meters via local sequence alignment
Authors:
Ben Nagy,
Artjoms Šeļa,
Mirella De Sisto,
Petr Plecháč
Abstract:
All poetic forms come from somewhere. Prosodic templates can be copied for generations, altered by individuals, imported from foreign traditions, or fundamentally changed under the pressures of language evolution. Yet these relationships are notoriously difficult to trace across languages and times. This paper introduces an unsupervised method for detecting structural similarities in poems using local sequence alignment. The method relies on encoding poetic texts as strings of prosodic features using a four-letter alphabet; these sequences are then aligned to derive a distance measure based on weighted symbol (mis)matches. Local alignment allows poems to be clustered according to emergent properties of their underlying prosodic patterns. We evaluate the method's performance on a meter recognition task against strong baselines and show its potential for cross-lingual and historical research using three short case studies: 1) mutations in quantitative meter in classical Latin, 2) European diffusion of the Renaissance hendecasyllable, and 3) comparative alignment of modern meters in 18th--19th century Czech, German and Russian. We release an implementation of the algorithm as a Python package with an open license.
Submitted 26 April, 2024;
originally announced April 2024.
-
Tailoring Domain Adaptation for Machine Translation Quality Estimation
Authors:
Javad Pourmostafa Roshan Sharami,
Dimitar Shterionov,
Frédéric Blain,
Eva Vanmassenhove,
Mirella De Sisto,
Chris Emmery,
Pieter Spronck
Abstract:
While quality estimation (QE) can play an important role in the translation process, its effectiveness relies on the availability and quality of training data. For QE in particular, high-quality labeled data is often lacking due to the high cost and effort associated with labeling such data. Aside from the data scarcity challenge, QE models should also be generalizable, i.e., they should be able to handle data from different domains, both generic and specific. To alleviate these two main issues -- data scarcity and domain mismatch -- this paper combines domain adaptation and data augmentation within a robust QE system. Our method first trains a generic QE model and then fine-tunes it on a specific domain while retaining generic knowledge. Our results show a significant improvement for all the language pairs investigated, better cross-lingual inference, and a superior performance in zero-shot learning scenarios as compared to state-of-the-art baselines.
Submitted 9 May, 2023; v1 submitted 18 April, 2023;
originally announced April 2023.
-
Astronomical orientations in sanctuaries of Daunia
Authors:
E. Antonello,
V. F. Polcaro,
A. M. Tunzi Sisto,
M. LoZupone
Abstract:
Prehistoric sanctuaries of Daunia date back several thousand years. During the Neolithic and Bronze Age the farmers in that region dug hypogea and holes whose characteristics suggest a ritual use. In the present note we summarize the results of the astronomical analysis of the orientation of the row holes in three different sites, and we point out the possible use of the setting of the stars of Centaurus. An interesting archaeological confirmation of an archaeoastronomical prediction is also reported.
Submitted 8 July, 2013;
originally announced July 2013.
-
Contemporary presence of dynamical and statistical production of intermediate mass fragments in midperipheral $^{58}$Ni+$^{58}$Ni collisions at 30 MeV/nucleon
Authors:
P. M. Milazzo,
G. Vannini,
M. Sisto,
C. Agodi,
R. Alba,
G. Bellia,
M. Belkacem,
M. Bruno,
M. Colonna,
N. Colonna,
R. Coniglione,
M. D'Agostino,
A. Del Zoppo,
L. Fabbietti,
P. Finocchiaro,
F. Gramegna,
I. Iori,
K. Loukachine,
C. Maiolino,
G. V. Margagliotti,
P. F. Mastinu,
E. Migneco,
A. Moroni,
P. Piattelli,
R. Rui,
et al. (3 additional authors not shown)
Abstract:
The $^{58}$Ni+$^{58}$Ni reaction at 30 MeV/nucleon has been experimentally investigated at the Superconducting Cyclotron of the INFN Laboratori Nazionali del Sud. In midperipheral collisions, the production of massive fragments (4$\le$Z$\le$12), consistent with the statistical fragmentation of the projectile-like residue and the dynamical formation of a neck joining projectile-like and target-like residues, has been observed. The fragments coming from these different processes differ both in charge distribution and isotopic composition. In particular, it is shown that these mechanisms leading to fragment production act simultaneously inside the same event.
Submitted 19 March, 2001; v1 submitted 5 October, 2000;
originally announced October 2000.