Showing 1–3 of 3 results for author: de Sá, J M C

Searching in archive cs.
  1. arXiv:2407.16624  [pdf, other]

    cs.CL

    Semantic Change Characterization with LLMs using Rhetorics

    Authors: Jader Martins Camboim de Sá, Marcos Da Silveira, Cédric Pruski

    Abstract: Languages continually evolve in response to societal events, resulting in new terms and shifts in meanings. These changes have significant implications for computer applications, including automatic translation and chatbots, making it essential to characterize them accurately. The recent development of LLMs has notably advanced natural language understanding, particularly in sense inference and re…

    Submitted 23 July, 2024; originally announced July 2024.

  2. arXiv:2402.19088  [pdf, other]

    cs.CL cs.AI

    Survey in Characterization of Semantic Change

    Authors: Jader Martins Camboim de Sá, Marcos Da Silveira, Cédric Pruski

    Abstract: Live languages continuously evolve to integrate the cultural change of human societies. This evolution manifests through neologisms (new words) or semantic changes of words (new meaning to existing words). Understanding the meaning of words is vital for interpreting texts coming from different cultures (regionalism or slang), domains (e.g., technical terms), or periods. In computer scienc…

    Submitted 18 July, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  3. arXiv:2308.01849  [pdf, other]

    cs.CL cs.LG

    Curricular Transfer Learning for Sentence Encoded Tasks

    Authors: Jader Martins Camboim de Sá, Matheus Ferraroni Sanches, Rafael Roque de Souza, Júlio Cesar dos Reis, Leandro Aparecido Villas

    Abstract: Fine-tuning language models in a downstream task is the standard approach for many state-of-the-art methodologies in the field of NLP. However, when the distribution between the source task and target task drifts, e.g., conversational environments, these gains tend to be diminished. This article proposes a sequence of pre-training steps (a curriculum) guided by "data hacking" and grammar…

    Submitted 3 August, 2023; originally announced August 2023.