Skip to main content

Showing 1–10 of 10 results for author: Favre, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.05491  [pdf, other

    cs.LG

    Statistical Deficiency for Task Inclusion Estimation

    Authors: Loïc Fosse, Frédéric Béchet, Benoît Favre, Géraldine Damnati, Gwénolé Lecorvé, Maxime Darrin, Philippe Formont, Pablo Piantanida

    Abstract: Tasks are central in machine learning, as they are the most natural objects to assess the capabilities of current models. The trend is to build general models able to address any task. Even though transfer learning and multitask learning try to leverage the underlying task space, no well-founded tools are available to study its structure. This study proposes a theoretically grounded setup to defin… ▽ More

    Submitted 13 March, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

    Comments: 34 pages

  2. arXiv:2409.10070  [pdf, ps, other

    cs.CL cs.AI

    Increasing faithfulness in human-human dialog summarization with Spoken Language Understanding tasks

    Authors: Eunice Akani, Benoit Favre, Frederic Bechet, Romain Gemignani

    Abstract: Dialogue summarization aims to provide a concise and coherent summary of conversations between multiple speakers. While recent advancements in language models have enhanced this process, summarizing dialogues accurately and faithfully remains challenging due to the need to understand speaker interactions and capture relevant information. Indeed, abstractive models used for dialog summarization may… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

  3. arXiv:2311.04922  [pdf, other

    cs.CL cs.AI eess.AS eess.SP

    Are cascade dialogue state tracking models speaking out of turn in spoken dialogues?

    Authors: Lucas Druart, Léo Jacqmin, Benoît Favre, Lina Maria Rojas-Barahona, Valentin Vielzeuf

    Abstract: In Task-Oriented Dialogue (TOD) systems, correctly updating the system's understanding of the user's needs is key to a smooth interaction. Traditionally TOD systems are composed of several modules that interact with one another. While each of these components is the focus of active research communities, their behavior in interaction can be overlooked. This paper proposes a comprehensive analysis o… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Submitted to IEEE ICASSP 2024© 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  4. arXiv:2304.11073  [pdf, other

    eess.AS cs.AI cs.CL cs.SD

    OLISIA: a Cascade System for Spoken Dialogue State Tracking

    Authors: Léo Jacqmin, Lucas Druart, Yannick Estève, Benoît Favre, Lina Maria Rojas-Barahona, Valentin Vielzeuf

    Abstract: Though Dialogue State Tracking (DST) is a core component of spoken dialogue systems, recent work on this task mostly deals with chat corpora, disregarding the discrepancies between spoken and written language.In this paper, we propose OLISIA, a cascade system which integrates an Automatic Speech Recognition (ASR) model and a DST model. We introduce several adaptations in the ASR and DST modules to… ▽ More

    Submitted 31 August, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  5. arXiv:2210.12079  [pdf, other

    cs.CL cs.CV

    Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies?

    Authors: Mitja Nikolaus, Emmanuelle Salin, Stephane Ayache, Abdellah Fourtassi, Benoit Favre

    Abstract: Recent advances in vision-and-language modeling have seen the development of Transformer architectures that achieve remarkable performance on multimodal reasoning tasks. Yet, the exact capabilities of these black-box models are still poorly understood. While much of previous work has focused on studying their ability to learn meaning at the word-level, their ability to track syntactic dependencies… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: To appear at EMNLP 2022

  6. arXiv:2207.14627  [pdf, other

    cs.CL

    "Do you follow me?": A Survey of Recent Approaches in Dialogue State Tracking

    Authors: Léo Jacqmin, Lina M. Rojas-Barahona, Benoit Favre

    Abstract: While communicating with a user, a task-oriented dialogue system has to track the user's needs at each turn according to the conversation history. This process called dialogue state tracking (DST) is crucial because it directly informs the downstream dialogue policy. DST has received a lot of interest in recent years with the text-to-text paradigm emerging as the favored approach. In this review p… ▽ More

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: SIGDIAL 2022

  7. arXiv:2207.01893  [pdf, other

    cs.CL

    ASR-Generated Text for Language Model Pre-training Applied to Speech Tasks

    Authors: Valentin Pelloin, Franck Dary, Nicolas Herve, Benoit Favre, Nathalie Camelin, Antoine Laurent, Laurent Besacier

    Abstract: We aim at improving spoken language modeling (LM) using very large amount of automatically transcribed speech. We leverage the INA (French National Audiovisual Institute) collection and obtain 19GB of text after applying ASR on 350,000 hours of diverse TV shows. From this, spoken language models are trained either by fine-tuning an existing LM (FlauBERT) or through training a LM from scratch. New… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: Interspeech 2022 (Camera Ready)

  8. arXiv:2201.03017  [pdf, other

    cs.CL cs.IR

    Zero-Shot and Few-Shot Classification of Biomedical Articles in Context of the COVID-19 Pandemic

    Authors: Simon Lupart, Benoit Favre, Vassilina Nikoulina, Salah Ait-Mokhtar

    Abstract: MeSH (Medical Subject Headings) is a large thesaurus created by the National Library of Medicine and used for fine-grained indexing of publications in the biomedical domain. In the context of the COVID-19 pandemic, MeSH descriptors have emerged in relation to articles published on the corresponding topic. Zero-shot classification is an adequate response for timely labeling of the stream of papers… ▽ More

    Submitted 11 January, 2022; v1 submitted 9 January, 2022; originally announced January 2022.

    Comments: to be published at the AAAI-22 Workshop on Scientific Document Understanding

  9. Robust Semantic Parsing with Adversarial Learning for Domain Generalization

    Authors: Gabriel Marzinotto, Geraldine Damnati, Frédéric Béchet, Benoit Favre

    Abstract: This paper addresses the issue of generalization for Semantic Parsing in an adversarial framework. Building models that are more robust to inter-document variability is crucial for the integration of Semantic Parsing technologies in real applications. The underlying question throughout this study is whether adversarial learning can be used to train models on a higher level of abstraction in order… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

    Journal ref: Proceedings of the 2019 Conference of the North, Jun 2019, Minneapolis - Minnesota, France. pp.166-173

  10. arXiv:1612.05202  [pdf, other

    cs.CL

    Building a robust sentiment lexicon with (almost) no resource

    Authors: Mickael Rouvier, Benoit Favre

    Abstract: Creating sentiment polarity lexicons is labor intensive. Automatically translating them from resourceful languages requires in-domain machine translation systems, which rely on large quantities of bi-texts. In this paper, we propose to replace machine translation by transferring words from the lexicon through word embeddings aligned across languages with a simple linear transform. The approach lea… ▽ More

    Submitted 15 December, 2016; originally announced December 2016.