Showing 1–39 of 39 results for author: Clavel, C

  1. arXiv:2504.01902  [pdf, other]

    cs.CL cs.AI cs.LG

    Graphically Speaking: Unmasking Abuse in Social Media with Conversation Insights

    Authors: Célia Nouri, Jean-Philippe Cointet, Chloé Clavel

    Abstract: Detecting abusive language in social media conversations poses significant challenges, as identifying abusiveness often depends on the conversational context, characterized by the content and topology of preceding comments. Traditional Abusive Language Detection (ALD) models often overlook this context, which can lead to unreliable performance metrics. Recent Natural Language Processing (NLP) meth…

    Submitted 2 April, 2025; originally announced April 2025.

  2. arXiv:2412.10271  [pdf, other]

    cs.CL

    Benchmarking Linguistic Diversity of Large Language Models

    Authors: Yanzhu Guo, Guokan Shang, Chloé Clavel

    Abstract: The development and evaluation of Large Language Models (LLMs) has primarily focused on their task-solving capabilities, with recent models even surpassing human performance in some areas. However, this focus often neglects whether machine-generated language matches the human level of diversity, in terms of vocabulary choice, syntactic construction, and expression of meaning, raising questions abo…

    Submitted 13 December, 2024; originally announced December 2024.

  3. arXiv:2412.04492  [pdf, other]

    cs.CL cs.AI cs.HC cs.SI

    Socio-Emotional Response Generation: A Human Evaluation Protocol for LLM-Based Conversational Systems

    Authors: Lorraine Vanel, Ariel R. Ramos Vela, Alya Yacoubi, Chloé Clavel

    Abstract: Conversational systems are now capable of producing impressive and generally relevant responses. However, we have neither visibility into nor control over the socio-emotional strategies behind state-of-the-art Large Language Models (LLMs), which poses a problem in terms of their transparency and thus their trustworthiness for critical applications. Another issue is that current automated metrics are not able…

    Submitted 26 November, 2024; originally announced December 2024.

    Journal ref: AHRI 2024, Sep 2024, Glasgow, United Kingdom

  4. arXiv:2408.08782  [pdf, ps, other]

    cs.CL

    EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics

    Authors: Chenwei Wan, Matthieu Labeau, Chloé Clavel

    Abstract: Designing emotionally intelligent conversational systems to provide comfort and advice to people experiencing distress is a compelling area of research. Recently, with advancements in large language models (LLMs), end-to-end dialogue agents without explicit strategy prediction steps have become prevalent. However, implicit strategy planning lacks transparency, and recent studies show that LLMs' in…

    Submitted 16 June, 2025; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: Accepted to NAACL 2025 main, long paper

  5. arXiv:2405.13769  [pdf, other]

    cs.CL

    Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation

    Authors: Cyril Chhun, Fabian M. Suchanek, Chloé Clavel

    Abstract: Storytelling is an integral part of human experience and plays a crucial role in social interactions. Thus, Automatic Story Evaluation (ASE) and Generation (ASG) could benefit society in multiple ways, but they are challenging tasks which require high-level human abilities such as creativity, reasoning and deep understanding. Meanwhile, Large Language Models (LLMs) now achieve state-of-the-art perf…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: TACL, pre-MIT Press publication version

  6. arXiv:2402.14616  [pdf, other]

    cs.CL

    The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations

    Authors: Aina Garí Soler, Matthieu Labeau, Chloé Clavel

    Abstract: When deriving contextualized word representations from language models, a decision needs to be made on how to obtain one for out-of-vocabulary (OOV) words that are segmented into subwords. What is the best way to represent these words with a single vector, and are these representations of worse quality than those of in-vocabulary words? We carry out an intrinsic evaluation of embeddings from diffe…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted to TACL

  7. arXiv:2311.11967  [pdf, other]

    cs.CL

    Automatic Analysis of Substantiation in Scientific Peer Reviews

    Authors: Yanzhu Guo, Guokan Shang, Virgile Rennard, Michalis Vazirgiannis, Chloé Clavel

    Abstract: With the increasing amount of problematic peer reviews in top AI conferences, the community is urgently in need of automatic quality control measures. In this paper, we restrict our attention to substantiation -- one popular quality aspect indicating whether the claims in a review are sufficiently supported by evidence -- and provide a solution automatizing this evaluation process. To achieve this…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023 Findings

  8. arXiv:2311.09807  [pdf, other]

    cs.CL

    The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text

    Authors: Yanzhu Guo, Guokan Shang, Michalis Vazirgiannis, Chloé Clavel

    Abstract: This study investigates the consequences of training language models on synthetic data generated by their predecessors, an increasingly prevalent practice given the prominence of powerful generative models. Diverging from the usual emphasis on performance metrics, we focus on the impact of this training methodology on linguistic diversity, especially when conducted recursively over time. To assess…

    Submitted 16 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024 Findings

  9. arXiv:2311.09761  [pdf, other]

    cs.CL cs.AI cs.LG

    MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification

    Authors: Chadi Helwe, Tom Calamai, Pierre-Henri Paris, Chloé Clavel, Fabian Suchanek

    Abstract: We introduce MAFALDA, a benchmark for fallacy classification that merges and unites previous fallacy datasets. It comes with a taxonomy that aligns, refines, and unifies existing classifications of fallacies. We further provide a manual annotation of a part of the dataset together with manual explanations for each annotation. We propose a new annotation scheme tailored for subjective NLP tasks, an…

    Submitted 9 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  10. arXiv:2307.15582  [pdf, other]

    cs.CL

    When to generate hedges in peer-tutoring interactions

    Authors: Alafate Abulimiti, Chloé Clavel, Justine Cassell

    Abstract: This paper explores the application of machine learning techniques to predict where hedging occurs in peer-tutoring interactions. The study uses a naturalistic face-to-face dataset annotated for natural language turns, conversational strategies, tutoring strategies, and nonverbal behaviours. These elements are processed into a vector representation of the previous turns, which serves as input to s…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: In Proceedings of the 16th Annual Conference on Discourse and Dialogue (SIGDIAL). Sept 11-15, Prague, Czechia

    Journal ref: In Proceedings of the 16th Annual Conference in Discourse and Dialogue (SIGDIAL). Sept. 11-15, Prague, Czechia (2023)

  11. "You might think about slightly revising the title": identifying hedges in peer-tutoring interactions

    Authors: Yann Raphalen, Chloé Clavel, Justine Cassell

    Abstract: Hedges play an important role in the management of conversational interaction. In peer tutoring, they are notably used by tutors in dyads (pairs of interlocutors) experiencing low rapport to tone down the impact of instructions and negative feedback. Pursuing the objective of building a tutoring agent that manages rapport with students in order to improve learning, we used a multimodal peer-tutori…

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: Published in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL), 2022

    Journal ref: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL), Volume 1: long papers (2022)

  12. arXiv:2306.14696  [pdf, other]

    cs.CL cs.AI

    How About Kind of Generating Hedges using End-to-End Neural Models?

    Authors: Alafate Abulimiti, Chloé Clavel, Justine Cassell

    Abstract: Hedging is a strategy for softening the impact of a statement in conversation. In reducing the strength of an expression, it may help to avoid embarrassment (more technically, "face threat") to one's listener. For this reason, it is often found in contexts of instruction, such as tutoring. In this work, we develop a model of hedge generation based on i) fine-tuning state-of-the-art language mode…

    Submitted 26 June, 2023; originally announced June 2023.

    MSC Class: 68T50; ACM Class: I.2.7

  13. arXiv:2301.10761  [pdf, other]

    cs.CL cs.HC

    Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives

    Authors: Tanvi Dinkar, Chloé Clavel, Ioana Vasilescu

    Abstract: Disfluencies (i.e. interruptions in the regular flow of speech) are ubiquitous in spoken discourse. Fillers ("uh", "um") are the disfluencies that occur most frequently compared to other kinds of disfluencies. Yet, to the best of our knowledge, there isn't a resource that brings together the research perspectives influencing Spoken Language Understanding (SLU) on these speech events. This aim of…

    Submitted 24 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: This article has been published in the journal "Traitement Automatique des Langues" 63(3): 37-62, 2022, @ATALA. The original manuscript is available on the web site www.atala.org

  14. arXiv:2210.17378  [pdf, other]

    cs.CL

    Questioning the Validity of Summarization Datasets and Improving Their Factual Consistency

    Authors: Yanzhu Guo, Chloé Clavel, Moussa Kamal Eddine, Michalis Vazirgiannis

    Abstract: The topic of summarization evaluation has recently attracted a surge of attention due to the rapid development of abstractive summarization systems. However, the formulation of the task is rather ambiguous; neither the linguistic nor the natural language processing community has succeeded in giving a mutually agreed-upon definition. Due to this lack of well-defined formulation, a large number of p…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  15. arXiv:2208.11646  [pdf, other]

    cs.CL

    Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation

    Authors: Cyril Chhun, Pierre Colombo, Chloé Clavel, Fabian M. Suchanek

    Abstract: Research on Automatic Story Generation (ASG) relies heavily on human and automatic evaluation. However, there is no consensus on which human evaluation criteria to use, and no analysis of how well automatic criteria correlate with them. In this paper, we propose to re-evaluate ASG evaluation. We introduce a set of 6 orthogonal and comprehensive human criteria, carefully motivated by the social sci…

    Submitted 15 September, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: 43 pages, 38 figures. Proceedings of the 29th International Conference on Computational Linguistics (COLING 2022)

  16. arXiv:2207.08256  [pdf, other]

    cs.HC cs.AI cs.CL cs.LG

    Representation Learning of Image Schema

    Authors: Fajrian Yunus, Chloé Clavel, Catherine Pelachaud

    Abstract: Image schema is a recurrent pattern of reasoning where one entity is mapped into another. Image schema is similar to conceptual metaphor and is also related to metaphoric gesture. Our main goal is to generate metaphoric gestures for an Embodied Conversational Agent. We propose a technique to learn the vector representation of image schemas. As far as we are aware, this is the first work which…

    Submitted 17 July, 2022; originally announced July 2022.

  17. arXiv:2203.16891  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG

    A survey of neural models for the automatic analysis of conversation: Towards a better integration of the social sciences

    Authors: Chloé Clavel, Matthieu Labeau, Justine Cassell

    Abstract: Some exciting new approaches to neural architectures for the analysis of conversation have been introduced over the past couple of years. These include neural architectures for detecting emotion, dialogue acts, and sentiment polarity. They take advantage of some of the key attributes of contemporary machine learning, such as recurrent neural networks with attention mechanisms and transformer-based…

    Submitted 31 March, 2022; originally announced March 2022.

  18. arXiv:2112.01589  [pdf, other]

    cs.CL cs.AI

    InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation

    Authors: Pierre Colombo, Chloe Clavel, Pablo Piantanida

    Abstract: Assessing the quality of natural language generation systems through human annotation is very expensive. Additionally, human annotation campaigns are time-consuming and include non-reusable human labour. In practice, researchers rely on automatic metrics as a proxy of quality. In the last decade, many string-based metrics (e.g., BLEU) have been introduced. However, such metrics usually rely on exa…

    Submitted 25 March, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Journal ref: AAAI 2022

  19. arXiv:2110.09424  [pdf, other]

    cs.CV cs.CL cs.HC cs.LG

    Don't Judge Me by My Face: An Indirect Adversarial Approach to Remove Sensitive Information From Multimodal Neural Representation in Asynchronous Job Video Interviews

    Authors: Léo Hemamou, Arthur Guillon, Jean-Claude Martin, Chloé Clavel

    Abstract: Use of machine learning for automatic analysis of job interview videos has recently seen increased interest. Despite claims of fair output regarding sensitive information such as gender or ethnicity of the candidates, the current approaches rarely provide proof of unbiased decision-making, or that sensitive information is not used. Recently, adversarial methods have been proved to effectively remov…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: published in ACII 2021

  20. arXiv:2110.03389  [pdf, other]

    cs.CL cs.AI

    Beam Search with Bidirectional Strategies for Neural Response Generation

    Authors: Pierre Colombo, Chouchang Yang, Giovanna Varni, Chloé Clavel

    Abstract: Sequence-to-sequence neural networks have been widely used in language-based applications as they have flexible capabilities to learn various language models. However, when seeking the optimal language response through trained neural networks, existing approaches such as beam-search decoder strategies are still not able to reach promising performance. Instead of developing various…

    Submitted 7 October, 2021; originally announced October 2021.

  21. arXiv:2109.09366  [pdf, other]

    cs.CL cs.LG

    Few-Shot Emotion Recognition in Conversation with Sequential Prototypical Networks

    Authors: Gaël Guibon, Matthieu Labeau, Hélène Flamein, Luce Lefeuvre, Chloé Clavel

    Abstract: Several recent studies on dyadic human-human interactions have been done on conversations without specific business objectives. However, many companies might benefit from studies dedicated to more precise environments such as after sales services or customer satisfaction surveys. In this work, we place ourselves in the scope of a live chat customer service in which we want to detect emotions and t…

    Submitted 20 September, 2021; originally announced September 2021.

    Journal ref: The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), Nov 2021, Punta Cana, Dominican Republic

  22. arXiv:2109.00922  [pdf, other]

    cs.LG cs.AI cs.CL

    Improving Multimodal fusion via Mutual Dependency Maximisation

    Authors: Pierre Colombo, Emile Chapuis, Matthieu Labeau, Chloe Clavel

    Abstract: Multimodal sentiment analysis is a trending area of research, and multimodal fusion is one of its most active topics. Acknowledging that humans communicate through a variety of channels (i.e., visual, acoustic, linguistic), multimodal systems aim at integrating different unimodal representations into a synthetic one. So far, considerable effort has been made on developing complex architectures allowin…

    Submitted 9 September, 2021; v1 submitted 31 August, 2021; originally announced September 2021.

    Journal ref: EMNLP 2021

  23. arXiv:2108.12465  [pdf, other]

    cs.CL cs.AI

    Code-switched inspired losses for generic spoken dialog representations

    Authors: Emile Chapuis, Pierre Colombo, Matthieu Labeau, Chloe Clavel

    Abstract: Spoken dialog systems need to be able to handle both multiple languages and multilinguality inside a conversation (e.g., in case of code-switching). In this work, we introduce new pretraining losses tailored to learn multilingual spoken dialog representations. The goal of these losses is to expose the model to code-switched language. To scale up training, we automatically build a pretrainin…

    Submitted 9 September, 2021; v1 submitted 27 August, 2021; originally announced August 2021.

    Journal ref: EMNLP 2021

  24. arXiv:2108.12463  [pdf, other]

    cs.CL cs.AI

    Automatic Text Evaluation through the Lens of Wasserstein Barycenters

    Authors: Pierre Colombo, Guillaume Staerman, Chloe Clavel, Pablo Piantanida

    Abstract: A new metric, BaryScore, to evaluate text generation based on deep contextualized embeddings (e.g., BERT, RoBERTa, ELMo) is introduced. This metric is motivated by a new framework relying on optimal transport tools, i.e., Wasserstein distance and barycenter. By modelling the layer output of deep contextualized embeddings as a probability distribution rather than by a vector embedding; this f…

    Submitted 9 September, 2021; v1 submitted 27 August, 2021; originally announced August 2021.

    Journal ref: EMNLP 2021

  25. arXiv:2105.02685  [pdf, other]

    cs.AI

    A Novel Estimator of Mutual Information for Learning to Disentangle Textual Representations

    Authors: Pierre Colombo, Chloe Clavel, Pablo Piantanida

    Abstract: Learning disentangled representations of textual data is essential for many natural language tasks such as fair classification, style transfer and sentence generation, among others. The existing dominant approaches in the context of text data either rely on training an adversary (discriminator) that aims at making attribute values difficult to be inferred from the latent code or rely on minimis…

    Submitted 6 May, 2021; originally announced May 2021.

    Journal ref: ACL 2021

  26. arXiv:2104.04429  [pdf, other]

    cs.CL cs.HC

    Studying Alignment in a Collaborative Learning Activity via Automatic Methods: The Link Between What We Say and Do

    Authors: Utku Norman, Tanvi Dinkar, Barbara Bruno, Chloé Clavel

    Abstract: A dialogue is successful when there is alignment between the speakers at different linguistic levels. In this work, we consider the dialogue occurring between interlocutors engaged in a collaborative learning task, where they are not only evaluated on how well they performed, but also on how much they learnt. The main contribution of this work is to propose new automatic measures to study alignmen…

    Submitted 14 April, 2022; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: * The authors contributed equally to this work. This article is a preprint under review

  27. arXiv:2009.11340  [pdf, other]

    cs.CL

    The importance of fillers for text representations of speech transcripts

    Authors: Tanvi Dinkar, Pierre Colombo, Matthieu Labeau, Chloé Clavel

    Abstract: While being an essential component of spoken language, fillers (e.g. "um" or "uh") often remain overlooked in Spoken Language Understanding (SLU) tasks. We explore the possibility of representing them with deep contextualised embeddings, showing improvements on modelling spoken language and two downstream tasks - predicting a speaker's stance and expressed confidence.

    Submitted 1 October, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: To appear in EMNLP 2020

  28. arXiv:2009.11152  [pdf, other]

    cs.CL cs.AI

    Hierarchical Pre-training for Sequence Labelling in Spoken Dialog

    Authors: Emile Chapuis, Pierre Colombo, Matteo Manica, Matthieu Labeau, Chloe Clavel

    Abstract: Sequence labelling tasks like Dialog Act and Emotion/Sentiment identification are a key component of spoken dialog systems. In this work, we propose a new approach to learn generic representations adapted to spoken dialog, which we evaluate on a new benchmark we call Sequence labellIng evaLuatIon benChmark fOr spoken laNguagE benchmark (SILICONE). SILICONE is model-agnostic and c…

    Submitted 8 February, 2021; v1 submitted 23 September, 2020; originally announced September 2020.

    Journal ref: EMNLP 2020

  29. arXiv:2008.07643  [pdf, other]

    cs.HC cs.CL cs.CV cs.SD eess.AS

    Sequence-to-Sequence Predictive Model: From Prosody To Communicative Gestures

    Authors: Fajrian Yunus, Chloé Clavel, Catherine Pelachaud

    Abstract: Communicative gestures and speech acoustics are tightly linked. Our objective is to predict the timing of gestures according to the acoustics. That is, we want to predict when a certain gesture occurs. We develop a model based on a recurrent neural network with attention mechanism. The model is trained on a corpus of natural dyadic interaction where the speech acoustics and the gesture phases and typ…

    Submitted 23 April, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

  30. On-the-fly Detection of User Engagement Decrease in Spontaneous Human-Robot Interaction, International Journal of Social Robotics, 2019

    Authors: Atef Ben Youssef, Giovanna Varni, Slim Essid, Chloé Clavel

    Abstract: In this paper, we consider the detection of a decrease of engagement by users spontaneously interacting with a socially assistive robot in a public space. We first describe the UE-HRI dataset that collects spontaneous Human-Robot Interactions following the guidelines provided by the Affective Computing research community to collect data "in-the-wild". We then analyze the users' behaviors, focusing…

    Submitted 20 April, 2020; originally announced April 2020.

    Journal ref: International Journal of Social Robotics December 2019

  31. arXiv:2003.11593  [pdf, other]

    stat.ML cs.CL cs.LG

    Heavy-tailed Representations, Text Polarity Classification & Data Augmentation

    Authors: Hamid Jalalzai, Pierre Colombo, Chloé Clavel, Eric Gaussier, Giovanna Varni, Emmanuel Vignon, Anne Sabourin

    Abstract: The dominant approaches to text representation in natural language rely on learning embeddings on massive corpora which have convenient properties such as compositionality and distance preservation. In this paper, we develop a novel method to learn a heavy-tailed embedding with desirable regularity properties regarding the distributional tails, which allows analyzing the points far away from the…

    Submitted 25 March, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2020

  32. arXiv:2002.09419  [pdf]

    cs.CL

    Guider l'attention dans les modeles de sequence a sequence pour la prediction des actes de dialogue

    Authors: Pierre Colombo, Emile Chapuis, Matteo Manica, Emmanuel Vignon, Giovanna Varni, Chloe Clavel

    Abstract: The task of predicting dialog acts (DA) based on conversational dialog is a key component in the development of conversational agents. Accurately predicting DAs requires a precise modeling of both the conversation and the global tag dependencies. We leverage seq2seq approaches widely adopted in Neural Machine Translation (NMT) to improve the modelling of tag sequentiality. Seq2seq models are known…

    Submitted 26 February, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: in French

    Journal ref: WACAI 2020

  33. arXiv:2002.08801  [pdf, other]

    cs.CL cs.LG

    Guiding attention in Sequence-to-sequence models for Dialogue Act prediction

    Authors: Pierre Colombo, Emile Chapuis, Matteo Manica, Emmanuel Vignon, Giovanna Varni, Chloe Clavel

    Abstract: The task of predicting dialog acts (DA) based on conversational dialog is a key component in the development of conversational agents. Accurately predicting DAs requires a precise modeling of both the conversation and the global tag dependencies. We leverage seq2seq approaches widely adopted in Neural Machine Translation (NMT) to improve the modelling of tag sequentiality. Seq2seq models are known…

    Submitted 26 February, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Journal ref: AAAI 2020

  34. arXiv:1909.08845  [pdf, other]

    cs.CV cs.HC

    Slices of Attention in Asynchronous Video Job Interviews

    Authors: Léo Hemamou, Ghazi Felhi, Jean-Claude Martin, Chloé Clavel

    Abstract: The impact of nonverbal behaviour in a hiring decision remains an open question. Investigating this question is important, as it could provide a better understanding of how to train candidates for job interviews and make recruiters aware of influential nonverbal behaviour. This research has recently been accelerated due to the development of tools for the automatic analysis of social signals,…

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Accepted at 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII)

  35. arXiv:1908.11216  [pdf, other]

    cs.CL cs.AI cs.IR

    From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining

    Authors: Alexandre Garcia, Pierre Colombo, Slim Essid, Florence d'Alché-Buc, Chloé Clavel

    Abstract: The task of predicting fine-grained user opinion based on spontaneous spoken language is a key problem arising in the development of Computational Agents as well as in the development of social-network-based opinion miners. Unfortunately, gathering reliable data on which a model can be trained is notoriously difficult and existing works rely only on coarsely labeled opinions. In this work we aim a…

    Submitted 10 September, 2019; v1 submitted 29 August, 2019; originally announced August 2019.

    Comments: Accepted to 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP) and 9th International Joint Conference on Natural Language Processing (IJCNLP)

  36. HireNet: a Hierarchical Attention Model for the Automatic Analysis of Asynchronous Video Job Interviews

    Authors: Léo Hemamou, Ghazi Felhi, Vincent Vandenbussche, Jean-Claude Martin, Chloé Clavel

    Abstract: New technologies drastically change recruitment techniques. Some research projects aim at designing interactive systems that help candidates practice job interviews. Other studies aim at the automatic detection of social signals (e.g. smile, turn of speech, etc.) in videos of job interviews. These studies are limited with respect to the number of interviews they process, but also by the fact tha…

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: AAAI 2019

    Journal ref: Vol 33 (2019): Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence

  37. arXiv:1902.10102  [pdf, other]

    cs.MM cs.CL

    A multimodal movie review corpus for fine-grained opinion mining

    Authors: Alexandre Garcia, Slim Essid, Florence d'Alché-Buc, Chloé Clavel

    Abstract: In this paper, we introduce a set of opinion annotations for the POM movie review dataset, composed of 1000 videos. The annotation campaign is motivated by the development of a hierarchical opinion prediction framework allowing one to predict the different components of the opinions (e.g. polarity and aspect) and to identify the corresponding textual spans. The resulting annotations have been gath…

    Submitted 29 April, 2021; v1 submitted 26 February, 2019; originally announced February 2019.

  38. arXiv:1806.07787  [pdf, other]

    cs.CL

    Opinion Dynamics Modeling for Movie Review Transcripts Classification with Hidden Conditional Random Fields

    Authors: Valentin Barriere, Chloé Clavel, Slim Essid

    Abstract: In this paper, the main goal is to detect a movie reviewer's opinion using hidden conditional random fields. This model allows us to capture the dynamics of the reviewer's opinion in the transcripts of long unsegmented audio reviews that are analyzed by our system. High level linguistic features are computed at the level of inter-pausal segments. The features include syntactic features, a statisti…

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: Oral Interspeech 2017

  39. arXiv:1803.08355  [pdf, other]

    cs.LG cs.AI stat.ML

    Structured Output Learning with Abstention: Application to Accurate Opinion Prediction

    Authors: Alexandre Garcia, Slim Essid, Chloé Clavel, Florence d'Alché-Buc

    Abstract: Motivated by Supervised Opinion Analysis, we propose a novel framework devoted to Structured Output Learning with Abstention (SOLA). The structure prediction model is able to abstain from predicting some labels in the structured output at a cost chosen by the user in a flexible way. For that purpose, we decompose the problem into the learning of a pair of predictors, one devoted to structured abst…

    Submitted 8 June, 2018; v1 submitted 22 March, 2018; originally announced March 2018.

    Journal ref: Proceedings of Machine Learning Research 80 (2018) 1695-1703