Skip to main content

Showing 1–31 of 31 results for author: Morency, L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.14812  [pdf, ps, other

    cs.CL stat.ME

    Isolated Causal Effects of Natural Language

    Authors: Victoria Lin, Louis-Philippe Morency, Eli Ben-Michael

    Abstract: As language technologies become widespread, it is important to understand how changes in language affect reader perceptions and behaviors. These relationships may be formalized as the isolated causal effect of some focal language-encoded intervention (e.g., factual inaccuracies) on an external outcome (e.g., readers' beliefs). In this paper, we introduce a formal estimation framework for isolated… ▽ More

    Submitted 4 June, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

    Comments: ICML 2025

  2. arXiv:2402.14979  [pdf, other

    cs.LG cs.CL stat.ME

    Optimizing Language Models for Human Preferences is a Causal Inference Problem

    Authors: Victoria Lin, Eli Ben-Michael, Louis-Philippe Morency

    Abstract: As large language models (LLMs) see greater use in academic and commercial settings, there is increasing interest in methods that allow language models to generate texts aligned with human preferences. In this paper, we present an initial exploration of language model optimization for human preferences from direct outcome datasets, where each sample consists of a text and an associated numerical o… ▽ More

    Submitted 5 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: UAI 2024

  3. arXiv:2310.20697  [pdf, other

    cs.CL stat.ME

    Text-Transport: Toward Learning Causal Effects of Natural Language

    Authors: Victoria Lin, Louis-Philippe Morency, Eli Ben-Michael

    Abstract: As language technologies gain prominence in real-world settings, it is important to understand how changes to language affect reader perceptions. This can be formalized as the causal effect of varying a linguistic attribute (e.g., sentiment) on a reader's response to the text. In this paper, we introduce Text-Transport, a method for estimation of causal effects from natural language under any text… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  4. arXiv:2306.04539  [pdf, other

    cs.LG cs.CL cs.CV cs.IT stat.ML

    Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications

    Authors: Paul Pu Liang, Chun Kai Ling, Yun Cheng, Alex Obolenskiy, Yudong Liu, Rohan Pandey, Alex Wilf, Louis-Philippe Morency, Ruslan Salakhutdinov

    Abstract: In many machine learning systems that jointly learn from multiple modalities, a core research question is to understand the nature of multimodal interactions: how modalities combine to provide new task-relevant information that was not present in either alone. We study this challenge of interaction quantification in a semi-supervised setting with only labeled unimodal data and naturally co-occurri… ▽ More

    Submitted 13 June, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: ICLR 2024, Code available at: https://github.com/pliang279/PID

  5. arXiv:2210.04714  [pdf, other

    cs.CL cs.LG stat.ML

    Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis

    Authors: Yuxin Xiao, Paul Pu Liang, Umang Bhatt, Willie Neiswanger, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: Pre-trained language models (PLMs) have gained increasing popularity due to their compelling prediction performance in diverse natural language processing (NLP) tasks. When formulating a PLM-based prediction pipeline for NLP tasks, it is also crucial for the pipeline to minimize the calibration error, especially in safety-critical applications. That is, the pipeline should reliably indicate when w… ▽ More

    Submitted 14 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted by EMNLP 2022 (Findings)

  6. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  7. arXiv:2110.13422  [pdf, other

    cs.LG cs.AI stat.ML

    Relay Variational Inference: A Method for Accelerated Encoderless VI

    Authors: Amir Zadeh, Santiago Benoit, Louis-Philippe Morency

    Abstract: Variational Inference (VI) offers a method for approximating intractable likelihoods. In neural VI, inference of approximate posteriors is commonly done using an encoder. Alternatively, encoderless VI offers a framework for learning generative models from data without encountering suboptimalities caused by amortization via an encoder (e.g. in presence of missing or uncertain data). However, in abs… ▽ More

    Submitted 13 January, 2023; v1 submitted 26 October, 2021; originally announced October 2021.

  8. arXiv:2101.00574  [pdf, other

    cs.LG cs.AI stat.ML

    StarNet: Gradient-free Training of Deep Generative Models using Determined System of Linear Equations

    Authors: Amir Zadeh, Santiago Benoit, Louis-Philippe Morency

    Abstract: In this paper we present an approach for training deep generative models solely based on solving determined systems of linear equations. A network that uses this approach, called a StarNet, has the following desirable properties: 1) training requires no gradient as solution to the system of linear equations is not stochastic, 2) is highly scalable when solving the system of linear equations w.r.t… ▽ More

    Submitted 3 January, 2021; originally announced January 2021.

    Comments: Work in progress at CMU

  9. arXiv:2012.02359  [pdf, other

    cs.LG cs.CY stat.AP

    Multimodal Privacy-preserving Mood Prediction from Mobile Data: A Preliminary Study

    Authors: Terrance Liu, Paul Pu Liang, Michal Muszynski, Ryo Ishii, David Brent, Randy Auerbach, Nicholas Allen, Louis-Philippe Morency

    Abstract: Mental health conditions remain under-diagnosed even in countries with common access to advanced medical care. The ability to accurately and efficiently predict mood from easily collectible data has several important implications towards the early detection and intervention of mental health disorders. One promising data source to help monitor human behavior is from daily smartphone usage. However,… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

  10. arXiv:2009.00001  [pdf, other

    cs.HC stat.AP

    Toward Multimodal Modeling of Emotional Expressiveness

    Authors: Victoria Lin, Jeffrey M. Girard, Michael A. Sayette, Louis-Philippe Morency

    Abstract: Emotional expressiveness captures the extent to which a person tends to outwardly display their emotions through behavior. Due to the close relationship between emotional expressiveness and behavioral health, as well as the crucial role that it plays in social interaction, the ability to automatically predict emotional expressiveness stands to spur advances in science, medicine, and industry. In t… ▽ More

    Submitted 31 August, 2020; originally announced September 2020.

    Comments: V. Lin and J.M. Girard contributed equally to this research. This paper was accepted to ICMI 2020

  11. arXiv:2007.03626  [pdf, other

    cs.CL cs.CV cs.LG stat.ML

    What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets

    Authors: Jianing Yang, Yuying Zhu, Yongxin Wang, Ruitao Yi, Amir Zadeh, Louis-Philippe Morency

    Abstract: Question answering biases in video QA datasets can mislead multimodal model to overfit to QA artifacts and jeopardize the model's ability to generalize. Understanding how strong these QA biases are and where they come from helps the community measure progress more accurately and provide researchers insights to debug their models. In this paper, we analyze QA biases in popular video question answer… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

  12. arXiv:2006.05576  [pdf, other

    cs.LG stat.ML

    Self-supervised Learning from a Multi-view Perspective

    Authors: Yao-Hung Hubert Tsai, Yue Wu, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: As a subset of unsupervised representation learning, self-supervised representation learning adopts self-defined signals as supervision and uses the learned representation for downstream tasks, such as object detection and image captioning. Many proposed approaches for self-supervised learning follow naturally a multi-view perspective, where the input (e.g., original images) and the self-supervise… ▽ More

    Submitted 22 March, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

  13. arXiv:2006.05553  [pdf, other

    cs.LG stat.ME stat.ML

    Neural Methods for Point-wise Dependency Estimation

    Authors: Yao-Hung Hubert Tsai, Han Zhao, Makoto Yamada, Louis-Philippe Morency, Ruslan Salakhutdinov

    Abstract: Since its inception, the neural estimation of mutual information (MI) has demonstrated the empirical success of modeling expected dependency between high-dimensional random variables. However, MI is an aggregate statistic and cannot be used to measure point-wise dependency between different events. In this work, instead of estimating the expected dependency, we focus on estimating point-wise depen… ▽ More

    Submitted 14 October, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

  14. arXiv:2002.06541  [pdf, other

    cs.LG cs.IT stat.ML

    Learning Not to Learn in the Presence of Noisy Labels

    Authors: Liu Ziyin, Blair Chen, Ru Wang, Paul Pu Liang, Ruslan Salakhutdinov, Louis-Philippe Morency, Masahito Ueda

    Abstract: Learning in the presence of label noise is a challenging yet important task: it is crucial to design models that are robust in the presence of mislabeled datasets. In this paper, we discover that a new class of loss functions called the gambler's loss provides strong robustness to label noise across various levels of corruption. We show that training with this loss function encourages the model to… ▽ More

    Submitted 16 February, 2020; originally announced February 2020.

  15. arXiv:2001.01523  [pdf, other

    cs.LG cs.DC stat.ML

    Think Locally, Act Globally: Federated Learning with Local and Global Representations

    Authors: Paul Pu Liang, Terrance Liu, Liu Ziyin, Nicholas B. Allen, Randy P. Auerbach, David Brent, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: Federated learning is a method of training models on private data distributed over multiple devices. To keep device data private, the global model is trained by only communicating parameters and updates which poses scalability challenges for large models. To this end, we propose a new federated learning algorithm that jointly learns compact local representations on each device and a global model a… ▽ More

    Submitted 14 July, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

    Comments: NeurIPS 2019 Workshop on Federated Learning distinguished student paper award. Code: https://github.com/pliang279/LG-FedAvg

  16. arXiv:1912.09423  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Pseudo-Encoded Stochastic Variational Inference

    Authors: Amir Zadeh, Smon Hessner, Yao-Chong Lim, Louis-Phlippe Morency

    Abstract: Posterior inference in directed graphical models is commonly done using a probabilistic encoder (a.k.a inference model) conditioned on the input. Often this inference model is trained jointly with the probabilistic decoder (a.k.a generator model). If probabilistic encoder encounters complexities during training (e.g. suboptimal complxity or parameterization), then learning reaches a suboptimal obj… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

  17. arXiv:1912.04523  [pdf, other

    cs.CV stat.AP

    Context-Dependent Models for Predicting and Characterizing Facial Expressiveness

    Authors: Victoria Lin, Jeffrey M. Girard, Louis-Philippe Morency

    Abstract: In recent years, extensive research has emerged in affective computing on topics like automatic emotion recognition and determining the signals that characterize individual emotions. Much less studied, however, is expressiveness, or the extent to which someone shows any feeling or emotion. Expressiveness is related to personality and mental health and plays a crucial role in social interaction. As… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

  18. arXiv:1911.09826  [pdf, other

    cs.LG cs.CL stat.ML

    Factorized Multimodal Transformer for Multimodal Sequential Learning

    Authors: Amir Zadeh, Chengfeng Mao, Kelly Shi, Yiwei Zhang, Paul Pu Liang, Soujanya Poria, Louis-Philippe Morency

    Abstract: The complex world around us is inherently multimodal and sequential (continuous). Information is scattered across different modalities and requires multiple continuous sensors to be captured. As machine learning leaps towards better generalization to real world, multimodal sequential learning becomes a fundamental research area. Arguably, modeling arbitrarily distributed spatio-temporal dynamics w… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

  19. arXiv:1911.09783  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    WildMix Dataset and Spectro-Temporal Transformer Model for Monoaural Audio Source Separation

    Authors: Amir Zadeh, Tianjun Ma, Soujanya Poria, Louis-Philippe Morency

    Abstract: Monoaural audio source separation is a challenging research area in machine learning. In this area, a mixture containing multiple audio sources is given, and a model is expected to disentangle the mixture into isolated atomic sources. In this paper, we first introduce a challenging new dataset for monoaural source separation called WildMix. WildMix is designed with the goal of extending the bounda… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

  20. arXiv:1908.11775  [pdf, ps, other

    cs.LG stat.ML

    Transformer Dissection: A Unified Understanding of Transformer's Attention via the Lens of Kernel

    Authors: Yao-Hung Hubert Tsai, Shaojie Bai, Makoto Yamada, Louis-Philippe Morency, Ruslan Salakhutdinov

    Abstract: Transformer is a powerful architecture that achieves superior performance on various sequence learning tasks, including neural machine translation, language understanding, and sequence prediction. At the core of the Transformer is the attention mechanism, which concurrently processes all inputs in the streams. In this paper, we present a new formulation of attention via the lens of the kernel. To… ▽ More

    Submitted 11 November, 2019; v1 submitted 30 August, 2019; originally announced August 2019.

    Comments: EMNLP 2019

  21. arXiv:1908.05787  [pdf, other

    cs.LG cs.CL stat.ML

    Integrating Multimodal Information in Large Pretrained Transformers

    Authors: Wasifur Rahman, Md. Kamrul Hasan, Sangwu Lee, Amir Zadeh, Chengfeng Mao, Louis-Philippe Morency, Ehsan Hoque

    Abstract: Recent Transformer-based contextual word representations, including BERT and XLNet, have shown state-of-the-art performance in multiple disciplines within NLP. Fine-tuning the trained contextual models on task-specific datasets has been the key to achieving superior performance downstream. While fine-tuning these pre-trained models is straightforward for lexical applications (applications with onl… ▽ More

    Submitted 21 November, 2020; v1 submitted 15 August, 2019; originally announced August 2019.

  22. arXiv:1907.01011  [pdf, other

    cs.LG cs.CL stat.ML

    Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization

    Authors: Paul Pu Liang, Zhun Liu, Yao-Hung Hubert Tsai, Qibin Zhao, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: There has been an increased interest in multimodal language processing including multimodal dialog, question answering, sentiment analysis, and speech recognition. However, naturally occurring multimodal data is often imperfect as a result of imperfect modalities, missing entries or noise corruption. To address these concerns, we present a regularization method based on tensor rank minimization. O… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

  23. arXiv:1907.00208  [pdf, other

    cs.LG stat.ML

    Deep Gamblers: Learning to Abstain with Portfolio Theory

    Authors: Liu Ziyin, Zhikang Wang, Paul Pu Liang, Ruslan Salakhutdinov, Louis-Philippe Morency, Masahito Ueda

    Abstract: We deal with the \textit{selective classification} problem (supervised-learning problem with a rejection option), where we want to achieve the best performance at a certain level of coverage of the data. We transform the original $m$-class classification problem to $(m+1)$-class where the $(m+1)$-th class represents the model abstaining from making a prediction due to disconfidence. Inspired by po… ▽ More

    Submitted 1 October, 2019; v1 submitted 29 June, 2019; originally announced July 2019.

    Comments: Camera-Ready version for NeurIPS2019. Link to our code updated

  24. arXiv:1906.02125  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS stat.ML

    Strong and Simple Baselines for Multimodal Utterance Embeddings

    Authors: Paul Pu Liang, Yao Chong Lim, Yao-Hung Hubert Tsai, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: Human language is a rich multimodal signal consisting of spoken words, facial expressions, body gestures, and vocal intonations. Learning representations for these spoken utterances is a complex research problem due to the presence of multiple heterogeneous sources of information. Recent advances in multimodal learning have followed the general trend of building more complex models that utilize va… ▽ More

    Submitted 28 February, 2020; v1 submitted 14 May, 2019; originally announced June 2019.

    Comments: NAACL 2019 oral presentation

  25. arXiv:1904.06618  [pdf, other

    cs.LG cs.CL stat.ML

    UR-FUNNY: A Multimodal Language Dataset for Understanding Humor

    Authors: Md Kamrul Hasan, Wasifur Rahman, Amir Zadeh, Jianyuan Zhong, Md Iftekhar Tanveer, Louis-Philippe Morency, Mohammed, Hoque

    Abstract: Humor is a unique and creative communicative behavior displayed during social interactions. It is produced in a multimodal manner, through the usage of words (text), gestures (vision) and prosodic cues (acoustic). Understanding humor from these three modalities falls within boundaries of multimodal language; a recent research trend in natural language processing that models natural language as it… ▽ More

    Submitted 13 April, 2019; originally announced April 2019.

    Journal ref: EMNLP-IJCNLP, 2019, 2046-2056

  26. arXiv:1903.00840  [pdf, other

    cs.LG cs.AI stat.ML

    Variational Auto-Decoder: A Method for Neural Generative Modeling from Incomplete Data

    Authors: Amir Zadeh, Yao-Chong Lim, Paul Pu Liang, Louis-Philippe Morency

    Abstract: Learning a generative model from partial data (data with missingness) is a challenging area of machine learning research. We study a specific implementation of the Auto-Encoding Variational Bayes (AEVB) algorithm, named in this paper as a Variational Auto-Decoder (VAD). VAD is a generic framework which uses Variational Bayes and Markov Chain Monte Carlo (MCMC) methods to learn a generative model f… ▽ More

    Submitted 3 January, 2021; v1 submitted 3 March, 2019; originally announced March 2019.

    Comments: Link to code and data available from https://github.com/A2Zadeh/Variational-Autodecoder

  27. arXiv:1812.07809  [pdf, other

    cs.LG cs.CL cs.CV cs.HC stat.ML

    Found in Translation: Learning Robust Joint Representations by Cyclic Translations Between Modalities

    Authors: Hai Pham, Paul Pu Liang, Thomas Manzini, Louis-Philippe Morency, Barnabas Poczos

    Abstract: Multimodal sentiment analysis is a core research area that studies speaker sentiment expressed from the language, visual, and acoustic modalities. The central challenge in multimodal learning involves inferring joint representations that can process and relate information from these modalities. However, existing work learns joint representations by requiring all modalities as input and as a result… ▽ More

    Submitted 28 February, 2020; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: AAAI 2019, code available at https://github.com/hainow/MCTN

  28. arXiv:1808.03920  [pdf, other

    cs.LG cs.AI cs.CL cs.NE stat.ML

    Multimodal Language Analysis with Recurrent Multistage Fusion

    Authors: Paul Pu Liang, Ziyin Liu, Amir Zadeh, Louis-Philippe Morency

    Abstract: Computational modeling of human multimodal language is an emerging research area in natural language processing spanning the language, visual and acoustic modalities. Comprehending multimodal language requires modeling not only the interactions within each modality (intra-modal interactions) but more importantly the interactions between modalities (cross-modal interactions). In this paper, we prop… ▽ More

    Submitted 12 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018

  29. arXiv:1806.06176  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    Learning Factorized Multimodal Representations

    Authors: Yao-Hung Hubert Tsai, Paul Pu Liang, Amir Zadeh, Louis-Philippe Morency, Ruslan Salakhutdinov

    Abstract: Learning multimodal representations is a fundamentally complex research problem due to the presence of multiple heterogeneous sources of information. Although the presence of multiple modalities provides additional valuable information, there are two key challenges to address when learning from multimodal data: 1) models must learn the complex intra-modal and cross-modal interactions for predictio… ▽ More

    Submitted 14 May, 2019; v1 submitted 15 June, 2018; originally announced June 2018.

    Comments: ICLR 2019

  30. arXiv:1806.00064  [pdf, other

    cs.AI cs.LG stat.ML

    Efficient Low-rank Multimodal Fusion with Modality-Specific Factors

    Authors: Zhun Liu, Ying Shen, Varun Bharadhwaj Lakshminarasimhan, Paul Pu Liang, Amir Zadeh, Louis-Philippe Morency

    Abstract: Multimodal research is an emerging field of artificial intelligence, and one of the main research problems in this field is multimodal fusion. The fusion of multimodal data is the process of integrating multiple unimodal representations into one compact multimodal representation. Previous research in this field has exploited the expressiveness of tensors for multimodal representation. However, the… ▽ More

    Submitted 31 May, 2018; originally announced June 2018.

    Comments: * Equal contribution. 10 pages. Accepted by ACL 2018

  31. arXiv:1802.00924  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Multimodal Sentiment Analysis with Word-Level Fusion and Reinforcement Learning

    Authors: Minghai Chen, Sen Wang, Paul Pu Liang, Tadas Baltrušaitis, Amir Zadeh, Louis-Philippe Morency

    Abstract: With the increasing popularity of video sharing websites such as YouTube and Facebook, multimodal sentiment analysis has received increasing attention from the scientific community. Contrary to previous works in multimodal sentiment analysis which focus on holistic information in speech segments such as bag of words representations and average facial expression intensity, we develop a novel deep a… ▽ More

    Submitted 3 February, 2018; originally announced February 2018.

    Comments: ICMI 2017 Oral Presentation, Honorable Mention Award