Skip to main content

Showing 1–50 of 72 results for author: Ungar, L

.
  1. arXiv:2505.21588  [pdf, ps, other

    cs.MA cs.AI

    Herd Behavior: Investigating Peer Influence in LLM-based Multi-Agent Systems

    Authors: Young-Min Cho, Sharath Chandra Guntuku, Lyle Ungar

    Abstract: Recent advancements in Large Language Models (LLMs) have enabled the emergence of multi-agent systems where LLMs interact, collaborate, and make decisions in shared environments. While individual model behavior has been extensively studied, the dynamics of peer influence in such systems remain underexplored. In this paper, we investigate herd behavior, the tendency of agents to align their outputs… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Preprint

  2. arXiv:2505.11739  [pdf, other

    cs.CL cs.AI

    ZeroTuning: Unlocking the Initial Token's Power to Enhance Large Language Models Without Training

    Authors: Feijiang Han, Xiaodong Yu, Jianheng Tang, Lyle Ungar

    Abstract: Recently, training-free methods for improving large language models (LLMs) have attracted growing interest, with token-level attention tuning emerging as a promising and interpretable direction. However, existing methods typically rely on auxiliary mechanisms to identify important or irrelevant task-specific tokens, introducing potential bias and limiting applicability. In this paper, we uncover a… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  3. arXiv:2504.20519  [pdf

    cs.CY cs.HC

    Conversations with AI Chatbots Increase Short-Term Vaccine Intentions But Do Not Outperform Standard Public Health Messaging

    Authors: Neil K. R. Sehgal, Sunny Rai, Manuel Tonneau, Anish K. Agarwal, Joseph Cappella, Melanie Kornides, Lyle Ungar, Alison Buttenheim, Sharath Chandra Guntuku

    Abstract: Large language model (LLM) based chatbots show promise in persuasive communication, but existing studies often rely on weak controls or focus on belief change rather than behavioral intentions or outcomes. This pre-registered multi-country (US, Canada, UK) randomized controlled trial involving 930 vaccine-hesitant parents evaluated brief (three-minute) multi-turn conversations with LLM-based chatb… ▽ More

    Submitted 29 April, 2025; v1 submitted 29 April, 2025; originally announced April 2025.

  4. arXiv:2504.14225  [pdf, other

    cs.CL

    Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale

    Authors: Bowen Jiang, Zhuoqun Hao, Young-Min Cho, Bryan Li, Yuan Yuan, Sihao Chen, Lyle Ungar, Camillo J. Taylor, Dan Roth

    Abstract: Large Language Models (LLMs) have emerged as personalized assistants for users across a wide range of tasks -- from offering writing support to delivering tailored recommendations or consultations. Over time, the interaction history between a user and an LLM can provide extensive information about an individual's traits and preferences. However, open questions remain on how well LLMs today can eff… ▽ More

    Submitted 19 April, 2025; originally announced April 2025.

  5. Exploring Socio-Cultural Challenges and Opportunities in Designing Mental Health Chatbots for Adolescents in India

    Authors: Neil K. R. Sehgal, Hita Kambhamettu, Sai Preethi Matam, Lyle Ungar, Sharath Chandra Guntuku

    Abstract: Mental health challenges among Indian adolescents are shaped by unique cultural and systemic barriers, including high social stigma and limited professional support. Through a mixed-methods study involving a survey of 278 adolescents and follow-up interviews with 12 participants, we explore how adolescents perceive mental health challenges and interact with digital tools. Quantitative results high… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Journal ref: Extended Abstracts of the CHI Conference on Human Factors in Computing Systems 2025

  6. arXiv:2502.10999  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations

    Authors: Bowen Jiang, Yuan Yuan, Xinyi Bai, Zhuoqun Hao, Alyson Yin, Yaojie Hu, Wenyu Liao, Lyle Ungar, Camillo J. Taylor

    Abstract: This work demonstrates that diffusion models can achieve font-controllable multilingual text rendering using just raw images without font label annotations. Visual text rendering remains a significant challenge. While recent methods condition diffusion on glyphs, it is impossible to retrieve exact font annotations from large-scale, real-world datasets, which prevents user-specified font control. T… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

    Comments: This is preliminary work and code will be released at github.com/bowen-upenn/ControlText

  7. arXiv:2502.02880  [pdf, other

    cs.HC

    Learning not cheating: AI assistance can enhance rather than hinder skill development

    Authors: Benjamin Lira, Todd Rogers, Daniel G. Goldstein, Lyle Ungar, Angela L. Duckworth

    Abstract: It is widely believed that outsourcing cognitive work to AI boosts immediate productivity at the expense of long-term human capital development. An overlooked possibility is that AI tools can support skill development by providing just-in-time, high-quality, personalized examples. In this investigation, lay forecasters predicted that practicing writing cover letters with an AI tool would impair le… ▽ More

    Submitted 22 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: 25 pages, 13 figures, submitted to PNAS

  8. arXiv:2412.16882  [pdf, other

    cs.AI cs.CL

    PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health

    Authors: Huy Vu, Huy Anh Nguyen, Adithya V Ganesan, Swanie Juhng, Oscar N. E. Kjell, Joao Sedoc, Margaret L. Kern, Ryan L. Boyd, Lyle Ungar, H. Andrew Schwartz, Johannes C. Eichstaedt

    Abstract: Artificial intelligence-based language generators are now a part of most people's lives. However, by default, they tend to generate "average" language without reflecting the ways in which people differ. Here, we propose a lightweight modification to the standard language model transformer architecture - "PsychAdapter" - that uses empirically derived trait-language patterns to generate natural lang… ▽ More

    Submitted 31 December, 2024; v1 submitted 22 December, 2024; originally announced December 2024.

  9. arXiv:2411.12877  [pdf, other

    cs.HC cs.AI

    The Illusion of Empathy: How AI Chatbots Shape Conversation Perception

    Authors: Tingting Liu, Salvatore Giorgi, Ankit Aich, Allison Lahnala, Brenda Curtis, Lyle Ungar, João Sedoc

    Abstract: As AI chatbots increasingly incorporate empathy, understanding user-centered perceptions of chatbot empathy and its impact on conversation quality remains essential yet under-explored. This study examines how chatbot identity and perceived empathy influence users' overall conversation experience. Analyzing 155 conversations from two datasets, we found that while GPT-based chatbots were rated signi… ▽ More

    Submitted 6 March, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

  10. arXiv:2409.13684  [pdf, other

    cs.LG cs.AI

    The FIX Benchmark: Extracting Features Interpretable to eXperts

    Authors: Helen Jin, Shreya Havaldar, Chaehyeon Kim, Anton Xue, Weiqiu You, Helen Qu, Marco Gatti, Daniel A Hashimoto, Bhuvnesh Jain, Amin Madani, Masao Sako, Lyle Ungar, Eric Wong

    Abstract: Feature-based methods are commonly used to explain model predictions, but these methods often implicitly assume that interpretable features are readily available. However, this is often not the case for high-dimensional data, and it can be hard even for domain experts to mathematically specify which features are important. Can we instead automatically extract collections or groups of features that… ▽ More

    Submitted 23 December, 2024; v1 submitted 20 September, 2024; originally announced September 2024.

  11. arXiv:2409.00262  [pdf, other

    cs.CL

    DiverseDialogue: A Methodology for Designing Chatbots with Human-Like Diversity

    Authors: Xiaoyu Lin, Xinkai Yu, Ankit Aich, Salvatore Giorgi, Lyle Ungar

    Abstract: Large Language Models (LLMs), which simulate human users, are frequently employed to evaluate chatbots in applications such as tutoring and customer service. Effective evaluation necessitates a high degree of human-like diversity within these simulations. In this paper, we demonstrate that conversations generated by GPT-4o mini, when used as simulated human participants, systematically differ from… ▽ More

    Submitted 30 August, 2024; originally announced September 2024.

  12. arXiv:2406.14462  [pdf, other

    cs.CL

    Modeling Human Subjectivity in LLMs Using Explicit and Implicit Human Factors in Personas

    Authors: Salvatore Giorgi, Tingting Liu, Ankit Aich, Kelsey Isman, Garrick Sherman, Zachary Fried, João Sedoc, Lyle H. Ungar, Brenda Curtis

    Abstract: Large language models (LLMs) are increasingly being used in human-centered social scientific tasks, such as data annotation, synthetic data creation, and engaging in dialog. However, these tasks are highly subjective and dependent on human factors, such as one's environment, attitudes, beliefs, and lived experiences. Thus, it may be the case that employing LLMs (which do not have such human factor… ▽ More

    Submitted 17 October, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted at Findings of EMNLP 2024

  13. arXiv:2406.12679  [pdf, other

    cs.CL

    Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping

    Authors: Ankit Aich, Tingting Liu, Salvatore Giorgi, Kelsey Isman, Lyle Ungar, Brenda Curtis

    Abstract: Large Language Models (LLMs) are increasingly being used in educational and learning applications. Research has demonstrated that controlling for style, to fit the needs of the learner, fosters increased understanding, promotes inclusion, and helps with knowledge distillation. To understand the capabilities and limitations of contemporary LLMs in style control, we evaluated five state-of-the-art m… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  14. arXiv:2406.11622  [pdf, other

    cs.CL

    Building Knowledge-Guided Lexica to Model Cultural Variation

    Authors: Shreya Havaldar, Salvatore Giorgi, Sunny Rai, Young-Min Cho, Thomas Talhelm, Sharath Chandra Guntuku, Lyle Ungar

    Abstract: Cultural variation exists between nations (e.g., the United States vs. China), but also within regions (e.g., California vs. Texas, Los Angeles vs. San Francisco). Measuring this regional cultural variation can illuminate how and why people think and behave differently. Historically, it has been difficult to computationally model cultural variation due to a lack of training data and scalability co… ▽ More

    Submitted 14 October, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted at NAACL 2024

  15. arXiv:2406.00509  [pdf, other

    cs.LG cs.AI

    Empirical influence functions to understand the logic of fine-tuning

    Authors: Jordan K. Matelsky, Lyle Ungar, Konrad P. Kording

    Abstract: Understanding the process of learning in neural networks is crucial for improving their performance and interpreting their behavior. This can be approximately understood by asking how a model's output is influenced when we fine-tune on a new training sample. There are desiderata for such influences, such as decreasing influence with semantic distance, sparseness, noise invariance, transitive causa… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  16. arXiv:2405.06058  [pdf, other

    cs.AI cs.CL cs.CY cs.HC

    Large Language Models Show Human-like Social Desirability Biases in Survey Responses

    Authors: Aadesh Salecha, Molly E. Ireland, Shashanka Subrahmanya, João Sedoc, Lyle H. Ungar, Johannes C. Eichstaedt

    Abstract: As Large Language Models (LLMs) become widely used to model and simulate human behavior, understanding their biases becomes critical. We developed an experimental framework using Big Five personality surveys and uncovered a previously undetected social desirability bias in a wide range of LLMs. By systematically varying the number of questions LLMs were exposed to, we demonstrate their ability to… ▽ More

    Submitted 21 November, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: 3 pages, 2 figures, accepted at PNAS Nexus

  17. arXiv:2402.11477  [pdf, other

    cs.CY

    Cross-Cultural Differences in Mental Health Expressions on Social Media

    Authors: Sunny Rai, Khushi Shelat, Devansh R Jain, Kishen Sivabalan, Young Min Cho, Maitreyi Redkar, Samindara Sawant, Lyle H. Ungar, Sharath Chandra Guntuku

    Abstract: Culture moderates the way individuals perceive and express mental distress. Current understandings of mental health expressions on social media, however, are predominantly derived from WEIRD (Western, Educated, Industrialized, Rich, and Democratic) contexts. To address this gap, we examine mental health posts on Reddit made by individuals geolocated in India, to identify variations in social media… ▽ More

    Submitted 8 February, 2025; v1 submitted 18 February, 2024; originally announced February 2024.

  18. arXiv:2402.11333  [pdf, other

    cs.CY

    Social Norms in Cinema: A Cross-Cultural Analysis of Shame, Pride and Prejudice

    Authors: Sunny Rai, Khushang Jilesh Zaveri, Shreya Havaldar, Soumna Nema, Lyle Ungar, Sharath Chandra Guntuku

    Abstract: Shame and pride are social emotions expressed across cultures to motivate and regulate people's thoughts, feelings, and behaviors. In this paper, we introduce the first cross-cultural dataset of over 10k shame/pride-related expressions, with underlying social expectations from ~5.4K Bollywood and Hollywood movies. We examine how and why shame and pride are expressed across cultures using a blend o… ▽ More

    Submitted 8 February, 2025; v1 submitted 17 February, 2024; originally announced February 2024.

  19. arXiv:2401.05254  [pdf, other

    cs.CY cs.CL

    Language-based Valence and Arousal Expressions between the United States and China: a Cross-Cultural Examination

    Authors: Young-Min Cho, Dandan Pang, Stuti Thapa, Garrick Sherman, Lyle Ungar, Louis Tay, Sharath Chandra Guntuku

    Abstract: While affective expressions on social media have been extensively studied, most research has focused on the Western context. This paper explores cultural differences in affective expressions by comparing valence and arousal on Twitter/X (geolocated to the US) and Sina Weibo (in Mainland China). Using the NRC-VAD lexicon to measure valence and arousal, we identify distinct patterns of emotional exp… ▽ More

    Submitted 8 February, 2025; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Accepted to Findings of NAACL 2025

  20. arXiv:2311.00577  [pdf, other

    stat.ML cs.LG econ.EM stat.ME

    Personalized Assignment to One of Many Treatment Arms via Regularized and Clustered Joint Assignment Forests

    Authors: Rahul Ladhania, Jann Spiess, Lyle Ungar, Wenbo Wu

    Abstract: We consider learning personalized assignments to one of many treatment arms from a randomized controlled trial. Standard methods that estimate heterogeneous treatment effects separately for each arm may perform poorly in this case due to excess variance. We instead propose methods that pool information across treatment arms: First, we consider a regularized forest-based assignment algorithm based… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  21. arXiv:2310.17017  [pdf, other

    cs.CL cs.AI

    An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives

    Authors: Young Min Cho, Sunny Rai, Lyle Ungar, João Sedoc, Sharath Chandra Guntuku

    Abstract: Mental health conversational agents (a.k.a. chatbots) are widely studied for their potential to offer accessible support to those experiencing mental health challenges. Previous surveys on the topic primarily consider papers published in either computer science or medicine, leading to a divide in understanding and hindering the sharing of beneficial knowledge between both domains. To bridge this g… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted in EMNLP 2023 Main Conference, camera ready

  22. arXiv:2310.07135  [pdf, other

    cs.CL

    Comparing Styles across Languages: A Cross-Cultural Exploration of Politeness

    Authors: Shreya Havaldar, Matthew Pressimone, Eric Wong, Lyle Ungar

    Abstract: Understanding how styles differ across languages is advantageous for training both humans and computers to generate culturally appropriate text. We introduce an explanation framework to extract stylistic differences from multilingual LMs and compare styles across languages. Our framework (1) generates comprehensive style lexica in any language and (2) consolidates feature importances from LMs into… ▽ More

    Submitted 26 March, 2025; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  23. arXiv:2308.15352  [pdf

    cs.CL cs.SI physics.soc-ph

    Historical patterns of rice farming explain modern-day language use in China and Japan more than modernization and urbanization

    Authors: Sharath Chandra Guntuku, Thomas Talhelm, Garrick Sherman, Angel Fan, Salvatore Giorgi, Liuqing Wei, Lyle H. Ungar

    Abstract: We used natural language processing to analyze a billion words to study cultural differences on Weibo, one of China's largest social media platforms. We compared predictions from two common explanations about cultural differences in China (economic development and urban-rural differences) against the less-obvious legacy of rice versus wheat farming. Rice farmers had to coordinate shared irrigation… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Includes Supplemental Materials

  24. arXiv:2307.01370  [pdf, other

    cs.CL

    Multilingual Language Models are not Multicultural: A Case Study in Emotion

    Authors: Shreya Havaldar, Sunny Rai, Bhumika Singhal, Langchen Liu, Sharath Chandra Guntuku, Lyle Ungar

    Abstract: Emotions are experienced and expressed differently across the world. In order to use Large Language Models (LMs) for multilingual tasks that require emotional sensitivity, LMs must reflect this cultural variation in emotion. In this study, we investigate whether the widely-used multilingual LMs in 2023 reflect differences in emotional expressions across cultures and languages. We find that embeddi… ▽ More

    Submitted 9 July, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted to WASSA at ACL 2023

  25. arXiv:2306.00976  [pdf, other

    cs.CL

    TopEx: Topic-based Explanations for Model Comparison

    Authors: Shreya Havaldar, Adam Stein, Eric Wong, Lyle Ungar

    Abstract: Meaningfully comparing language models is challenging with current explanation methods. Current explanations are overwhelming for humans due to large vocabularies or incomparable across models. We present TopEx, an explanation method that enables a level playing field for comparing language models via model-agnostic topics. We demonstrate how TopEx can identify similarities and differences between… ▽ More

    Submitted 1 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICLR 2023, Tiny Papers Track

  26. arXiv:2305.14757  [pdf, other

    cs.CL

    Psychological Metrics for Dialog System Evaluation

    Authors: Salvatore Giorgi, Shreya Havaldar, Farhan Ahmed, Zuhaib Akhtar, Shalaka Vaidya, Gary Pan, Lyle H. Ungar, H. Andrew Schwartz, Joao Sedoc

    Abstract: We present metrics for evaluating dialog systems through a psychologically-grounded "human" lens in which conversational agents express a diversity of both states (e.g., emotion) and traits (e.g., personality), just as people do. We present five interpretable metrics from established psychology that are fundamental to human communication and relationships: emotional entropy, linguistic style and e… ▽ More

    Submitted 15 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  27. Interactive Concept Learning for Uncovering Latent Themes in Large Text Collections

    Authors: Maria Leonor Pacheco, Tunazzina Islam, Lyle Ungar, Ming Yin, Dan Goldwasser

    Abstract: Experts across diverse disciplines are often interested in making sense of large text collections. Traditionally, this challenge is approached either by noisy unsupervised techniques such as topic models, or by following a manual theme discovery process. In this paper, we expand the definition of a theme to account for more than just a word distribution, and include generalized concepts deemed rel… ▽ More

    Submitted 21 October, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL: ACL 2023

  28. arXiv:2211.11087  [pdf, other

    cs.CL cs.AI

    Conceptor-Aided Debiasing of Large Language Models

    Authors: Li S. Yifei, Lyle Ungar, João Sedoc

    Abstract: Pre-trained large language models (LLMs) reflect the inherent social biases of their training corpus. Many methods have been proposed to mitigate this issue, but they often fail to debias or they sacrifice model accuracy. We use conceptors--a soft projection method--to identify and remove the bias subspace in LLMs such as BERT and GPT. We propose two methods of applying conceptors (1) bias subspac… ▽ More

    Submitted 30 October, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: 25 pages

  29. arXiv:2210.07469  [pdf, other

    cs.CL

    StyLEx: Explaining Style Using Human Lexical Annotations

    Authors: Shirley Anugrah Hayati, Kyumin Park, Dheeraj Rajagopal, Lyle Ungar, Dongyeop Kang

    Abstract: Large pre-trained language models have achieved impressive results on various style classification tasks, but they often learn spurious domain-specific words to make predictions (Hayati et al., 2021). While human explanation highlights stylistic tokens as important features for this task, we observe that model explanations often do not align with them. To tackle this issue, we introduce StyLEx, a… ▽ More

    Submitted 14 April, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: EACL 2023

  30. arXiv:2205.12698  [pdf, other

    cs.CL

    Empathic Conversations: A Multi-level Dataset of Contextualized Conversations

    Authors: Damilola Omitaomu, Shabnam Tafreshi, Tingting Liu, Sven Buechel, Chris Callison-Burch, Johannes Eichstaedt, Lyle Ungar, João Sedoc

    Abstract: Empathy is a cognitive and emotional reaction to an observed situation of others. Empathy has recently attracted interest because it has numerous applications in psychology and AI, but it is unclear how different forms of empathy (e.g., self-report vs counterpart other-report, concern vs. distress) interact with other affective phenomena or demographics like gender and age. To better understand th… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: 21 pages

  31. A Holistic Framework for Analyzing the COVID-19 Vaccine Debate

    Authors: Maria Leonor Pacheco, Tunazzina Islam, Monal Mahajan, Andrey Shor, Ming Yin, Lyle Ungar, Dan Goldwasser

    Abstract: The Covid-19 pandemic has led to infodemic of low quality information leading to poor health decisions. Combating the outcomes of this infodemic is not only a question of identifying false claims, but also reasoning about the decisions individuals make. In this work we propose a holistic analysis framework connecting stance and reason analysis, and fine-grained entity level moral sentiment analysi… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL 2022

    Journal ref: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

  32. arXiv:2202.01802  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Different Affordances on Facebook and SMS Text Messaging Do Not Impede Generalization of Language-Based Predictive Models

    Authors: Tingting Liu, Salvatore Giorgi, Xiangyu Tao, Sharath Chandra Guntuku, Douglas Bellew, Brenda Curtis, Lyle Ungar

    Abstract: Adaptive mobile device-based health interventions often use machine learning models trained on non-mobile device data, such as social media text, due to the difficulty and high expense of collecting large text message (SMS) data. Therefore, understanding the differences and generalization of models between these platforms is crucial for proper deployment. We examined the psycho-linguistic differen… ▽ More

    Submitted 23 May, 2023; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: Accepted to the 17th International AAAI Conference on Web and Social Media (ICWSM), 2023

  33. arXiv:2201.07372  [pdf, other

    cs.LG cs.AI

    Prospective Learning: Principled Extrapolation to the Future

    Authors: Ashwin De Silva, Rahul Ramesh, Lyle Ungar, Marshall Hussain Shuler, Noah J. Cowan, Michael Platt, Chen Li, Leyla Isik, Seung-Eon Roh, Adam Charles, Archana Venkataraman, Brian Caffo, Javier J. How, Justus M Kebschull, John W. Krakauer, Maxim Bichuch, Kaleab Alemayehu Kinfu, Eva Yezerets, Dinesh Jayaraman, Jong M. Shin, Soledad Villar, Ian Phillips, Carey E. Priebe, Thomas Hartung, Michael I. Miller , et al. (18 additional authors not shown)

    Abstract: Learning is a process which can update decision rules, based on past experience, such that future performance improves. Traditionally, machine learning is often evaluated under the assumption that the future will be identical to the past in distribution or change adversarially. But these assumptions can be either too optimistic or pessimistic for many problems in the real world. Real world scenari… ▽ More

    Submitted 13 July, 2023; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: Accepted at the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023

  34. arXiv:2110.15726  [pdf, other

    cs.CL cs.AI cs.CY cs.SI

    Social Media Reveals Urban-Rural Differences in Stress across China

    Authors: Jesse Cui, Tingdan Zhang, Kokil Jaidka, Dandan Pang, Garrick Sherman, Vinit Jakhetiya, Lyle Ungar, Sharath Chandra Guntuku

    Abstract: Modeling differential stress expressions in urban and rural regions in China can provide a better understanding of the effects of urbanization on psychological well-being in a country that has rapidly grown economically in the last two decades. This paper studies linguistic differences in the experiences and expressions of stress in urban-rural China from Weibo posts from over 65,000 users across… ▽ More

    Submitted 3 November, 2021; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: Accepted at AAAI Conference on Web and Social Media (ICWSM) 2022

  35. arXiv:2109.02738  [pdf, other

    cs.CL

    Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica

    Authors: Shirley Anugrah Hayati, Dongyeop Kang, Lyle Ungar

    Abstract: People convey their intention and attitude through linguistic styles of the text that they write. In this study, we investigate lexicon usages across styles throughout two lenses: human perception and machine word importance, since words differ in the strength of the stylistic cues that they provide. To collect labels of human perception, we curate a new dataset, Hummingbird, on top of benchmarkin… ▽ More

    Submitted 12 November, 2021; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021 Main Conference, updated typos and Appendix

  36. arXiv:2011.03983  [pdf, other

    cs.CL cs.HC cs.SI

    Detecting Emerging Symptoms of COVID-19 using Context-based Twitter Embeddings

    Authors: Roshan Santosh, H. Andrew Schwartz, Johannes C. Eichstaedt, Lyle H. Ungar, Sharath C. Guntuku

    Abstract: In this paper, we present an iterative graph-based approach for the detection of symptoms of COVID-19, the pathology of which seems to be evolving. More generally, the method can be applied to finding context-specific words and texts (e.g. symptom mentions) in large imbalanced corpora (e.g. all tweets mentioning #COVID-19). Given the novelty of COVID-19, we also test if the proposed approach gener… ▽ More

    Submitted 8 November, 2020; originally announced November 2020.

    Comments: In proceedings of EMNLP 2020 (Empirical Methods in NLP) workshop on COVID-19

  37. arXiv:2010.04900  [pdf, other

    cs.CL cs.AI

    Toward Micro-Dialect Identification in Diaglossic and Code-Switched Environments

    Authors: Muhammad Abdul-Mageed, Chiyu Zhang, AbdelRahim Elmadany, Lyle Ungar

    Abstract: Although the prediction of dialects is an important language processing task, with a wide range of applications, existing work is largely limited to coarse-grained varieties. Inspired by geolocation research, we propose the novel task of Micro-Dialect Identification (MDI) and introduce MARBERT, a new language model with striking abilities to predict a fine-grained variety (as small as that of a ci… ▽ More

    Submitted 7 December, 2020; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: Accepted in EMNLP 2020

  38. arXiv:2008.02449  [pdf, other

    cs.SI cs.CY

    Studying Politeness across Cultures Using English Twitter and Mandarin Weibo

    Authors: Mingyang Li, Louis Hickman, Louis Tay, Lyle Ungar, Sharath Chandra Guntuku

    Abstract: Modeling politeness across cultures helps to improve intercultural communication by uncovering what is considered appropriate and polite. We study the linguistic features associated with politeness across US English and Mandarin Chinese. First, we annotate 5,300 Twitter posts from the US and 5,300 Sina Weibo posts from China for politeness scores. Next, we develop an English and Chinese politeness… ▽ More

    Submitted 24 August, 2020; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted for CSCW 2020. To be published in PACM HCI

  39. arXiv:2006.07155  [pdf, other

    cs.LG stat.ML

    Generalized SHAP: Generating multiple types of explanations in machine learning

    Authors: Dillon Bowen, Lyle Ungar

    Abstract: Many important questions about a model cannot be answered just by explaining how much each feature contributes to its output. To answer a broader set of questions, we generalize a popular, mathematically well-grounded explanation technique, Shapley Additive Explanations (SHAP). Our new method - Generalized Shapley Additive Explanations (G-SHAP) - produces many additional types of explanations, inc… ▽ More

    Submitted 15 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 12 pages, 7 figures. Based on a submission to NeurIPS 2020. Dillon Bowen is credited with the original concept, code, data analysis, and initial paper draft. Lyle Ungar is credited with contributions to the draft and mathematical notation. Documentation can be found at https://dsbowen.github.io/gshap/

  40. arXiv:1912.01079  [pdf, other

    cs.CL cs.IR

    Learning Word Ratings for Empathy and Distress from Document-Level User Responses

    Authors: João Sedoc, Sven Buechel, Yehonathan Nachmany, Anneke Buffone, Lyle Ungar

    Abstract: Despite the excellent performance of black box approaches to modeling sentiment and emotion, lexica (sets of informative words and associated weights) that characterize different emotions are indispensable to the NLP community because they allow for interpretable and robust predictions. Emotion analysis of text is increasing in popularity in NLP; however, manually creating lexica for psychological… ▽ More

    Submitted 16 May, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: LREC 2020 camera-ready copy

    Journal ref: Proceedings of The 12th Language Resources and Evaluation Conference (LREC 2020). Pages 1657-1666

  41. arXiv:1911.03855  [pdf, other

    cs.SI cs.CL cs.CY

    Correcting Sociodemographic Selection Biases for Population Prediction from Social Media

    Authors: Salvatore Giorgi, Veronica Lynn, Keshav Gupta, Farhan Ahmed, Sandra Matz, Lyle Ungar, H. Andrew Schwartz

    Abstract: Social media is increasingly used for large-scale population predictions, such as estimating community health statistics. However, social media users are not typically a representative sample of the intended population -- a "selection bias". Within the social sciences, such a bias is typically addressed with restratification techniques, where observations are reweighted according to how under- or… ▽ More

    Submitted 7 June, 2022; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Published at the 16th International AAAI Conference on Web and Social Media (ICWSM) 2022

  42. arXiv:1911.00637  [pdf, other

    cs.CL cs.LG

    Sentence-Level BERT and Multi-Task Learning of Age and Gender in Social Media

    Authors: Muhammad Abdul-Mageed, Chiyu Zhang, Arun Rajendran, AbdelRahim Elmadany, Michael Przystupa, Lyle Ungar

    Abstract: Social media currently provide a window on our lives, making it possible to learn how people from different places, with different backgrounds, ages, and genders use language. In this work we exploit a newly-created Arabic dataset with ground truth age and gender labels to learn these attributes both individually and in a multi-task setting at the sentence level. Our models are based on variations… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

  43. arXiv:1910.14243  [pdf, other

    cs.CL cs.LG

    DiaNet: BERT and Hierarchical Attention Multi-Task Learning of Fine-Grained Dialect

    Authors: Muhammad Abdul-Mageed, Chiyu Zhang, AbdelRahim Elmadany, Arun Rajendran, Lyle Ungar

    Abstract: Prediction of language varieties and dialects is an important language processing task, with a wide range of applications. For Arabic, the native tongue of ~ 300 million people, most varieties remain unsupported. To ease this bottleneck, we present a very large scale dataset covering 319 cities from all 21 Arab countries. We introduce a hierarchical attention multi-task learning (HA-MTL) approach… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

  44. arXiv:1906.05993  [pdf, other

    cs.CL

    Conceptor Debiasing of Word Representations Evaluated on WEAT

    Authors: Saket Karve, Lyle Ungar, João Sedoc

    Abstract: Bias in word embeddings such as Word2Vec has been widely investigated, and many efforts made to remove such bias. We show how to use conceptors debiasing to post-process both traditional and contextualized word embeddings. Our conceptor debiasing can simultaneously remove racial and gender biases and, unlike standard debiasing methods, can make effect use of heterogeneous lists of biased words. We… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

  45. arXiv:1904.09187  [pdf, other

    cs.LG stat.ML

    Continual Learning for Sentence Representations Using Conceptors

    Authors: Tianlin Liu, Lyle Ungar, João Sedoc

    Abstract: Distributed representations of sentences have become ubiquitous in natural language processing tasks. In this paper, we consider a continual learning scenario for sentence representations: Given a sequence of corpora, we aim to optimize the sentence encoder with respect to the new corpus while maintaining its accuracy on the old corpora. To address this problem, we propose to initialize sentence e… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: Accepted by NAACL-2019

  46. arXiv:1904.02671  [pdf, other

    cs.CL

    Studying Cultural Differences in Emoji Usage across the East and the West

    Authors: Sharath Chandra Guntuku, Mingyang Li, Louis Tay, Lyle H. Ungar

    Abstract: Global acceptance of Emojis suggests a cross-cultural, normative use of Emojis. Meanwhile, nuances in Emoji use across cultures may also exist due to linguistic differences in expressing emotions and diversity in conceptualizing topics. Indeed, literature in cross-cultural psychology has found both normative and culture-specific ways in which emotions are expressed. In this paper, using social med… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: ICWSM 2019

  47. arXiv:1904.02670  [pdf, other

    cs.HC cs.SI

    What Twitter Profile and Posted Images Reveal About Depression and Anxiety

    Authors: Sharath Chandra Guntuku, Daniel Preotiuc-Pietro, Johannes C. Eichstaedt, Lyle H. Ungar

    Abstract: Previous work has found strong links between the choice of social media images and users' emotions, demographics and personality traits. In this study, we examine which attributes of profile and posted images are associated with depression and anxiety of Twitter users. We used a sample of 28,749 Facebook users to build a language prediction model of survey-reported depression and anxiety, and vali… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: ICWSM 2019

  48. Expert-Augmented Machine Learning

    Authors: E. D. Gennatas, J. H. Friedman, L. H. Ungar, R. Pirracchio, E. Eaton, L. Reichman, Y. Interian, C. B. Simone, A. Auerbach, E. Delgado, M. J. Van der Laan, T. D. Solberg, G. Valdes

    Abstract: Machine Learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption by the level of trust that models afford users. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may… ▽ More

    Submitted 5 January, 2021; v1 submitted 22 March, 2019; originally announced March 2019.

  49. arXiv:1811.11002  [pdf, other

    cs.CL cs.LG stat.ML

    Correcting the Common Discourse Bias in Linear Representation of Sentences using Conceptors

    Authors: Tianlin Liu, João Sedoc, Lyle Ungar

    Abstract: Distributed representations of words, better known as word embeddings, have become important building blocks for natural language processing tasks. Numerous studies are devoted to transferring the success of unsupervised word embeddings to sentence embeddings. In this paper, we introduce a simple representation of sentences in which a sentence embedding is represented as a weighted average of word… ▽ More

    Submitted 17 November, 2018; originally announced November 2018.

    Comments: Accepted by the BioCreative/OHNLP workshop of ACM-BCB 2018

  50. arXiv:1811.11001  [pdf, other

    cs.CL cs.LG stat.ML

    Unsupervised Post-processing of Word Vectors via Conceptor Negation

    Authors: Tianlin Liu, Lyle Ungar, João Sedoc

    Abstract: Word vectors are at the core of many natural language processing tasks. Recently, there has been interest in post-processing word vectors to enrich their semantic information. In this paper, we introduce a novel word vector post-processing technique based on matrix conceptors (Jaeger2014), a family of regularized identity maps. More concretely, we propose to use conceptors to suppress those latent… ▽ More

    Submitted 2 December, 2018; v1 submitted 17 November, 2018; originally announced November 2018.

    Comments: Accepted by AAAI-2019