
Showing 1–50 of 83 results for author: May, J

Searching in archive cs.
  1. arXiv:2508.18541

    cs.CY

    Uncovering Intervention Opportunities for Suicide Prevention with Language Model Assistants

    Authors: Jaspreet Ranjit, Hyundong J. Cho, Claire J. Smerdon, Yoonsoo Nam, Myles Phung, Jonathan May, John R. Blosnich, Swabha Swayamdipta

    Abstract: Warning: This paper discusses topics of suicide and suicidal ideation, which may be distressing to some readers. The National Violent Death Reporting System (NVDRS) documents information about suicides in the United States, including free text narratives (e.g., circumstances surrounding a suicide). In a demanding public health data pipeline, annotators manually extract structured information fro…

    Submitted 29 August, 2025; v1 submitted 25 August, 2025; originally announced August 2025.

    Comments: Project Website: https://dill-lab.github.io/interventions_lm_assistants/

  2. arXiv:2508.18297

    cs.CV cs.AI cs.CL

    Can VLMs Recall Factual Associations From Visual References?

    Authors: Dhananjay Ashok, Ashutosh Chaubey, Hirona J. Arai, Jonathan May, Jesse Thomason

    Abstract: Through a controlled study, we identify a systematic deficiency in the multimodal grounding of Vision Language Models (VLMs). While VLMs can recall factual associations when provided a textual reference to an entity, their ability to do so is significantly diminished when the reference is visual instead. Forcing VLMs to rely on image representations of an entity halves their ability to recall fact…

    Submitted 22 August, 2025; originally announced August 2025.

    Comments: To appear at EMNLP 2025 (Findings)

  3. arXiv:2507.21389

    cs.AI cs.CL

    Teaching Language Models To Gather Information Proactively

    Authors: Tenghao Huang, Sihao Chen, Muhao Chen, Jonathan May, Longqi Yang, Mengting Wan, Pei Zhou

    Abstract: Large language models (LLMs) are increasingly expected to function as collaborative partners, engaging in back-and-forth dialogue to solve complex, ambiguous problems. However, current LLMs often falter in real-world settings, defaulting to passive responses or narrow clarifications when faced with incomplete or under-specified prompts, falling short of proactively gathering the missing informatio…

    Submitted 28 July, 2025; originally announced July 2025.

  4. arXiv:2506.21586

    cs.CL cs.AI cs.CV

    Can Vision Language Models Understand Mimed Actions?

    Authors: Hyundong Cho, Spencer Lin, Tejas Srinivasan, Michael Saxon, Deuksin Kwon, Natali T. Chavez, Jonathan May

    Abstract: Nonverbal communication (NVC) plays an integral role in human language, but studying NVC in general is challenging because of its broad scope and high variance in interpretation among individuals and cultures. However, mime -- the theatrical technique of suggesting intent using only gesture, expression, and movement -- is a subset of NVC that consists of explicit and embodied actions with much low…

    Submitted 7 August, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

    Comments: ACL 2025 Findings

  5. arXiv:2502.17383

    cs.CL

    Which Questions Improve Learning the Most? Utility Estimation of Questions with LM-based Simulations

    Authors: Dong-Ho Lee, Hyundong Cho, Jonathan May, Jay Pujara

    Abstract: Asking good questions is critical for comprehension and learning, yet evaluating and generating such questions remains a challenging problem. Prior work on inquisitive questions focuses on learner-generated, curiosity-driven queries and evaluates them using indirect metrics, such as salience or information gain, that do not directly capture a question's impact on actual learning outcomes. We intro…

    Submitted 7 August, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

    Comments: 17 pages, 5 figures, 6 tables

  6. arXiv:2502.13329

    cs.CL cs.AI cs.LG

    Language Models Can Predict Their Own Behavior

    Authors: Dhananjay Ashok, Jonathan May

    Abstract: The text produced by language models (LMs) can exhibit specific "behaviors," such as a failure to follow alignment training, that we hope to detect and react to during deployment. Identifying these behaviors can often only be done post facto, i.e., after the entire text of the output has been generated. We provide evidence that there are times when we can predict how an LM will behave early in com…

    Submitted 22 September, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: Presented at the Thirty-Ninth Annual Conference on Neural Information Processing Systems (2025)

  7. arXiv:2502.12436

    cs.CL

    Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL

    Authors: Wichayaporn Wongkamjan, Yanze Wang, Feng Gu, Denis Peskoff, Jonathan K. Kummerfeld, Jonathan May, Jordan Lee Boyd-Graber

    Abstract: An increasingly common socio-technical problem is people being taken in by offers that sound "too good to be true", where persuasion and trust shape decision-making. This paper investigates how AI can help detect these deceptive scenarios. We analyze how humans strategically deceive each other in Diplomacy, a board game that requires both natural language communication and strateg…

    Submitted 5 June, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: ACL Findings 2025

  8. arXiv:2502.08972

    cs.CL cs.AI

    Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context Learning

    Authors: Hyundong Cho, Karishma Sharma, Nicolaas Jedema, Leonardo F. R. Ribeiro, Alessandro Moschitti, Ravi Krishnan, Jonathan May

    Abstract: Language models are aligned to the collective voice of many, resulting in generic outputs that do not align with specific users' styles. In this work, we present Trial-Error-Explain In-Context Learning (TICL), a tuning-free method that personalizes language models for text generation tasks with fewer than 10 examples per user. TICL iteratively expands an in-context learning prompt via a trial-erro…

    Submitted 5 April, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    Comments: NAACL 2025 Findings

  9. arXiv:2501.12485

    cs.AI

    R2D2: Remembering, Replaying and Dynamic Decision Making with a Reflective Agentic Memory

    Authors: Tenghao Huang, Kinjal Basu, Ibrahim Abdelaziz, Pavan Kapanipathi, Jonathan May, Muhao Chen

    Abstract: The proliferation of web agents necessitates advanced navigation and interaction strategies within complex web environments. Current models often struggle with efficient navigation and action execution due to limited visibility and understanding of web structures. Our proposed R2D2 framework addresses these challenges by integrating two paradigms: Remember and Reflect. The Remember paradigm uses a…

    Submitted 22 July, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

    Comments: ACL 2025

  10. arXiv:2411.18811

    cs.CL cs.AI cs.DL

    NewsEdits 2.0: Learning the Intentions Behind Updating News

    Authors: Alexander Spangher, Kung-Hsiang Huang, Hyundong Cho, Jonathan May

    Abstract: As events progress, news articles often update with new information: if we are not cautious, we risk propagating outdated facts. In this work, we hypothesize that linguistic features indicate factual fluidity, and that we can predict which facts in a news article will update using solely the text of a news article (i.e. not external resources like search engines). We test this hypothesis, first, b…

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: 9 pages main body, 11 pages appendix

  11. arXiv:2411.13779

    cs.CL cs.AI cs.LG

    NewsInterview: a Dataset and a Playground to Evaluate LLMs' Ground Gap via Informational Interviews

    Authors: Michael Lu, Hyundong Justin Cho, Weiyan Shi, Jonathan May, Alexander Spangher

    Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities in generating coherent text but often struggle with grounding language and strategic dialogue. To address this gap, we focus on journalistic interviews, a domain rich in grounding communication and abundant in data. We curate a dataset of 40,000 two-person informational interviews from NPR and CNN, and reveal that LLMs are sign…

    Submitted 20 November, 2024; originally announced November 2024.

  12. arXiv:2411.09109

    cs.CL

    Personalized Help for Optimizing Low-Skilled Users' Strategy

    Authors: Feng Gu, Wichayaporn Wongkamjan, Jonathan K. Kummerfeld, Denis Peskoff, Jonathan May, Jordan Boyd-Graber

    Abstract: AIs can beat humans in game environments; however, how helpful those agents are to humans remains understudied. We augment CICERO, a natural language agent that demonstrates superhuman performance in Diplomacy, to generate both move and message advice based on player intentions. A dozen Diplomacy games with novice and experienced players, with varying advice settings, show that some of the generate…

    Submitted 21 February, 2025; v1 submitted 13 November, 2024; originally announced November 2024.

    Comments: 9 pages, 3 figures

  13. arXiv:2411.05192

    cs.CL cs.AI

    Explaining Mixtures of Sources in News Articles

    Authors: Alexander Spangher, James Youn, Matt DeButts, Nanyun Peng, Emilio Ferrara, Jonathan May

    Abstract: Human writers plan, then write. For large language models (LLMs) to play a role in longer-form article generation, we must understand the planning steps humans make before writing. We explore one kind of planning, source-selection in news, as a case-study for evaluating plans in long-form generation. We ask: why do specific stories call for specific kinds of sources? We imagine a generative proces…

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: 9 pages

  14. arXiv:2410.13098

    cs.CL cs.AI cs.LG

    A Little Human Data Goes A Long Way

    Authors: Dhananjay Ashok, Jonathan May

    Abstract: Faced with an expensive human annotation process, creators of NLP systems increasingly turn to synthetic data generation. While this method shows promise, the extent to which synthetic data can replace human annotation is poorly understood. We investigate the use of synthetic data in Fact Verification (FV) and Question Answering (QA) by studying the effects of incrementally replacing human generat…

    Submitted 19 August, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: ACL 2025

  15. arXiv:2409.14672

    cs.AI

    Speechworthy Instruction-tuned Language Models

    Authors: Hyundong Cho, Nicolaas Jedema, Leonardo F. R. Ribeiro, Karishma Sharma, Pedro Szekely, Alessandro Moschitti, Ruben Janssen, Jonathan May

    Abstract: Current instruction-tuned language models are exclusively trained with textual preference data and thus are often not aligned with the unique requirements of other modalities, such as speech. To better align language models with the speech domain, we explore (i) prompting strategies grounded in radio-industry best practices and (ii) preference learning using a novel speech-based preference data of…

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: EMNLP2024

  16. arXiv:2409.12832

    cs.CL cs.AI

    FoodPuzzle: Developing Large Language Model Agents as Flavor Scientists

    Authors: Tenghao Huang, Donghee Lee, John Sweeney, Jiatong Shi, Emily Steliotes, Matthew Lange, Jonathan May, Muhao Chen

    Abstract: Flavor development in the food industry is increasingly challenged by the need for rapid innovation and precise flavor profile creation. Traditional flavor research methods typically rely on iterative, subjective testing, which lacks the efficiency and scalability required for modern demands. This paper presents three contributions to address the challenges. Firstly, we define a new problem domain…

    Submitted 6 October, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

  17. arXiv:2407.17770

    cs.CL

    BotEval: Facilitating Interactive Human Evaluation

    Authors: Hyundong Cho, Thamme Gowda, Yuyang Huang, Zixun Lu, Tianli Tong, Jonathan May

    Abstract: Following the rapid progress in natural language processing (NLP) models, language models are applied to increasingly more complex interactive tasks such as negotiations and conversation moderations. Having human evaluators directly interact with these NLP models is essential for adequately evaluating the performance on such interactive tasks. We develop BotEval, an easily customizable, open-sourc…

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: ACL 2024 SDT, 10 pages

  18. arXiv:2407.13248

    cs.CL

    Are Large Language Models Capable of Generating Human-Level Narratives?

    Authors: Yufei Tian, Tenghao Huang, Miri Liu, Derek Jiang, Alexander Spangher, Muhao Chen, Jonathan May, Nanyun Peng

    Abstract: This paper investigates the capability of LLMs in storytelling, focusing on narrative development and plot progression. We introduce a novel computational framework to analyze narratives through three discourse-level aspects: i) story arcs, ii) turning points, and iii) affective dimensions, including arousal and valence. By leveraging expert and automatic annotations, we uncover significant discre…

    Submitted 4 October, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: EMNLP 2024

  19. arXiv:2406.11581

    cs.CL

    Style Transfer with Multi-iteration Preference Optimization

    Authors: Shuai Liu, Jonathan May

    Abstract: Numerous recent techniques for text style transfer characterize their approaches as variants of reinforcement learning and preference optimization. In this work, we consider the relationship between these approaches and a class of optimization approaches developed primarily for (non-neural) statistical machine translation, formerly known as "tuning". Inspired by these techniques from the past, we…

    Submitted 28 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  20. arXiv:2406.10764

    cs.CL

    GNOME: Generating Negotiations through Open-Domain Mapping of Exchanges

    Authors: Darshan Deshpande, Shambhavi Sinha, Anirudh Ravi Kumar, Debaditya Pal, Jonathan May

    Abstract: Language Models have previously shown strong negotiation capabilities in closed domains where the negotiation strategy prediction scope is constrained to a specific setup. In this paper, we first show that these models are not generalizable beyond their original training domain despite their wide-scale pretraining. Following this, we propose an automated framework called GNOME, which processes exi…

    Submitted 15 June, 2024; originally announced June 2024.

  21. More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play

    Authors: Wichayaporn Wongkamjan, Feng Gu, Yanze Wang, Ulf Hermjakob, Jonathan May, Brandon M. Stewart, Jonathan K. Kummerfeld, Denis Peskoff, Jordan Lee Boyd-Graber

    Abstract: The boardgame Diplomacy is a challenging setting for communicative and cooperative artificial intelligence. The most prominent communicative Diplomacy AI, Cicero, has excellent strategic abilities, exceeding human players. However, the best Diplomacy players master communication, not just tactics, which is why the game has received attention as an AI challenge. This work seeks to understand the de…

    Submitted 7 June, 2024; originally announced June 2024.

  22. arXiv:2405.15760

    cs.CL cs.CY

    GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction

    Authors: Virginia K. Felkner, Jennifer A. Thompson, Jonathan May

    Abstract: Social biases in LLMs are usually measured via bias benchmark datasets. Current benchmarks have limitations in scope, grounding, quality, and human effort required. Previous work has shown success with a community-sourced, rather than crowd-sourced, approach to benchmark development. However, this work still required considerable effort from annotators with relevant lived experience. This paper ex…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL 2024 (main conference)

    ACM Class: I.2.7; K.4.2

  23. arXiv:2404.09510

    eess.SP cs.LG cs.NE q-bio.NC

    Listen to the Waves: Using a Neuronal Model of the Human Auditory System to Predict Ocean Waves

    Authors: Artur Matysiak, Volker Roeber, Henrik Kalisch, Reinhard König, Patrick J. C. May

    Abstract: Artificial neural networks (ANNs) have evolved from the 1940s primitive models of brain function to become tools for artificial intelligence. They comprise many units, artificial neurons, interlinked through weighted connections. ANNs are trained to perform tasks through learning rules that modify the connection weights. With these rules being in the focus of research, ANNs have become a branch of…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 23 pages, 6 figures

  24. arXiv:2404.08801

    cs.LG cs.CL

    Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

    Authors: Xuezhe Ma, Xiaomeng Yang, Wenhan Xiong, Beidi Chen, Lili Yu, Hao Zhang, Jonathan May, Luke Zettlemoyer, Omer Levy, Chunting Zhou

    Abstract: The quadratic complexity and weak length extrapolation of Transformers limit their ability to scale to long sequences, and while sub-quadratic solutions like linear attention and state space models exist, they empirically underperform Transformers in pretraining efficiency and downstream task accuracy. We introduce Megalodon, a neural architecture for efficient sequence modeling with unlimited co…

    Submitted 16 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures and 8 tables

  25. arXiv:2403.08043

    cs.CL

    Authorship Style Transfer with Policy Optimization

    Authors: Shuai Liu, Shantanu Agarwal, Jonathan May

    Abstract: Authorship style transfer aims to rewrite a given text into a specified target while preserving the original meaning in the source. Existing approaches rely on the availability of a large number of target style exemplars for model training. However, these overlook cases where a limited number of target style examples are available. The development of parameter-efficient transfer learning technique…

    Submitted 28 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  26. arXiv:2311.10781

    cs.CL cs.AI

    Can Language Model Moderators Improve the Health of Online Discourse?

    Authors: Hyundong Cho, Shuai Liu, Taiwei Shi, Darpan Jain, Basem Rizk, Yuyang Huang, Zixun Lu, Nuan Wen, Jonathan Gratch, Emilio Ferrara, Jonathan May

    Abstract: Conversational moderation of online communities is crucial to maintaining civility for a constructive environment, but it is challenging to scale and harmful to moderators. The inclusion of sophisticated natural language generation modules as a force multiplier to aid human moderators is a tantalizing prospect, but adequate evaluation approaches have so far been elusive. In this paper, we establis…

    Submitted 6 May, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 9 pages, NAACL 2024 Main

  27. arXiv:2311.09734

    cs.CL

    Tracking the Newsworthiness of Public Documents

    Authors: Alexander Spangher, Emilio Ferrara, Ben Welsh, Nanyun Peng, Serdar Tumgoren, Jonathan May

    Abstract: Journalists must find stories in huge amounts of textual data (e.g. leaks, bills, press releases) as part of their jobs: determining when and why text becomes news can help us understand coverage patterns and help us build assistive tools. Yet, this is challenging because very few labelled links exist, language use between corpora is very different, and text may be covered for a variety of reasons…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 9 pages, 7 pages appendix

  28. arXiv:2309.08185

    cs.CL

    Multilingual Sentence-Level Semantic Search using Meta-Distillation Learning

    Authors: Meryem M'hamdi, Jonathan May, Franck Dernoncourt, Trung Bui, Seunghyun Yoon

    Abstract: Multilingual semantic search is the task of retrieving relevant contents to a query expressed in different language combinations. This requires a better semantic understanding of the user's intent and its contextual meaning. Multilingual semantic search is less explored and more challenging than its monolingual or bilingual counterparts, due to the lack of multilingual parallel resources for this…

    Submitted 15 September, 2023; originally announced September 2023.

  29. arXiv:2306.15087

    cs.CL cs.CY

    WinoQueer: A Community-in-the-Loop Benchmark for Anti-LGBTQ+ Bias in Large Language Models

    Authors: Virginia K. Felkner, Ho-Chun Herbert Chang, Eugene Jang, Jonathan May

    Abstract: We present WinoQueer: a benchmark specifically designed to measure whether large language models (LLMs) encode biases that are harmful to the LGBTQ+ community. The benchmark is community-sourced, via application of a novel method that generates a bias benchmark from a community survey. We apply our benchmark to several popular LLMs and find that off-the-shelf models generally do exhibit considerab…

    Submitted 17 October, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2023 (main conference). This version corrects a bug found in evaluation code after publication. General findings have not changed, but tables 5 and 6 and figure 1 have been corrected

  30. arXiv:2306.07206

    cs.CL cs.AI

    RECAP: Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation

    Authors: Shuai Liu, Hyundong J. Cho, Marjorie Freedman, Xuezhe Ma, Jonathan May

    Abstract: Endowing chatbots with a consistent persona is essential to an engaging conversation, yet it remains an unresolved challenge. In this work, we propose a new retrieval-enhanced approach for personalized response generation. Specifically, we design a hierarchical transformer retriever trained on dialogue domain data to perform personalized retrieval and a context-aware prefix encoder that fuses the…

    Submitted 12 June, 2023; originally announced June 2023.

  31. arXiv:2305.14904

    cs.CL cs.AI cs.CY

    Identifying Informational Sources in News Articles

    Authors: Alexander Spangher, Nanyun Peng, Jonathan May, Emilio Ferrara

    Abstract: News articles are driven by the informational sources journalists use in reporting. Modeling when, how and why sources get used together in stories can help us better understand the information we consume and even help journalists with the task of producing it. In this work, we take steps toward this goal by constructing the largest and widest-ranging annotated dataset, to date, of informational s…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 13 pages

  32. arXiv:2305.13751

    cs.CL

    Challenges in Context-Aware Neural Machine Translation

    Authors: Linghao Jin, Jacqueline He, Jonathan May, Xuezhe Ma

    Abstract: Context-aware neural machine translation involves leveraging information beyond sentence-level context to resolve inter-sentential discourse dependencies and improve document-level translation quality, and has given rise to a number of recent techniques. However, despite well-reasoned intuitions, most context-aware translation models show only modest improvements over sentence-level systems. In th…

    Submitted 23 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023

  33. arXiv:2305.13721

    cs.CL cs.AI

    Continual Dialogue State Tracking via Example-Guided Question Answering

    Authors: Hyundong Cho, Andrea Madotto, Zhaojiang Lin, Khyathi Raghavi Chandu, Satwik Kottur, Jing Xu, Jonathan May, Chinnadhurai Sankar

    Abstract: Dialogue systems are frequently updated to accommodate new services, but naively updating them by continually training with data for new services results in diminishing performance on previously learnt services. Motivated by the insight that dialogue state tracking (DST), a crucial component of dialogue systems that estimates the user's goal as a conversation proceeds, is a simple natural language underst…

    Submitted 29 September, 2025; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 11 pages, EMNLP 2023

  34. arXiv:2305.10731

    cs.CL

    Analyzing Norm Violations in Live-Stream Chat

    Authors: Jihyung Moon, Dong-Ho Lee, Hyundong Cho, Woojeong Jin, Chan Young Park, Minwoo Kim, Jonathan May, Jay Pujara, Sungjoon Park

    Abstract: Toxic language, such as hate speech, can deter users from participating in online communities and enjoying popular platforms. Previous approaches to detecting toxic language and norm violations have been primarily concerned with conversations from online forums and social media, such as Reddit and Twitter. These approaches are less effective when applied to conversations on live-streaming platform…

    Submitted 7 October, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: 17 pages, 8 figures, 15 tables

  35. arXiv:2305.09846

    cs.CL cs.SI

    CPL-NoViD: Context-Aware Prompt-based Learning for Norm Violation Detection in Online Communities

    Authors: Zihao He, Jonathan May, Kristina Lerman

    Abstract: Detecting norm violations in online communities is critical to maintaining healthy and safe spaces for online discussions. Existing machine learning approaches often struggle to adapt to the diverse rules and interpretations across different communities due to the inherent challenges of fine-tuning models for such context-specific tasks. In this paper, we introduce Context-aware Prompt-based Learn…

    Submitted 16 April, 2024; v1 submitted 16 May, 2023; originally announced May 2023.

  36. arXiv:2212.00339

    cs.CL cs.CY

    Anger Breeds Controversy: Analyzing Controversy and Emotions on Reddit

    Authors: Kai Chen, Zihao He, Rong-Ching Chang, Jonathan May, Kristina Lerman

    Abstract: Emotions play an important role in interpersonal interactions and social conflict, yet their function in the development of controversy and disagreement in online conversations has not been explored. To address this gap, we study controversy on Reddit, a popular network of online discussion forums. We collect discussions from a wide variety of topical forums and use emotion detection to recognize…

    Submitted 1 December, 2022; originally announced December 2022.

  37. arXiv:2210.05096

    cs.CL cs.AI cs.CY

    Checks and Strategies for Enabling Code-Switched Machine Translation

    Authors: Thamme Gowda, Mozhdeh Gheini, Jonathan May

    Abstract: Code-switching is a common phenomenon among multilingual speakers, where alternation between two or more languages occurs within the context of a single conversation. While multilingual humans can seamlessly switch back and forth between languages, multilingual neural machine translation (NMT) models are not robust to such sudden changes in input. This work explores multilingual NMT models' abilit…

    Submitted 10 October, 2022; originally announced October 2022.

  38. arXiv:2209.10655

    cs.LG

    Mega: Moving Average Equipped Gated Attention

    Authors: Xuezhe Ma, Chunting Zhou, Xiang Kong, Junxian He, Liangke Gui, Graham Neubig, Jonathan May, Luke Zettlemoyer

    Abstract: The design choices in the Transformer attention mechanism, including weak inductive bias and quadratic computational complexity, have limited its application for modeling long sequences. In this paper, we introduce Mega, a simple, theoretically grounded, single-head gated attention mechanism equipped with (exponential) moving average to incorporate inductive bias of position-aware local dependenci…

    Submitted 28 January, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: Accepted by ICLR 2023. Final version (updating MT results). 13 pages, 4 figures and 7 tables

  39. arXiv:2206.11484

    cs.CL cs.CY

    Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models

    Authors: Virginia K. Felkner, Ho-Chun Herbert Chang, Eugene Jang, Jonathan May

    Abstract: This paper presents exploratory work on whether and to what extent biases against queer and trans people are encoded in large language models (LLMs) such as BERT. We also propose a method for reducing these biases in downstream tasks: finetuning the models on data written by and/or about queer people. To measure anti-queer bias, we introduce a new benchmark dataset, WinoQueer, modeled after other…

    Submitted 7 July, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: Accepted to Queer in AI Workshop @ NAACL 2022. Updated 07/07 with minor typographical fixes

    ACM Class: I.2.7

  40. arXiv:2206.11083

    cs.CL cs.AI

    Investigating the Benefits of Free-Form Rationales

    Authors: Jiao Sun, Swabha Swayamdipta, Jonathan May, Xuezhe Ma

    Abstract: Free-form rationales aim to aid model interpretability by supplying the background knowledge that can help understand model decisions. Crowdsourced rationales are provided for commonsense QA instances in popular datasets such as CoS-E and ECQA, but their utility remains under-investigated. We present human studies which show that ECQA rationales indeed provide additional background information to…

    Submitted 25 October, 2022; v1 submitted 25 May, 2022; originally announced June 2022.

    Comments: EMNLP 2022, Findings

  41. arXiv:2206.07106

    cs.CL

    NewsEdits: A News Article Revision Dataset and a Document-Level Reasoning Challenge

    Authors: Alexander Spangher, Xiang Ren, Jonathan May, Nanyun Peng

    Abstract: News article revision histories provide clues to narrative and factual evolution in news articles. To facilitate analysis of this evolution, we present the first publicly available dataset of news revision histories, NewsEdits. Our dataset is large-scale and multilingual; it contains 1.2 million articles with 4.6 million versions from over 22 English- and French-language newspaper sources based in…

    Submitted 14 June, 2022; originally announced June 2022.

    Journal ref: 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics

  42. arXiv:2205.12527

    cs.CL

    Segmenting Numerical Substitution Ciphers

    Authors: Nada Aldarrab, Jonathan May

    Abstract: Deciphering historical substitution ciphers is a challenging problem. Example problems that have been previously studied include detecting cipher type, detecting plaintext language, and acquiring the substitution key for segmented ciphers. However, attacking unsegmented, space-free ciphers is still a challenging task. Segmentation (i.e. finding substitution units) is the first step towards crackin…

    Submitted 25 May, 2022; originally announced May 2022.

  43. arXiv:2205.12514  [pdf, other]

    cs.CL

    Machine Translation Robustness to Natural Asemantic Variation

    Authors: Jacob Bremerman, Xiang Ren, Jonathan May

    Abstract: Current Machine Translation (MT) models still struggle with more challenging input, such as noisy data and tail-end words and phrases. Several works have addressed this robustness issue by identifying specific categories of noise and variation then tuning models to perform better on them. An important yet under-studied category involves minor variations in nuance (non-typos) that preserve meaning…

    Submitted 9 November, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted to EMNLP 2022

  44. arXiv:2205.12453  [pdf, other]

    cs.CL

    Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-Tuning

    Authors: Mozhdeh Gheini, Xuezhe Ma, Jonathan May

    Abstract: A recent family of techniques, dubbed lightweight fine-tuning methods, facilitates parameter-efficient transfer learning by updating only a small set of additional parameters while keeping the parameters of the pretrained language model frozen. While proven to be an effective method, there are no existing studies on if and how such knowledge of the downstream fine-tuning approach should affect the…

    Submitted 8 December, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  45. arXiv:2205.11152  [pdf, other]

    cs.CL

    Cross-lingual Lifelong Learning

    Authors: Meryem M'hamdi, Xiang Ren, Jonathan May

    Abstract: The longstanding goal of multi-lingual learning has been to develop a universal cross-lingual model that can withstand the changes in multi-lingual data distributions. There has been a large amount of work to adapt such multi-lingual models to unseen target languages. However, the majority of work in this direction focuses on the standard one-hop transfer learning pipeline from source to target la…

    Submitted 28 December, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Camera-Ready Version of this paper published at ACL 2023 (https://aclanthology.org/2023.acl-long.217/)

  46. arXiv:2205.00344  [pdf, other]

    cs.CL cs.AI

    Opponent Modeling in Negotiation Dialogues by Related Data Adaptation

    Authors: Kushal Chawla, Gale M. Lucas, Jonathan May, Jonathan Gratch

    Abstract: Opponent modeling is the task of inferring another party's mental state within the context of social interactions. In a multi-issue negotiation, it involves inferring the relative importance that the opponent assigns to each issue under discussion, which is crucial for finding high-value deals. A practical model for this task needs to infer these priorities of the opponent on the fly based on part…

    Submitted 3 May, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

    Comments: Appearing at Findings of NAACL 2022

  47. arXiv:2112.08321  [pdf, other]

    cs.CL

    Know Thy Strengths: Comprehensive Dialogue State Tracking Diagnostics

    Authors: Hyundong Cho, Chinnadhurai Sankar, Christopher Lin, Kaushik Ram Sadagopan, Shahin Shayandeh, Asli Celikyilmaz, Jonathan May, Ahmad Beirami

    Abstract: Recent works that revealed the vulnerability of dialogue state tracking (DST) models to distributional shifts have made holistic comparisons on robustness and qualitative analyses increasingly important for understanding their relative performance. We present our findings from standardized and comprehensive DST diagnoses, which have previously been sparse and uncoordinated, using our toolkit, Chec…

    Submitted 4 November, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022

  48. arXiv:2111.04862  [pdf, other]

    cs.CV cs.AI cs.CL cs.CR

    Explaining Face Presentation Attack Detection Using Natural Language

    Authors: Hengameh Mirzaalian, Mohamed E. Hussein, Leonidas Spinoulas, Jonathan May, Wael Abd-Almageed

    Abstract: A large number of deep neural network based techniques have been developed to address the challenging problem of face presentation attack detection (PAD). Whereas such techniques' focus has been on improving PAD performance in terms of classification accuracy and robustness against unseen attacks and environmental conditions, there exists little attention on the explainability of PAD predictions.…

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: To Appear in the Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition 2021

  49. arXiv:2109.10475  [pdf, other]

    cs.CL cs.AI

    Salience-Aware Event Chain Modeling for Narrative Understanding

    Authors: Xiyang Zhang, Muhao Chen, Jonathan May

    Abstract: Storytelling, whether via fables, news reports, documentaries, or memoirs, can be thought of as the communication of interesting and related events that, taken together, form a concrete process. It is desirable to extract the event chains that represent such processes. However, this extraction remains a challenging problem. We posit that this is due to the nature of the texts from which chains are…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  50. arXiv:2108.11063  [pdf, other]

    cs.CL

    Viola: A Topic Agnostic Generate-and-Rank Dialogue System

    Authors: Hyundong Cho, Basel Shbita, Kartik Shenoy, Shuai Liu, Nikhil Patel, Hitesh Pindikanti, Jennifer Lee, Jonathan May

    Abstract: We present Viola, an open-domain dialogue system for spoken conversation that uses a topic-agnostic dialogue manager based on a simple generate-and-rank approach. Leveraging recent advances of generative dialogue systems powered by large language models, Viola fetches a batch of response candidates from various neural dialogue models trained with different datasets and knowledge-grounding inputs.…

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: Alexa Prize Socialbot Grand Challenge 4 Proceedings, 23 pages