Skip to main content

Showing 1–11 of 11 results for author: Cai, Z G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.23340  [pdf

    cs.CL

    Information Loss in LLMs' Multilingual Translation: The Role of Training Data, Language Proximity, and Language Family

    Authors: Yumeng Lin, Xufeng Duan, David Haslett, Yige Chen, Zhenguang G. Cai

    Abstract: Large language models have achieved impressive progress in multilingual translation, yet they continue to face challenges with certain language pairs-particularly those with limited training data or significant linguistic divergence from English. This study systematically investigates how training data, language proximity, and language family affect information loss in multilingual translation. We… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

  2. arXiv:2505.19548  [pdf, ps, other

    cs.CL cs.AI

    How Syntax Specialization Emerges in Language Models

    Authors: Xufeng Duan, Zhaoqian Yao, Yunhao Zhang, Shaonan Wang, Zhenguang G. Cai

    Abstract: Large language models (LLMs) have been found to develop surprising internal specializations: Individual neurons, attention heads, and circuits become selectively sensitive to syntactic structure, reflecting patterns observed in the human brain. While this specialization is well-documented, how it emerges during training and what influences its development remains largely unknown. In this work, w… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  3. arXiv:2502.01299  [pdf

    q-bio.NC cs.CL

    Probabilistic adaptation of language comprehension for individual speakers: Evidence from neural oscillations

    Authors: Hanlin Wu, Xiaohui Rao, Zhenguang G. Cai

    Abstract: Listeners adapt language comprehension based on their mental representations of speakers, but how these representations are dynamically updated remains unclear. We investigated whether listeners probabilistically adapt their comprehension based on the likelihood of speakers producing stereotype-incongruent utterances. Our findings reveal two potential mechanisms: a speaker-general mechanism that a… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

  4. arXiv:2412.13612  [pdf, ps, other

    cs.CL cs.AI

    Large Language Models for Automated Literature Review: An Evaluation of Reference Generation, Abstract Writing, and Review Composition

    Authors: Xuemei Tang, Xufeng Duan, Zhenguang G. Cai

    Abstract: Large language models (LLMs) have emerged as a potential solution to automate the complex processes involved in writing literature reviews, such as literature collection, organization, and summarization. However, it is yet unclear how good LLMs are at automating comprehensive and reliable literature reviews. This study introduces a framework to automatically evaluate the performance of LLMs in thr… ▽ More

    Submitted 18 June, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: 12 pages, 5 figures, 5 tables

  5. arXiv:2412.07238  [pdf

    cs.CL q-bio.NC

    Speaker effects in spoken language comprehension

    Authors: Hanlin Wu, Zhenguang G. Cai

    Abstract: The identity of a speaker significantly influences spoken language comprehension by affecting both perception and expectation. This review explores speaker effects, focusing on how speaker information impacts language processing. We propose an integrative model featuring the interplay between bottom-up perception-based processes driven by acoustic details and top-down expectation-based processes d… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 44 pages, 1 figure

  6. arXiv:2409.17525  [pdf

    q-bio.NC cs.CL

    When A Man Says He Is Pregnant: ERP Evidence for A Rational Account of Speaker-contextualized Language Comprehension

    Authors: Hanlin Wu, Zhenguang G. Cai

    Abstract: Spoken language is often, if not always, understood in a context formed by the identity of the speaker. For example, we can easily make sense of an utterance such as "I'm going to have a manicure this weekend" or "The first time I got pregnant I had a hard time" when spoken by a woman, but it would be harder to understand when it is spoken by a man. Previous event-related potential (ERP) studies h… ▽ More

    Submitted 25 January, 2025; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: The manuscript is under review

  7. arXiv:2409.15890  [pdf, other

    cs.CL

    HLB: Benchmarking LLMs' Humanlikeness in Language Use

    Authors: Xufeng Duan, Bei Xiao, Xuemei Tang, Zhenguang G. Cai

    Abstract: As synthetic data becomes increasingly prevalent in training language models, particularly through generated dialogue, concerns have emerged that these models may deviate from authentic human language patterns, potentially losing the richness and creativity inherent in human communication. This highlights the critical need to assess the humanlikeness of language models in real-world language use.… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  8. arXiv:2409.15827  [pdf, other

    cs.CL

    Unveiling Language Competence Neurons: A Psycholinguistic Approach to Model Interpretability

    Authors: Xufeng Duan, Xinyu Zhou, Bei Xiao, Zhenguang G. Cai

    Abstract: As large language models (LLMs) advance in their linguistic capacity, understanding how they capture aspects of language competence remains a significant challenge. This study therefore employs psycholinguistic paradigms in English, which are well-suited for probing deeper cognitive aspects of language processing, to explore neuron-level representations in language model across three tasks: sound-… ▽ More

    Submitted 11 December, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

  9. arXiv:2409.12435  [pdf, other

    cs.CL

    Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models

    Authors: Xinyu Zhou, Delong Chen, Samuel Cahyawijaya, Xufeng Duan, Zhenguang G. Cai

    Abstract: We introduce a novel analysis that leverages linguistic minimal pairs to probe the internal linguistic representations of Large Language Models (LLMs). By measuring the similarity between LLM activation differences across minimal pairs, we quantify the and gain insight into the linguistic knowledge captured by LLMs. Our large-scale experiments, spanning 100+ LLMs and 150k minimal pairs in three la… ▽ More

    Submitted 13 December, 2024; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: COLING 2025

  10. arXiv:2406.11116  [pdf

    cs.CL

    Grammaticality Representation in ChatGPT as Compared to Linguists and Laypeople

    Authors: Zhuang Qiu, Xufeng Duan, Zhenguang G. Cai

    Abstract: Large language models (LLMs) have demonstrated exceptional performance across various linguistic tasks. However, it remains uncertain whether LLMs have developed human-like fine-grained grammatical intuition. This preregistered study (https://osf.io/t5nes) presents the first large-scale investigation of ChatGPT's grammatical intuition, building upon a previous study that collected laypeople's gram… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 23 pages

  11. arXiv:2303.08014  [pdf

    cs.CL

    Do large language models resemble humans in language use?

    Authors: Zhenguang G. Cai, Xufeng Duan, David A. Haslett, Shuqi Wang, Martin J. Pickering

    Abstract: Large language models (LLMs) such as ChatGPT and Vicuna have shown remarkable capacities in comprehending and producing language. However, their internal workings remain a black box, and it is unclear whether LLMs and chatbots can develop humanlike characteristics in language use. Cognitive scientists have devised many experiments that probe, and have made great progress in explaining, how people… ▽ More

    Submitted 25 March, 2024; v1 submitted 10 March, 2023; originally announced March 2023.