Computation and Language

Authors and titles for recent submissions

See today's new changes

Total of 352 entries : 1-100 101-200 149-248 201-300 301-352

Showing up to 100 entries per page: fewer | more | all

[149] arXiv:2509.07122 (cross-list from cs.AI) [pdf, html, other]: Title: Neuro-Symbolic Frameworks: Conceptual Characterization and Empirical Comparative Analysis

Sania Sinha, Tanawan Premsri, Danial Kamali, Parisa Kordjamshidi

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Symbolic Computation (cs.SC)
[150] arXiv:2509.07098 (cross-list from cs.AI) [pdf, html, other]: Title: Instruction Agent: Enhancing Agent with Expert Demonstration

Yinheng Li, Hailey Hultquist, Justin Wagle, Kazuhito Koishida

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[151] arXiv:2509.07017 (cross-list from cs.AI) [pdf, html, other]: Title: From Eigenmodes to Proofs: Integrating Graph Spectral Operators with Symbolic Interpretable Reasoning

Andrew Kiruluta, Priscilla Burity

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[152] arXiv:2509.07006 (cross-list from cs.CY) [pdf, html, other]: Title: ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code

Kapil Madan

Comments: 53 pages, 7 figures, 8 tables. Open-source implementation available at: this https URL. Work explores the integration of policy-as-code for AI alignment, with a case study in culturally-nuanced, ethical AI using Dharmic principles

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[153] arXiv:2509.06994 (cross-list from cs.CV) [pdf, html, other]: Title: VLMs-in-the-Wild: Bridging the Gap Between Academic Benchmarks and Enterprise Reality

Srihari Bandraupalli, Anupam Purwar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[154] arXiv:2509.06982 (cross-list from cs.LG) [pdf, html, other]: Title: CARE: Decoding Time Safety Alignment via Rollback and Introspection Intervention

Xiaomeng Hu, Fei Huang, Chenhan Yuan, Junyang Lin, Tsung-Yi Ho

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

[155] arXiv:2509.06952 [pdf, other]: Title: On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts

Linlu Qiu, Cedegao E. Zhang, Joshua B. Tenenbaum, Yoon Kim, Roger P. Levy

Comments: EMNLP 2025 (Main)

Subjects: Computation and Language (cs.CL)
[156] arXiv:2509.06949 [pdf, other]: Title: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Yinjie Wang, Ling Yang, Bowen Li, Ye Tian, Ke Shen, Mengdi Wang

Comments: Code and Models: this https URL

Subjects: Computation and Language (cs.CL)
[157] arXiv:2509.06948 [pdf, html, other]: Title: Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning

Liang Chen, Xueting Han, Li Shen, Jing Bai, Kam-Fai Wong

Subjects: Computation and Language (cs.CL)
[158] arXiv:2509.06902 [pdf, other]: Title: Proof-Carrying Numbers (PCN): A Protocol for Trustworthy Numeric Answers from LLMs via Claim Verification

Aivin V. Solatorio

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Databases (cs.DB); Machine Learning (cs.LG)
[159] arXiv:2509.06888 [pdf, html, other]: Title: mmBERT: A Modern Multilingual Encoder with Annealed Language Learning

Marc Marone, Orion Weller, William Fleshman, Eugene Yang, Dawn Lawrie, Benjamin Van Durme

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[160] arXiv:2509.06883 [pdf, other]: Title: UNH at CheckThat! 2025: Fine-tuning Vs Prompting in Claim Extraction

Joe Wilder, Nikhil Kadapala, Benji Xu, Mohammed Alsaadi, Aiden Parsons, Mitchell Rogers, Palash Agarwal, Adam Hassick, Laura Dietz

Comments: 16 pages,3 tables, CLEF 2025 Working Notes, 9-12 September 2025, Madrid, Spain

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[161] arXiv:2509.06870 [pdf, other]: Title: The Majority is not always right: RL training for solution aggregation

Wenting Zhao, Pranjal Aggarwal, Swarnadeep Saha, Asli Celikyilmaz, Jason Weston, Ilia Kulikov

Subjects: Computation and Language (cs.CL)
[162] arXiv:2509.06838 [pdf, html, other]: Title: EPT Benchmark: Evaluation of Persian Trustworthiness in Large Language Models

Mohammad Reza Mirbagheri, Mohammad Mahdi Mirkamali, Zahra Motoshaker Arani, Ali Javeri, Amir Mahdi Sadeghzadeh, Rasool Jalili

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[163] arXiv:2509.06836 [pdf, html, other]: Title: COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens

Eugene Kwek, Wenpeng Yin

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[164] arXiv:2509.06813 [pdf, html, other]: Title: A Comparative Benchmark of Large Language Models for Labelling Wind Turbine Maintenance Logs

Max Malyi, Jonathan Shek, Alasdair McDonald, Andre Biscaya

Comments: Associated GitHub repository: this https URL

Subjects: Computation and Language (cs.CL)
[165] arXiv:2509.06809 [pdf, html, other]: Title: Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem

Valentin Quesnel, Damien Sileo

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[166] arXiv:2509.06807 [pdf, html, other]: Title: MoGU V2: Toward a Higher Pareto Frontier Between Model Usability and Security

Yanrui Du, Fenglei Fan, Sendong Zhao, Jiawei Cao, Ting Liu, Bing Qin

Subjects: Computation and Language (cs.CL)
[167] arXiv:2509.06806 [pdf, other]: Title: MachineLearningLM: Scaling Many-shot In-context Learning via Continued Pretraining

Haoyu Dong, Pengkun Zhang, Mingzhe Lu, Yanzhen Shen, Guolin Ke

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[168] arXiv:2509.06795 [pdf, html, other]: Title: Anchoring Refusal Direction: Mitigating Safety Risks in Tuning via Projection Constraint

Yanrui Du, Fenglei Fan, Sendong Zhao, Jiawei Cao, Qika Lin, Kai He, Ting Liu, Bing Qin, Mengling Feng

Subjects: Computation and Language (cs.CL)
[169] arXiv:2509.06704 [pdf, html, other]: Title: Will Annotators Disagree? Identifying Subjectivity in Value-Laden Arguments

Amir Homayounirad, Enrico Liscio, Tong Wang, Catholijn M. Jonker, Luciano C.Siebert

Comments: Accepted at Findings of EMNLP 2025

Subjects: Computation and Language (cs.CL)
[170] arXiv:2509.06675 [pdf, html, other]: Title: ParCzech4Speech: A New Speech Corpus Derived from Czech Parliamentary Data

Vladislav Stankov, Matyáš Kopp, Ondřej Bojar

Journal-ref: In: Proceedings of the 28th International Conference on Text, Speech, and Dialogue (TSD 2025), pp.299-308

Subjects: Computation and Language (cs.CL)
[171] arXiv:2509.06652 [pdf, html, other]: Title: IntrEx: A Dataset for Modeling Engagement in Educational Conversations

Xingwei Tan, Mahathi Parvatham, Chiara Gambi, Gabriele Pergola

Comments: EMNLP 2025 Findings camera-ready, 9+7 pages

Subjects: Computation and Language (cs.CL)
[172] arXiv:2509.06650 [pdf, html, other]: Title: Domain-Aware RAG: MoL-Enhanced RL for Efficient Training and Scalable Retrieval

Hao Lin, Peitong Xie, Jingxue Chen, Jie Lin, Qingkun Tang, Qianchun Lu

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[173] arXiv:2509.06637 [pdf, html, other]: Title: Modelling Intertextuality with N-gram Embeddings

Yi Xing

Subjects: Computation and Language (cs.CL)
[174] arXiv:2509.06631 [pdf, html, other]: Title: Guided Decoding and Its Critical Role in Retrieval-Augmented Generation

Özgür Uğur, Musa Yılmaz, Esra Şavirdi, Özay Ezerceli, Mahmut El Huseyni, Selva Taş, Reyhan Bayraktar

Subjects: Computation and Language (cs.CL)
[175] arXiv:2509.06596 [pdf, html, other]: Title: HAVE: Head-Adaptive Gating and ValuE Calibration for Hallucination Mitigation in Large Language Models

Xin Tong, Zhi Lin, Jingya Wang, Bo Jin

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[176] arXiv:2509.06531 [pdf, html, other]: Title: SLiNT: Structure-aware Language Model with Injection and Contrastive Training for Knowledge Graph Completion

Mengxue Yang, Chun Yang, Jiaqi Zhu, Jiafan Li, Jingqi Zhang, Yuyang Li, Ying Li

Comments: Accepted by EMNLP Findings 2025

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[177] arXiv:2509.06524 [pdf, html, other]: Title: LAMDAS: LLM as an Implicit Classifier for Domain-specific Data Selection

Jian Wu, Hang Yu, Bingchang Liu, Wenjie Yang, Peng Di, Jianguo Li, Yue Zhang

Subjects: Computation and Language (cs.CL)
[178] arXiv:2509.06518 [pdf, html, other]: Title: Crown, Frame, Reverse: Layer-Wise Scaling Variants for LLM Pre-Training

Andrei Baroian, Kasper Notebomer

Comments: The reported results are skewed due to a data type mismatch. The dataset was saved with int32, but the data loader interpreted it as uint16. As a result, each 32-bit token was incorrectly split into two 16-bit tokens. Outcome: a consistent artifact where every other token is zero

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[179] arXiv:2509.06501 [pdf, other]: Title: WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Junteng Liu, Yunji Li, Chi Zhang, Jingyang Li, Aili Chen, Ke Ji, Weiyu Cheng, Zijia Wu, Chengyu Du, Qidi Xu, Jiayuan Song, Zhengmao Zhu, Wenhu Chen, Pengyu Zhao, Junxian He

Subjects: Computation and Language (cs.CL)
[180] arXiv:2509.06401 [pdf, html, other]: Title: Do LLMs exhibit the same commonsense capabilities across languages?

Ivan Martínez-Murillo, Elena Lloret, Paloma Moreda, Albert Gatt

Subjects: Computation and Language (cs.CL)
[181] arXiv:2509.06356 [pdf, html, other]: Title: PL-CA: A Parametric Legal Case Augmentation Framework

Ao Chang, Yubo Chen, Jun Zhao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[182] arXiv:2509.06350 [pdf, html, other]: Title: Mask-GCG: Are All Tokens in Adversarial Suffixes Necessary for Jailbreak Attacks?

Junjie Mu, Zonghao Ying, Zhekui Fan, Zonglei Jing, Yaoyuan Zhang, Zhengmin Yu, Wenxin Zhang, Quanchen Zou, Xiangzheng Zhang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[183] arXiv:2509.06277 [pdf, html, other]: Title: No Encore: Unlearning as Opt-Out in Music Generation

Jinju Kim, Taehan Kim, Abdul Waheed, Rita Singh

Comments: Work in progress. 7 pages

Subjects: Computation and Language (cs.CL)
[184] arXiv:2509.06200 [pdf, html, other]: Title: MSLEF: Multi-Segment LLM Ensemble Finetuning in Recruitment

Omar Walid, Mohamed T. Younes, Khaled Shaban, Mai Hassan, Ali Hamdi

Comments: Accepted in AICCSA 2025

Subjects: Computation and Language (cs.CL)
[185] arXiv:2509.06196 [pdf, html, other]: Title: Augmented Fine-Tuned LLMs for Enhanced Recruitment Automation

Mohamed T. Younes, Omar Walid, Khaled Shaban, Ali Hamdi, Mai Hassan

Comments: Accepted in AICCSA 2025

Subjects: Computation and Language (cs.CL)
[186] arXiv:2509.06184 [pdf, html, other]: Title: Understanding the Influence of Synthetic Data for Text Embedders

Jacob Mitchell Springer, Vaibhav Adlakha, Siva Reddy, Aditi Raghunathan, Marius Mosbach

Comments: ACL Findings 2025

Subjects: Computation and Language (cs.CL)
[187] arXiv:2509.06164 [pdf, other]: Title: Benchmarking Gender and Political Bias in Large Language Models

Jinrui Yang, Xudong Han, Timothy Baldwin

Comments: The 8th International Conference on Natural Language and Speech Processing (Oral)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[188] arXiv:2509.06100 [pdf, html, other]: Title: Orthogonal Low-rank Adaptation in Lie Groups for Continual Learning of Large Language Models

Kefan Cao, Shuaicheng Wu

Comments: 13 pages, 3 figures

Subjects: Computation and Language (cs.CL)
[189] arXiv:2509.06079 [pdf, html, other]: Title: Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge

Hao Liang, Ruitao Wu, Bohan Zeng, Junbo Niu, Wentao Zhang, Bin Dong

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[190] arXiv:2509.06074 [pdf, html, other]: Title: Multimodal Fine-grained Context Interaction Graph Modeling for Conversational Speech Synthesis

Zhenqi Jia, Rui Liu, Berrak Sisman, Haizhou Li

Comments: Accepted by EMNLP 2025

Subjects: Computation and Language (cs.CL)
[191] arXiv:2509.06065 [pdf, html, other]: Title: KatotohananQA: Evaluating Truthfulness of Large Language Models in Filipino

Lorenzo Alfred Nery, Ronald Dawson Catignas, Thomas James Tiam-Lee

Comments: 14 pages, 1 figure, 9 tables, 1 listing. To appear in Proceedings of NLPIR 2025

Subjects: Computation and Language (cs.CL)
[192] arXiv:2509.05915 [pdf, other]: Title: Accelerating Large Language Model Inference via Early-Exiting Algorithms

Sangmin Bae

Comments: PhD Dissertation

Subjects: Computation and Language (cs.CL)
[193] arXiv:2509.05908 [pdf, html, other]: Title: Enhancing the Robustness of Contextual ASR to Varying Biasing Information Volumes Through Purified Semantic Correlation Joint Modeling

Yue Gu, Zhihao Du, Ying Shi, Shiliang Zhang, Qian Chen, Jiqing Han

Comments: Accepted by IEEE Transactions on Audio, Speech and Language Processing, 2025 (this https URL). DOI: https://doi.org/10.1109/TASLPRO.2025.3606198

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[194] arXiv:2509.05882 [pdf, html, other]: Title: Let's Roleplay: Examining LLM Alignment in Collaborative Dialogues

Abhijnan Nath, Carine Graff, Nikhil Krishnaswamy

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[195] arXiv:2509.05878 [pdf, html, other]: Title: MedFactEval and MedAgentBrief: A Framework and Workflow for Generating and Evaluating Factual Clinical Summaries

François Grolleau, Emily Alsentzer, Timothy Keyes, Philip Chung, Akshay Swaminathan, Asad Aali, Jason Hom, Tridu Huynh, Thomas Lew, April S. Liang, Weihan Chu, Natasha Z. Steele, Christina F. Lin, Jingkun Yang, Kameron C. Black, Stephen P. Ma, Fateme N. Haredasht, Nigam H. Shah, Kevin Schulman, Jonathan H. Chen

Subjects: Computation and Language (cs.CL)
[196] arXiv:2509.05867 [pdf, html, other]: Title: ZhiFangDanTai: Fine-tuning Graph-based Retrieval-Augmented Generation Model for Traditional Chinese Medicine Formula

ZiXuan Zhang, Bowen Hao, Yingjie Li, Hongzhi Yin

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[197] arXiv:2509.05863 [pdf, html, other]: Title: LatinX: Aligning a Multilingual TTS Model with Direct Preference Optimization

Luis Felipe Chary, Miguel Arjona Ramirez

Subjects: Computation and Language (cs.CL)
[198] arXiv:2509.05741 [pdf, html, other]: Title: Enhancing Factual Accuracy and Citation Generation in LLMs via Multi-Stage Self-Verification

Fernando Gabriela García, Qiyang Shi, Zilin Feng

Subjects: Computation and Language (cs.CL)
[199] arXiv:2509.05729 [pdf, html, other]: Title: QCSE: A Pretrained Quantum Context-Sensitive Word Embedding for Natural Language Processing

Charles M. Varmantchaonala, Niclas GÖtting, Nils-Erik SchÜtte, Jean Louis E. K. Fendji, Christopher Gies

Subjects: Computation and Language (cs.CL)
[200] arXiv:2509.05719 [pdf, html, other]: Title: Exploring Subjective Tasks in Farsi: A Survey Analysis and Evaluation of Language Models

Donya Rooein, Flor Miriam Plaza-del-Arco, Debora Nozza, Dirk Hovy

Subjects: Computation and Language (cs.CL)
[201] arXiv:2509.05716 [pdf, html, other]: Title: A Survey of the State-of-the-Art in Conversational Question Answering Systems

Manoj Madushanka Perera, Adnan Mahmood, Kasun Eranda Wijethilake, Fahmida Islam, Maryam Tahermazandarani, Quan Z. Sheng

Comments: 42 pages, 12 figures, 4 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[202] arXiv:2509.05691 [pdf, html, other]: Title: Revealing the Numeracy Gap: An Empirical Investigation of Text Embedding Models

Ningyuan Deng, Hanyu Duan, Yixuan Tang, Yi Yang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[203] arXiv:2509.05668 [pdf, html, other]: Title: Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian

Michael Hoffmann, Jophin John, Stefan Schweter, Gokul Ramakrishnan, Hoi-Fong Mak, Alice Zhang, Dmitry Gaynullin, Nicolay J. Hammer

Comments: Michael Hoffmann and Jophin John contributed equally to this work

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[204] arXiv:2509.05660 [pdf, html, other]: Title: Cross-Question Method Reuse in Large Language Models: From Word-Level Prediction to Rational Logical-Layer Reasoning

Hong Su

Subjects: Computation and Language (cs.CL)
[205] arXiv:2509.05657 [pdf, html, other]: Title: LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding

Yuxuan Hu, Jihao Liu, Ke Wang, Jinliang Zhen, Weikang Shi, Manyuan Zhang, Qi Dou, Rui Liu, Aojun Zhou, Hongsheng Li

Comments: EMNLP 2025 Main

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[206] arXiv:2509.05635 [pdf, html, other]: Title: Few-Shot Query Intent Detection via Relation-Aware Prompt Learning

Liang Zhang, Yuan Li, Shijie Zhang, Zheng Zhang, Xitong Li

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[207] arXiv:2509.05617 [pdf, other]: Title: From Joy to Fear: A Benchmark of Emotion Estimation in Pop Song Lyrics

Shay Dahary, Avi Edana, Alexander Apartsin, Yehudit Aperstein

Comments: 5 pages, 2 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[208] arXiv:2509.05609 [pdf, html, other]: Title: New Insights into Optimal Alignment of Acoustic and Linguistic Representations for Knowledge Transfer in ASR

Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[209] arXiv:2509.05607 [pdf, html, other]: Title: Beyond Keywords: Driving Generative Search Engine Optimization with Content-Centric Agents

Qiyuan Chen, Jiahe Chen, Hongsen Huang, Qian Shao, Jintai Chen, Renjie Hua, Hongxia Xu, Ruijia Wu, Ren Chuan, Jian Wu

Comments: Technical Report

Subjects: Computation and Language (cs.CL)
[210] arXiv:2509.05605 [pdf, html, other]: Title: Icon$^{2}$: Aligning Large Language Models Using Self-Synthetic Preference Data via Inherent Regulation

Qiyuan Chen, Hongsen Huang, Qian Shao, Jiahe Chen, Jintai Chen, Hongxia Xu, Renjie Hua, Ren Chuan, Jian Wu

Comments: EMNLP 2025 Main

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[211] arXiv:2509.05602 [pdf, html, other]: Title: Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation

Hongyan Xie, Yitong Yao, Yikun Ban, Zixuan Huang, Deqing Wang, Zhenhe Wu, Haoxiang Su, Chao Wang, Shuangyong Song

Comments: PrePrint

Subjects: Computation and Language (cs.CL)
[212] arXiv:2509.05566 [pdf, html, other]: Title: Ad hoc conventions generalize to new referents

Anya Ji, Claire Augusta Bergey, Ron Eliav, Yoav Artzi, Robert D. Hawkins

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[213] arXiv:2509.05553 [pdf, html, other]: Title: Using Contrastive Learning to Improve Two-Way Reasoning in Large Language Models: The Obfuscation Task as a Case Study

Serge Lionel Nikiema, Jordan Samhi, Micheline Bénédicte Moumoula, Albérick Euraste Djiré, Abdoul Kader Kaboré, Jacques Klein, Tegawendé F. Bissyandé

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[214] arXiv:2509.05505 [pdf, html, other]: Title: Biomedical Literature Q&A System Using Retrieval-Augmented Generation (RAG)

Mansi Garg, Lee-Chi Wang, Bhavesh Ghanchi, Sanjana Dumpala, Shreyash Kakde, Yen Chih Chen

Comments: 10 pages, 6 figures, 3 tables

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[215] arXiv:2509.05486 [pdf, html, other]: Title: The Token Tax: Systematic Bias in Multilingual Tokenization

Jessica M. Lundin, Ada Zhang, Nihal Karim, Hamza Louzan, Victor Wei, David Adelani, Cody Carroll

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[216] arXiv:2509.05484 [pdf, html, other]: Title: From Staff Messages to Actionable Insights: A Multi-Stage LLM Classification Framework for Healthcare Analytics

Hajar Sakai, Yi-En Tseng, Mohammadsadegh Mikaeili, Joshua Bosire, Franziska Jovin

Subjects: Computation and Language (cs.CL)
[217] arXiv:2509.05440 [pdf, html, other]: Title: Direct-Scoring NLG Evaluators Can Use Pairwise Comparisons Too

Logan Lawrence, Ashton Williamson, Alexander Shelton

Comments: 12 pages, 18 tables, 1 figure

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[218] arXiv:2509.05425 [pdf, html, other]: Title: No Translation Needed: Forecasting Quality from Fertility and Metadata

Jessica M. Lundin, Ada Zhang, David Adelani, Cody Carroll

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[219] arXiv:2509.05396 [pdf, html, other]: Title: Talk Isn't Always Cheap: Understanding Failure Modes in Multi-Agent Debate

Andrea Wynn, Harsh Satija, Gillian Hadfield

Comments: ICML MAS Workshop 2025

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[220] arXiv:2509.05385 [pdf, html, other]: Title: A Lightweight Framework for Trigger-Guided LoRA-Based Self-Adaptation in LLMs

Jiacheng Wei, Faguo Wu, Xiao Zhang

Comments: 11 pages, 7 figures, conference

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[221] arXiv:2509.05360 [pdf, html, other]: Title: Beyond ROUGE: N-Gram Subspace Features for LLM Hallucination Detection

Jerry Li, Evangelos Papalexakis

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[222] arXiv:2509.05359 [pdf, html, other]: Title: An Empirical Analysis of Discrete Unit Representations in Speech Language Modeling Pre-training

Yanis Labrak, Richard Dufour, Mickaël Rouvier

Comments: Published in International Conference on Text, Speech, and Dialogue, 13-24

Journal-ref: International Conference on Text, Speech, and Dialogue 2025

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[223] arXiv:2509.06945 (cross-list from cs.CV) [pdf, html, other]: Title: Interleaving Reasoning for Better Text-to-Image Generation

Wenxuan Huang, Shuang Chen, Zheyong Xie, Shaosheng Cao, Shixiang Tang, Yufan Shen, Qingyu Yin, Wenbo Hu, Xiaoman Wang, Yuntian Tang, Junbo Qiao, Yue Guo, Yao Hu, Zhenfei Yin, Philip Torr, Yu Cheng, Wanli Ouyang, Shaohui Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[224] arXiv:2509.06941 (cross-list from cs.LG) [pdf, html, other]: Title: Outcome-based Exploration for LLM Reasoning

Yuda Song, Julia Kempe, Remi Munos

Comments: 26 pages, 11 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[225] arXiv:2509.06920 (cross-list from cs.CR) [pdf, html, other]: Title: An Ethically Grounded LLM-Based Approach to Insider Threat Synthesis and Detection

Haywood Gelman, John D. Hastings, David Kenley

Comments: 6 pages, 5 figures, 5 tables

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[226] arXiv:2509.06917 (cross-list from cs.AI) [pdf, html, other]: Title: Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents

Jiacheng Miao, Joe R. Davis, Jonathan K. Pritchard, James Zou

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[227] arXiv:2509.06861 (cross-list from cs.AI) [pdf, html, other]: Title: Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet

James Xu Zhao, Bryan Hooi, See-Kiong Ng

Comments: 20 pages, 4 figures, 6 tables

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[228] arXiv:2509.06822 (cross-list from cs.AI) [pdf, other]: Title: RAFFLES: Reasoning-based Attribution of Faults for LLM Systems

Chenyang Zhu, Spencer Hong, Jingyu Wu, Kushal Chawla, Charlotte Tang, Youbing Yin, Nathan Wolfe, Erin Babinsky, Daben Liu

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[229] arXiv:2509.06736 (cross-list from cs.AI) [pdf, html, other]: Title: VehicleWorld: A Highly Integrated Multi-Device Environment for Intelligent Vehicle Interaction

Jie Yang, Jiajun Chen, Zhangyue Yin, Shuo Chen, Yuxin Wang, Yiran Guo, Yuan Li, Yining Zheng, Xuanjing Huang, Xipeng Qiu

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)
[230] arXiv:2509.06733 (cross-list from cs.AI) [pdf, html, other]: Title: Reinforcement Learning Foundations for Deep Research Systems: A Survey

Wenjun Li, Zhi Chen, Jingru Lin, Hannan Cao, Wei Han, Sheng Liang, Zhi Zhang, Kuicai Dong, Dexun Li, Chen Zhang, Yong Liu

Comments: 38 pages, first version

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[231] arXiv:2509.06415 (cross-list from cs.CV) [pdf, html, other]: Title: Index-Preserving Lightweight Token Pruning for Efficient Document Understanding in Vision-Language Models

Jaemin Son, Sujin Choi, Inyong Yun

Comments: Submitted to ICASSP 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[232] arXiv:2509.06283 (cross-list from cs.AI) [pdf, html, other]: Title: SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents

Xuan-Phi Nguyen, Shrey Pandit, Revanth Gangi Reddy, Austin Xu, Silvio Savarese, Caiming Xiong, Shafiq Joty

Comments: Technical Report

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[233] arXiv:2509.06221 (cross-list from eess.AS) [pdf, html, other]: Title: Beamforming-LLM: What, Where and When Did I Miss?

Vishal Choudhari

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[234] arXiv:2509.06195 (cross-list from cs.IR) [pdf, html, other]: Title: Language Bias in Information Retrieval: The Nature of the Beast and Mitigation Methods

Jinrui Yang, Fan Jiang, Timothy Baldwin

Comments: Accepted at EMNLP MRL 2024

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[235] arXiv:2509.06174 (cross-list from cs.AI) [pdf, other]: Title: From Long to Short: LLMs Excel at Trimming Own Reasoning Chains

Wei Han, Geng Zhan, Sicheng Yu, Chenyu Wang, Bryan Hooi

Comments: 21 pages, 5 figures, 7 tables

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[236] arXiv:2509.06160 (cross-list from cs.AI) [pdf, html, other]: Title: Reverse-Engineered Reasoning for Open-Ended Generation

Haozhe Wang, Haoran Que, Qixin Xu, Minghao Liu, Wangchunshu Zhou, Jiazhan Feng, Wanjun Zhong, Wei Ye, Tong Yang, Wenhao Huang, Ge Zhang, Fangzhen Lin

Comments: Preprint

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[237] arXiv:2509.06093 (cross-list from cs.DB) [pdf, other]: Title: Language Native Lightly Structured Databases for Large Language Model Driven Composite Materials Research

Yuze Liu, Zhaoyuan Zhang, Xiangsheng Zeng, Yihe Zhang, Leping Yu, Lejia Wang, Xi Yu

Subjects: Databases (cs.DB); Materials Science (cond-mat.mtrl-sci); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[238] arXiv:2509.05983 (cross-list from cs.SD) [pdf, html, other]: Title: TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition

Minh N. H. Nguyen, Anh Nguyen Tran, Dung Truong Dinh, Nam Van Vo

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[239] arXiv:2509.05978 (cross-list from eess.IV) [pdf, html, other]: Title: Imagining Alternatives: Towards High-Resolution 3D Counterfactual Medical Image Generation via Language Guidance

Mohamed Mohamed, Brennan Nichyporuk, Douglas L. Arnold, Tal Arbel

Subjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[240] arXiv:2509.05634 (cross-list from eess.AS) [pdf, html, other]: Title: On the Contribution of Lexical Features to Speech Emotion Recognition

David Combei

Comments: Accepted to 13th Conference on Speech Technology and Human-Computer Dialogue

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[241] arXiv:2509.05608 (cross-list from cs.CR) [pdf, html, other]: Title: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Fingerprints

Waris Gill, Natalie Isak, Matthew Dressman

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[242] arXiv:2509.05390 (cross-list from cs.CY) [pdf, other]: Title: Authorship Without Writing: Large Language Models and the Senior Author Analogy

Clint Hurshman, Sebastian Porsdam Mann, Julian Savulescu, Brian D. Earp

Comments: 28 pages, 0 figures

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[243] arXiv:2509.05331 (cross-list from cs.CR) [pdf, html, other]: Title: ForensicsData: A Digital Forensics Dataset for Large Language Models

Youssef Chakir, Iyad Lahsen-Cherif

Comments: Accepted to WiMob 2025 (21st International Conference on Wireless and Mobile Computing, Networking and Communications), Marrakesh, Morocco, Oct 20-22, 2025. 6 pages, 5 figures, 5 tables. IEEEtran conference format

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[244] arXiv:2509.05309 (cross-list from q-bio.QM) [pdf, html, other]: Title: ProtSAE: Disentangling and Interpreting Protein Language Models via Semantically-Guided Sparse Autoencoders

Xiangyu Liu, Haodi Lei, Yi Liu, Yang Liu, Wei Hu

Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[245] arXiv:2509.03736 (cross-list from cs.AI) [pdf, html, other]: Title: Are LLM Agents Behaviorally Coherent? Latent Profiles for Social Simulation

James Mooney, Josef Woldense, Zheng Robert Jia, Shirley Anugrah Hayati, My Ha Nguyen, Vipul Raheja, Dongyeop Kang

Comments: 25 pages, 9 figures, 7 tables

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)

[246] arXiv:2509.05291 [pdf, html, other]: Title: Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining

Deniz Bayazit, Aaron Mueller, Antoine Bosselut

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[247] arXiv:2509.05282 [pdf, html, other]: Title: Elucidating the Design Space of Decay in Linear Attention

Zhen Qin, Xuyang Shen, Yiran Zhong

Comments: Accepted to COLM 2025. Yiran Zhong is the corresponding author. Code is available at this https URL

Subjects: Computation and Language (cs.CL)
[248] arXiv:2509.05254 [pdf, html, other]: Title: Uniform Information Density and Syntactic Reduction: Revisiting $\textit{that}$-Mentioning in English Complement Clauses

Hailin Hao, Elsi Kaiser

Subjects: Computation and Language (cs.CL)

Total of 352 entries : 1-100 101-200 149-248 201-300 301-352

Showing up to 100 entries per page: fewer | more | all

Computation and Language

Authors and titles for recent submissions

Wed, 10 Sep 2025 (continued, showing last 6 of 57 entries )

Tue, 9 Sep 2025 (showing 91 of 91 entries )

Mon, 8 Sep 2025 (showing first 3 of 107 entries )