Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CL

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computation and Language

Authors and titles for recent submissions

  • Fri, 12 Sep 2025
  • Thu, 11 Sep 2025
  • Wed, 10 Sep 2025
  • Tue, 9 Sep 2025
  • Mon, 8 Sep 2025

See today's new changes

Total of 352 entries : 72-171 101-200 201-300 301-352
Showing up to 100 entries per page: fewer | more | all

Thu, 11 Sep 2025 (continued, showing last 26 of 42 entries )

[72] arXiv:2509.08438 [pdf, html, other]
Title: CommonVoice-SpeechRE and RPG-MoGe: Advancing Speech Relation Extraction with a New Dataset and Multi-Order Generative Framework
Jinzhong Ning, Paerhati Tulajiang, Yingying Le, Yijia Zhang, Yuanyuan Sun, Hongfei Lin, Haifeng Liu
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[73] arXiv:2509.08381 [pdf, other]
Title: Low-Resource Fine-Tuning for Multi-Task Structured Information Extraction with a Billion-Parameter Instruction-Tuned Model
Yu Cheng Chih, Yong Hao Hou
Comments: 13 pages, 8 figures, includes experiments on JSON extraction, knowledge graph extraction, and NER
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[74] arXiv:2509.08358 [pdf, html, other]
Title: <think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs
Sergey Pletenev, Daniil Moskovskiy, Alexander Panchenko
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[75] arXiv:2509.08355 [pdf, html, other]
Title: Automatic Detection of Inauthentic Templated Responses in English Language Assessments
Yashad Samant, Lee Becker, Scott Hellman, Bradley Behan, Sarah Hughes, Joshua Southerland
Comments: Accepted to National Council on Measurement in Education (NCME) 2025 Annual Meeting
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[76] arXiv:2509.08345 [pdf, html, other]
Title: Toward Subtrait-Level Model Explainability in Automated Writing Evaluation
Alejandro Andrade-Lotero, Lee Becker, Joshua Southerland, Scott Hellman
Comments: Accepted to National Council on Measurement in Education (NCME) 2025 Annual Meeting
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[77] arXiv:2509.08304 [pdf, other]
Title: Towards Knowledge-Aware Document Systems: Modeling Semantic Coverage Relations via Answerability Detection
Yehudit Aperstein, Alon Gottlib, Gal Benita, Alexander Apartsin
Comments: 27 pages, 1 figure
Subjects: Computation and Language (cs.CL)
[78] arXiv:2509.08217 [pdf, html, other]
Title: Balancing Quality and Variation: Spam Filtering Distorts Data Label Distributions
Eve Fleisig, Matthias Orlikowski, Philipp Cimiano, Dan Klein
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[79] arXiv:2509.08150 [pdf, html, other]
Title: Verbalized Algorithms
Supriya Lall, Christian Farrell, Hari Pathanjaly, Marko Pavic, Sarvesh Chezhian, Masataro Asai
Comments: Submitted to NeurIPS 2025 Workshop on Efficient Reasoning
Subjects: Computation and Language (cs.CL)
[80] arXiv:2509.08146 [pdf, html, other]
Title: Bias after Prompting: Persistent Discrimination in Large Language Models
Nivedha Sivakumar, Natalie Mackraz, Samira Khorshidi, Krishna Patel, Barry-John Theobald, Luca Zappella, Nicholas Apostoloff
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[81] arXiv:2509.08105 [pdf, html, other]
Title: MERLIN: Multi-Stage Curriculum Alignment for Multilingual Encoder and LLM Fusion
Kosei Uemura, David Guzmán, Quang Phuoc Nguyen, Jesujoba Oluwadara Alabi, En-shiun Annie Lee, David Ifeoluwa Adelani
Comments: under submission
Subjects: Computation and Language (cs.CL)
[82] arXiv:2509.08093 [pdf, html, other]
Title: Culturally transmitted color categories in LLMs reflect a learning bias toward efficient compression
Nathaniel Imel, Noga Zaslavsky
Subjects: Computation and Language (cs.CL)
[83] arXiv:2509.08075 [pdf, html, other]
Title: No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models
Flor Miriam Plaza-del-Arco, Paul Röttger, Nino Scherrer, Emanuele Borgonovo, Elmar Plischke, Dirk Hovy
Subjects: Computation and Language (cs.CL)
[84] arXiv:2509.08032 [pdf, html, other]
Title: SciGPT: A Large Language Model for Scientific Literature Understanding and Knowledge Discovery
Fengyu She, Nan Wang, Hongfei Wu, Ziyi Wan, Jingmian Wang, Chang Wang
Subjects: Computation and Language (cs.CL)
[85] arXiv:2509.08025 [pdf, html, other]
Title: NOWJ@COLIEE 2025: A Multi-stage Framework Integrating Embedding Models and Large Language Models for Legal Retrieval and Entailment
Hoang-Trung Nguyen, Tan-Minh Nguyen, Xuan-Bach Le, Tuan-Kiet Le, Khanh-Huyen Nguyen, Ha-Thanh Nguyen, Thi-Hai-Yen Vuong, Le-Minh Nguyen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[86] arXiv:2509.08022 [pdf, html, other]
Title: MVPBench: A Benchmark and Fine-Tuning Framework for Aligning Large Language Models with Diverse Human Values
Yao Liang, Dongcheng Zhao, Feifei Zhao, Guobin Shen, Yuwei Wang, Dongqi Liang, Yi Zeng
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[87] arXiv:2509.08000 [pdf, html, other]
Title: AntiDote: Bi-level Adversarial Training for Tamper-Resistant LLMs
Debdeep Sanyal, Manodeep Ray, Murari Mandal
Comments: 19 pages
Subjects: Computation and Language (cs.CL)
[88] arXiv:2509.07998 [pdf, html, other]
Title: Bilingual Word Level Language Identification for Omotic Languages
Mesay Gemeda Yigezu, Girma Yohannis Bade, Atnafu Lambebo Tonja, Olga Kolesnikova, Grigori Sidorov, Alexander Gelbukh
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[89] arXiv:2509.08814 (cross-list from cs.LG) [pdf, html, other]
Title: Merge-of-Thought Distillation
Zhanming Shen, Zeyu Qin, Zenan Huang, Hao Chen, Jiaqi Hu, Yihong Zhuang, Guoshan Lu, Gang Chen, Junbo Zhao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[90] arXiv:2509.08803 (cross-list from cs.SI) [pdf, html, other]
Title: Scaling Truth: The Confidence Paradox in AI Fact-Checking
Ihsan A. Qazi, Zohaib Khan, Abdullah Ghani, Agha A. Raza, Zafar A. Qazi, Wassay Sajjad, Ayesha Ali, Asher Javaid, Muhammad Abdullah Sohail, Abdul H. Azeemi
Comments: 65 pages, 26 figures, 6 tables
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[91] arXiv:2509.08777 (cross-list from cs.CV) [pdf, html, other]
Title: Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles
Eric Slyman, Mehrab Tanjim, Kushal Kafle, Stefan Lee
Comments: 17 pages, 8 figures, Accepted at ICCV 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[92] arXiv:2509.08755 (cross-list from cs.LG) [pdf, other]
Title: AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
Zhiheng Xi, Jixuan Huang, Chenyang Liao, Baodai Huang, Honglin Guo, Jiaqi Liu, Rui Zheng, Junjie Ye, Jiazheng Zhang, Wenxiang Chen, Wei He, Yiwen Ding, Guanyu Li, Zehui Chen, Zhengyin Du, Xuesong Yao, Yufei Xu, Jiecao Chen, Tao Gui, Zuxuan Wu, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang
Comments: preprint, 39 pages, 16 figures. Project: this https URL. Framework and Code: this https URL, this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[93] arXiv:2509.08653 (cross-list from cs.LG) [pdf, html, other]
Title: Generative Data Refinement: Just Ask for Better Data
Minqi Jiang, João G. M. Araújo, Will Ellsworth, Sian Gooding, Edward Grefenstette
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[94] arXiv:2509.08494 (cross-list from cs.CY) [pdf, html, other]
Title: HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants
Benjamin Sturgeon, Daniel Samuelson, Jacob Haimes, Jacy Reese Anthis
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[95] arXiv:2509.08315 (cross-list from cs.LG) [pdf, html, other]
Title: EvolKV: Evolutionary KV Cache Compression for LLM Inference
Bohan Yu, Yekun Chai
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[96] arXiv:2509.08182 (cross-list from cs.PL) [pdf, html, other]
Title: XML Prompting as Grammar-Constrained Interaction: Fixed-Point Semantics, Convergence Guarantees, and Human-AI Protocols
Faruk Alpay, Taylan Alpay
Comments: 7 pages, multiple XML prompts
Subjects: Programming Languages (cs.PL); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[97] arXiv:2509.08010 (cross-list from cs.CY) [pdf, html, other]
Title: Measuring and mitigating overreliance is necessary for building human-compatible AI
Lujain Ibrahim, Katherine M. Collins, Sunnie S. Y. Kim, Anka Reuel, Max Lamparth, Kevin Feng, Lama Ahmad, Prajna Soni, Alia El Kattan, Merlin Stein, Siddharth Swaroop, Ilia Sucholutsky, Andrew Strait, Q. Vera Liao, Umang Bhatt
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)

Wed, 10 Sep 2025 (showing 57 of 57 entries )

[98] arXiv:2509.07980 [pdf, other]
Title: Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Tong Zheng, Hongming Zhang, Wenhao Yu, Xiaoyang Wang, Xinyu Yang, Runpeng Dai, Rui Liu, Huiwen Bao, Chengsong Huang, Heng Huang, Dong Yu
Comments: Project website: this https URL
Subjects: Computation and Language (cs.CL)
[99] arXiv:2509.07968 [pdf, html, other]
Title: SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge
Lukas Haas, Gal Yona, Giovanni D'Antonio, Sasha Goldshtein, Dipanjan Das
Subjects: Computation and Language (cs.CL)
[100] arXiv:2509.07925 [pdf, html, other]
Title: GENUINE: Graph Enhanced Multi-level Uncertainty Estimation for Large Language Models
Tuo Wang, Adithya Kulkarni, Tyler Cody, Peter A. Beling, Yujun Yan, Dawei Zhou
Comments: Accepted by EMNLP 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[101] arXiv:2509.07908 [pdf, html, other]
Title: Biased Tales: Cultural and Topic Bias in Generating Children's Stories
Donya Rooein, Vilém Zouhar, Debora Nozza, Dirk Hovy
Subjects: Computation and Language (cs.CL)
[102] arXiv:2509.07889 [pdf, html, other]
Title: From Detection to Mitigation: Addressing Gender Bias in Chinese Texts via Efficient Tuning and Voting-Based Rebalancing
Chengyan Wu, Yiqiang Cai, Yufei Cheng, Yun Xue
Comments: NLPCC 2025
Subjects: Computation and Language (cs.CL)
[103] arXiv:2509.07869 [pdf, html, other]
Title: Are Humans as Brittle as Large Language Models?
Jiahui Li, Sean Papay, Roman Klinger
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[104] arXiv:2509.07829 [pdf, html, other]
Title: Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost
Mihai Nadas, Laura Diosan, Andreea Tomescu, Andrei Piscoran
Comments: 25 pages, 8 figures, includes datasets and models released on Hugging Face
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[105] arXiv:2509.07817 [pdf, other]
Title: Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems
Xiaolin Chen, Xuemeng Song, Haokun Wen, Weili Guan, Xiangyu Zhao, Liqiang Nie
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[106] arXiv:2509.07801 [pdf, html, other]
Title: SciNLP: A Domain-Specific Benchmark for Full-Text Scientific Entity and Relation Extraction in NLP
Decheng Duan, Yingyi Zhang, Jitong Peng, Chengzhi Zhang
Comments: EMNLP 2025 Main
Subjects: Computation and Language (cs.CL); Digital Libraries (cs.DL); Information Retrieval (cs.IR)
[107] arXiv:2509.07768 [pdf, html, other]
Title: Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning
Michele Joshua Maggini, Dhia Merzougui, Rabiraj Bandyopadhyay, Gaël Dias, Fabrice Maurel, Pablo Gamallo
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[108] arXiv:2509.07755 [pdf, html, other]
Title: Factuality Beyond Coherence: Evaluating LLM Watermarking Methods for Medical Texts
Rochana Prih Hastuti, Rian Adam Rajagede, Mansour Al Ghanim, Mengxin Zheng, Qian Lou
Comments: Accepted at EMNLP 2025 Findings
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[109] arXiv:2509.07730 [pdf, html, other]
Title: M-BRe: Discovering Training Samples for Relation Extraction from Unlabeled Texts with Large Language Models
Zexuan Li, Hongliang Dai, Piji Li
Comments: Accepted by EMNLP2025 Main Conference
Subjects: Computation and Language (cs.CL)
[110] arXiv:2509.07666 [pdf, html, other]
Title: MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval
Xixi Wu, Yanchao Tan, Nan Hou, Ruiyang Zhang, Hong Cheng
Comments: EMNLP Main 2025
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[111] arXiv:2509.07622 [pdf, html, other]
Title: MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs
Libo Ren, Yee Man Ng, Lifeng Han
Comments: system paper at CLEF 2025
Subjects: Computation and Language (cs.CL)
[112] arXiv:2509.07588 [pdf, html, other]
Title: BALI: Enhancing Biomedical Language Representations through Knowledge Graph and Language Model Alignment
Andrey Sakhovskiy, Elena Tutubalina
Comments: 9 pages, 1 figure, published in "The 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025)"
Journal-ref: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (2025). Association for Computing Machinery, 1152-1164
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[113] arXiv:2509.07555 [pdf, html, other]
Title: Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition
Yi Liu, Xiangrong Zhu, Xiangyu Liu, Wei Wei, Wei Hu
Comments: Accepted in EMNLP Findings 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[114] arXiv:2509.07553 [pdf, html, other]
Title: VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents
Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang, Zhuosheng Zhang
Subjects: Computation and Language (cs.CL)
[115] arXiv:2509.07512 [pdf, html, other]
Title: ALLabel: Three-stage Active Learning for LLM-based Entity Recognition using Demonstration Retrieval
Zihan Chen, Lei Shi, Weize Wu, Qiji Zhou, Yue Zhang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[116] arXiv:2509.07475 [pdf, html, other]
Title: HALT-RAG: A Task-Adaptable Framework for Hallucination Detection with Calibrated NLI Ensembles and Abstention
Saumya Goswami, Siddharth Kurra
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117] arXiv:2509.07471 [pdf, html, other]
Title: From Scarcity to Efficiency: Investigating the Effects of Data Augmentation on African Machine Translation
Mardiyyah Oduwole, Oluwatosin Olajide, Jamiu Suleiman, Faith Hunja, Busayo Awobade, Fatimo Adebanjo, Comfort Akanni, Chinonyelum Igwe, Peace Ododo, Promise Omoigui, Steven Kolawole, Abraham Owodunni
Comments: 8 pages, 3 tables. Exploratory work on Data Augmentation for African Machine Translation
Subjects: Computation and Language (cs.CL)
[118] arXiv:2509.07462 [pdf, other]
Title: Understanding Stigmatizing Language Lexicons: A Comparative Analysis in Clinical Contexts
Yiliang Zhou, Di Hu, Tianchu Lyu, Jasmine Dhillon, Alexandra L. Beck, Gelareh Sadigh, Kai Zheng
Subjects: Computation and Language (cs.CL)
[119] arXiv:2509.07459 [pdf, html, other]
Title: AIxcellent Vibes at GermEval 2025 Shared Task on Candy Speech Detection: Improving Model Performance by Span-Level Training
Christian Rene Thelen, Patrick Gustav Blaneck, Tobias Bornheim, Niklas Grieger, Stephan Bialonski
Comments: 6 pages, 1 figure, 2 tables
Subjects: Computation and Language (cs.CL)
[120] arXiv:2509.07403 [pdf, html, other]
Title: LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Weichu Liu, Jing Xiong, Yuxuan Hu, Zixuan Li, Minghuan Tan, Ningning Mao, Chenyang Zhao, Zhongwei Wan, Chaofan Tao, Wendong Xu, Hui Shen, Chengming Li, Lingpeng Kong, Ngai Wong
Comments: Technical Report
Subjects: Computation and Language (cs.CL)
[121] arXiv:2509.07399 [pdf, html, other]
Title: The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering
Yi-Jie Cheng, Oscar Chew, Yun-Nung Chen
Comments: Extended from ACL 2025 SRW
Subjects: Computation and Language (cs.CL)
[122] arXiv:2509.07389 [pdf, html, other]
Title: Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents
Sankalp Tattwadarshi Swain, Anshika Krishnatray, Dhruv Kumar, Jagat Sesh Challa
Comments: Under review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[123] arXiv:2509.07370 [pdf, html, other]
Title: PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions
Yixuan Tang, Yi Yang, Ahmed Abbasi
Subjects: Computation and Language (cs.CL)
[124] arXiv:2509.07324 [pdf, html, other]
Title: Mitigating Attention Localization in Small Scale: Self-Attention Refinement via One-step Belief Propagation
Nakyung Lee, Yeongoon Kim, Minhae Oh, Suhwan Kim, Jin Woo Koo, Hyewon Jo, Jungwoo Lee
Comments: Accepted at EMNLP 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[125] arXiv:2509.07311 [pdf, html, other]
Title: Does This Look Familiar to You? Knowledge Analysis via Model Internal Representations
Sihyun Park
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[126] arXiv:2509.07309 [pdf, html, other]
Title: Instance-level Performance Prediction for Long-form Generation Tasks
Chi-Yang Hsu, Alexander Braylan, Yiheng Su, Omar Alonso, Matthew Lease
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[127] arXiv:2509.07308 [pdf, html, other]
Title: Basis Vector Metric: A Method for Robust Open-Ended State Change Detection
David Oprea, Sam Powers
Comments: 24 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[128] arXiv:2509.07301 [pdf, html, other]
Title: Causal Attention with Lookahead Keys
Zhuoqing Song, Peng Sun, Huizhuo Yuan, Quanquan Gu
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[129] arXiv:2509.07274 [pdf, html, other]
Title: LLM Analysis of 150+ years of German Parliamentary Debates on Migration Reveals Shift from Post-War Solidarity to Anti-Solidarity in the Last Decade
Aida Kostikova, Ole Pütz, Steffen Eger, Olga Sabelfeld, Benjamin Paassen
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[130] arXiv:2509.07190 [pdf, html, other]
Title: Rule-Based Moral Principles for Explaining Uncertainty in Natural Language Generation
Zahra Atf, Peter R Lewis
Comments: This paper was accepted for presentation at the 35th IEEE International Conference on Collaborative Advances in Software and Computing. Conference website:this https URL
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[131] arXiv:2509.07188 [pdf, html, other]
Title: DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge
Zonghai Yao, Michael Sun, Won Seok Jang, Sunjae Kwon, Soie Kwon, Hong Yu
Comments: Equal contribution for the first two authors. To appear in the proceedings of the Main Conference on Empirical Methods in Natural Language Processing (EMNLP) 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[132] arXiv:2509.07177 [pdf, html, other]
Title: Towards EnergyGPT: A Large Language Model Specialized for the Energy Sector
Amal Chebbi, Babajide Kolade
Subjects: Computation and Language (cs.CL)
[133] arXiv:2509.07142 [pdf, html, other]
Title: Toward Purpose-oriented Topic Model Evaluation enabled by Large Language Models
Zhiyin Tan, Jennifer D'Souza
Comments: Accepted for publication in International Journal on Digital Libraries (IJDL)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Digital Libraries (cs.DL)
[134] arXiv:2509.07139 [pdf, html, other]
Title: The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
William Chen, Chutong Meng, Jiatong Shi, Martijn Bartelds, Shih-Heng Wang, Hsiu-Hsuan Wang, Rafael Mosquera, Sara Hincapie, Dan Jurafsky, Antonis Anastasopoulos, Hung-yi Lee, Karen Livescu, Shinji Watanabe
Comments: Interspeech 2025
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[135] arXiv:2509.07135 [pdf, html, other]
Title: MedBench-IT: A Comprehensive Benchmark for Evaluating Large Language Models on Italian Medical Entrance Examinations
Ruggero Marino Lazzaroni, Alessandro Angioi, Michelangelo Puliga, Davide Sanna, Roberto Marras
Comments: Accepted as an oral presentation at CLiC-it 2025
Subjects: Computation and Language (cs.CL)
[136] arXiv:2509.07969 (cross-list from cs.CV) [pdf, html, other]
Title: Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Xin Lai, Junyi Li, Wei Li, Tao Liu, Tianjian Li, Hengshuang Zhao
Comments: Code, datasets, models are available at this https URL. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[137] arXiv:2509.07966 (cross-list from cs.CV) [pdf, html, other]
Title: Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images
Boammani Aser Lompo, Marc Haraoui
Comments: Work in Progress
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[138] arXiv:2509.07909 (cross-list from cs.LG) [pdf, html, other]
Title: Uncovering Scaling Laws for Large Language Models via Inverse Problems
Arun Verma, Zhaoxuan Wu, Zijian Zhou, Xiaoqiang Lin, Zhiliang Chen, Rachael Hwee Ling Sim, Rui Qiao, Jingtan Wang, Nhung Bui, Xinyuan Niu, Wenyang Hu, Gregory Kang Ruey Lau, Zi-Yu Khoo, Zitong Zhao, Xinyi Xu, Apivich Hemachandra, See-Kiong Ng, Bryan Kian Hsiang Low
Comments: Accepted at EMNLP Findings 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[139] arXiv:2509.07526 (cross-list from cs.SD) [pdf, html, other]
Title: Competitive Audio-Language Models with Data-Efficient Single-Stage Training on Public Data
Gokul Karthik Kumar, Rishabh Saraf, Ludovick Lepauloux, Abdul Muneer, Billel Mokeddem, Hakim Hacid
Comments: Accepted at ASRU 2025
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[140] arXiv:2509.07506 (cross-list from cs.DC) [pdf, html, other]
Title: Astra: A Multi-Agent System for GPU Kernel Performance Optimization
Anjiang Wei, Tianran Sun, Yogesh Seenichamy, Hang Song, Anne Ouyang, Azalia Mirhoseini, Ke Wang, Alex Aiken
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Software Engineering (cs.SE)
[141] arXiv:2509.07450 (cross-list from cs.CV) [pdf, html, other]
Title: GLEAM: Learning to Match and Explain in Cross-View Geo-Localization
Xudong Lu, Zhi Zheng, Yi Wan, Yongxiang Yao, Annan Wang, Renrui Zhang, Panwang Xia, Qiong Wu, Qingyun Li, Weifeng Lin, Xiangyu Zhao, Xue Yang, Hongsheng Li
Comments: 18 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[142] arXiv:2509.07414 (cross-list from cs.AI) [pdf, other]
Title: Language Self-Play For Data-Free Training
Jakub Grudzien Kuba, Mengting Gu, Qi Ma, Yuandong Tian, Vijai Mohan
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Science and Game Theory (cs.GT)
[143] arXiv:2509.07282 (cross-list from cs.LG) [pdf, html, other]
Title: ALICE: An Interpretable Neural Architecture for Generalization in Substitution Ciphers
Jeff Shen, Lindsay Smith
Comments: Preprint. Project page at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[144] arXiv:2509.07253 (cross-list from cs.IR) [pdf, html, other]
Title: Benchmarking Information Retrieval Models on Complex Retrieval Tasks
Julian Killingback, Hamed Zamani
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[145] arXiv:2509.07202 (cross-list from cs.HC) [pdf, other]
Title: Neurocognitive Modeling for Text Generation: Deep Learning Architecture for EEG Data
Khushiyant
Comments: 15 pages, 10 figures, 5 tables
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[146] arXiv:2509.07170 (cross-list from cs.AI) [pdf, html, other]
Title: That's So FETCH: Fashioning Ensemble Techniques for LLM Classification in Civil Legal Intake and Referral
Quinten Steenhuis
Comments: Submission to JURIX 2025
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[147] arXiv:2509.07163 (cross-list from cs.IR) [pdf, html, other]
Title: Beyond Sequential Reranking: Reranker-Guided Search Improves Reasoning Intensive Retrieval
Haike Xu, Tong Chen
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[148] arXiv:2509.07149 (cross-list from cs.LG) [pdf, html, other]
Title: Measuring Uncertainty in Transformer Circuits with Effective Information Consistency
Anatoly A. Krasnovsky
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Theory (cs.IT)
[149] arXiv:2509.07122 (cross-list from cs.AI) [pdf, html, other]
Title: Neuro-Symbolic Frameworks: Conceptual Characterization and Empirical Comparative Analysis
Sania Sinha, Tanawan Premsri, Danial Kamali, Parisa Kordjamshidi
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Symbolic Computation (cs.SC)
[150] arXiv:2509.07098 (cross-list from cs.AI) [pdf, html, other]
Title: Instruction Agent: Enhancing Agent with Expert Demonstration
Yinheng Li, Hailey Hultquist, Justin Wagle, Kazuhito Koishida
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[151] arXiv:2509.07017 (cross-list from cs.AI) [pdf, html, other]
Title: From Eigenmodes to Proofs: Integrating Graph Spectral Operators with Symbolic Interpretable Reasoning
Andrew Kiruluta, Priscilla Burity
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[152] arXiv:2509.07006 (cross-list from cs.CY) [pdf, html, other]
Title: ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code
Kapil Madan
Comments: 53 pages, 7 figures, 8 tables. Open-source implementation available at: this https URL. Work explores the integration of policy-as-code for AI alignment, with a case study in culturally-nuanced, ethical AI using Dharmic principles
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[153] arXiv:2509.06994 (cross-list from cs.CV) [pdf, html, other]
Title: VLMs-in-the-Wild: Bridging the Gap Between Academic Benchmarks and Enterprise Reality
Srihari Bandraupalli, Anupam Purwar
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[154] arXiv:2509.06982 (cross-list from cs.LG) [pdf, html, other]
Title: CARE: Decoding Time Safety Alignment via Rollback and Introspection Intervention
Xiaomeng Hu, Fei Huang, Chenhan Yuan, Junyang Lin, Tsung-Yi Ho
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Tue, 9 Sep 2025 (showing first 17 of 91 entries )

[155] arXiv:2509.06952 [pdf, other]
Title: On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts
Linlu Qiu, Cedegao E. Zhang, Joshua B. Tenenbaum, Yoon Kim, Roger P. Levy
Comments: EMNLP 2025 (Main)
Subjects: Computation and Language (cs.CL)
[156] arXiv:2509.06949 [pdf, other]
Title: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
Yinjie Wang, Ling Yang, Bowen Li, Ye Tian, Ke Shen, Mengdi Wang
Comments: Code and Models: this https URL
Subjects: Computation and Language (cs.CL)
[157] arXiv:2509.06948 [pdf, html, other]
Title: Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning
Liang Chen, Xueting Han, Li Shen, Jing Bai, Kam-Fai Wong
Subjects: Computation and Language (cs.CL)
[158] arXiv:2509.06902 [pdf, other]
Title: Proof-Carrying Numbers (PCN): A Protocol for Trustworthy Numeric Answers from LLMs via Claim Verification
Aivin V. Solatorio
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Databases (cs.DB); Machine Learning (cs.LG)
[159] arXiv:2509.06888 [pdf, html, other]
Title: mmBERT: A Modern Multilingual Encoder with Annealed Language Learning
Marc Marone, Orion Weller, William Fleshman, Eugene Yang, Dawn Lawrie, Benjamin Van Durme
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[160] arXiv:2509.06883 [pdf, other]
Title: UNH at CheckThat! 2025: Fine-tuning Vs Prompting in Claim Extraction
Joe Wilder, Nikhil Kadapala, Benji Xu, Mohammed Alsaadi, Aiden Parsons, Mitchell Rogers, Palash Agarwal, Adam Hassick, Laura Dietz
Comments: 16 pages,3 tables, CLEF 2025 Working Notes, 9-12 September 2025, Madrid, Spain
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[161] arXiv:2509.06870 [pdf, other]
Title: The Majority is not always right: RL training for solution aggregation
Wenting Zhao, Pranjal Aggarwal, Swarnadeep Saha, Asli Celikyilmaz, Jason Weston, Ilia Kulikov
Subjects: Computation and Language (cs.CL)
[162] arXiv:2509.06838 [pdf, html, other]
Title: EPT Benchmark: Evaluation of Persian Trustworthiness in Large Language Models
Mohammad Reza Mirbagheri, Mohammad Mahdi Mirkamali, Zahra Motoshaker Arani, Ali Javeri, Amir Mahdi Sadeghzadeh, Rasool Jalili
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[163] arXiv:2509.06836 [pdf, html, other]
Title: COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens
Eugene Kwek, Wenpeng Yin
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[164] arXiv:2509.06813 [pdf, html, other]
Title: A Comparative Benchmark of Large Language Models for Labelling Wind Turbine Maintenance Logs
Max Malyi, Jonathan Shek, Alasdair McDonald, Andre Biscaya
Comments: Associated GitHub repository: this https URL
Subjects: Computation and Language (cs.CL)
[165] arXiv:2509.06809 [pdf, html, other]
Title: Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem
Valentin Quesnel, Damien Sileo
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[166] arXiv:2509.06807 [pdf, html, other]
Title: MoGU V2: Toward a Higher Pareto Frontier Between Model Usability and Security
Yanrui Du, Fenglei Fan, Sendong Zhao, Jiawei Cao, Ting Liu, Bing Qin
Subjects: Computation and Language (cs.CL)
[167] arXiv:2509.06806 [pdf, other]
Title: MachineLearningLM: Scaling Many-shot In-context Learning via Continued Pretraining
Haoyu Dong, Pengkun Zhang, Mingzhe Lu, Yanzhen Shen, Guolin Ke
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[168] arXiv:2509.06795 [pdf, html, other]
Title: Anchoring Refusal Direction: Mitigating Safety Risks in Tuning via Projection Constraint
Yanrui Du, Fenglei Fan, Sendong Zhao, Jiawei Cao, Qika Lin, Kai He, Ting Liu, Bing Qin, Mengling Feng
Subjects: Computation and Language (cs.CL)
[169] arXiv:2509.06704 [pdf, html, other]
Title: Will Annotators Disagree? Identifying Subjectivity in Value-Laden Arguments
Amir Homayounirad, Enrico Liscio, Tong Wang, Catholijn M. Jonker, Luciano C.Siebert
Comments: Accepted at Findings of EMNLP 2025
Subjects: Computation and Language (cs.CL)
[170] arXiv:2509.06675 [pdf, html, other]
Title: ParCzech4Speech: A New Speech Corpus Derived from Czech Parliamentary Data
Vladislav Stankov, Matyáš Kopp, Ondřej Bojar
Journal-ref: In: Proceedings of the 28th International Conference on Text, Speech, and Dialogue (TSD 2025), pp.299-308
Subjects: Computation and Language (cs.CL)
[171] arXiv:2509.06652 [pdf, html, other]
Title: IntrEx: A Dataset for Modeling Engagement in Educational Conversations
Xingwei Tan, Mahathi Parvatham, Chiara Gambi, Gabriele Pergola
Comments: EMNLP 2025 Findings camera-ready, 9+7 pages
Subjects: Computation and Language (cs.CL)
Total of 352 entries : 72-171 101-200 201-300 301-352
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack