Computation and Language

Authors and titles for recent submissions

See today's new changes

Total of 352 entries : 72-171 101-200 201-300 301-352

Showing up to 100 entries per page: fewer | more | all

[72] arXiv:2509.08438 [pdf, html, other]: Title: CommonVoice-SpeechRE and RPG-MoGe: Advancing Speech Relation Extraction with a New Dataset and Multi-Order Generative Framework

Jinzhong Ning, Paerhati Tulajiang, Yingying Le, Yijia Zhang, Yuanyuan Sun, Hongfei Lin, Haifeng Liu

Subjects: Computation and Language (cs.CL); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[73] arXiv:2509.08381 [pdf, other]: Title: Low-Resource Fine-Tuning for Multi-Task Structured Information Extraction with a Billion-Parameter Instruction-Tuned Model

Yu Cheng Chih, Yong Hao Hou

Comments: 13 pages, 8 figures, includes experiments on JSON extraction, knowledge graph extraction, and NER

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[74] arXiv:2509.08358 [pdf, html, other]: Title: <think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs

Sergey Pletenev, Daniil Moskovskiy, Alexander Panchenko

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[75] arXiv:2509.08355 [pdf, html, other]: Title: Automatic Detection of Inauthentic Templated Responses in English Language Assessments

Yashad Samant, Lee Becker, Scott Hellman, Bradley Behan, Sarah Hughes, Joshua Southerland

Comments: Accepted to National Council on Measurement in Education (NCME) 2025 Annual Meeting

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[76] arXiv:2509.08345 [pdf, html, other]: Title: Toward Subtrait-Level Model Explainability in Automated Writing Evaluation

Alejandro Andrade-Lotero, Lee Becker, Joshua Southerland, Scott Hellman

Comments: Accepted to National Council on Measurement in Education (NCME) 2025 Annual Meeting

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[77] arXiv:2509.08304 [pdf, other]: Title: Towards Knowledge-Aware Document Systems: Modeling Semantic Coverage Relations via Answerability Detection

Yehudit Aperstein, Alon Gottlib, Gal Benita, Alexander Apartsin

Comments: 27 pages, 1 figure

Subjects: Computation and Language (cs.CL)
[78] arXiv:2509.08217 [pdf, html, other]: Title: Balancing Quality and Variation: Spam Filtering Distorts Data Label Distributions

Eve Fleisig, Matthias Orlikowski, Philipp Cimiano, Dan Klein

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[79] arXiv:2509.08150 [pdf, html, other]: Title: Verbalized Algorithms

Supriya Lall, Christian Farrell, Hari Pathanjaly, Marko Pavic, Sarvesh Chezhian, Masataro Asai

Comments: Submitted to NeurIPS 2025 Workshop on Efficient Reasoning

Subjects: Computation and Language (cs.CL)
[80] arXiv:2509.08146 [pdf, html, other]: Title: Bias after Prompting: Persistent Discrimination in Large Language Models

Nivedha Sivakumar, Natalie Mackraz, Samira Khorshidi, Krishna Patel, Barry-John Theobald, Luca Zappella, Nicholas Apostoloff

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[81] arXiv:2509.08105 [pdf, html, other]: Title: MERLIN: Multi-Stage Curriculum Alignment for Multilingual Encoder and LLM Fusion

Kosei Uemura, David Guzmán, Quang Phuoc Nguyen, Jesujoba Oluwadara Alabi, En-shiun Annie Lee, David Ifeoluwa Adelani

Comments: under submission

Subjects: Computation and Language (cs.CL)
[82] arXiv:2509.08093 [pdf, html, other]: Title: Culturally transmitted color categories in LLMs reflect a learning bias toward efficient compression

Nathaniel Imel, Noga Zaslavsky

Subjects: Computation and Language (cs.CL)
[83] arXiv:2509.08075 [pdf, html, other]: Title: No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models

Flor Miriam Plaza-del-Arco, Paul Röttger, Nino Scherrer, Emanuele Borgonovo, Elmar Plischke, Dirk Hovy

Subjects: Computation and Language (cs.CL)
[84] arXiv:2509.08032 [pdf, html, other]: Title: SciGPT: A Large Language Model for Scientific Literature Understanding and Knowledge Discovery

Fengyu She, Nan Wang, Hongfei Wu, Ziyi Wan, Jingmian Wang, Chang Wang

Subjects: Computation and Language (cs.CL)
[85] arXiv:2509.08025 [pdf, html, other]: Title: NOWJ@COLIEE 2025: A Multi-stage Framework Integrating Embedding Models and Large Language Models for Legal Retrieval and Entailment

Hoang-Trung Nguyen, Tan-Minh Nguyen, Xuan-Bach Le, Tuan-Kiet Le, Khanh-Huyen Nguyen, Ha-Thanh Nguyen, Thi-Hai-Yen Vuong, Le-Minh Nguyen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[86] arXiv:2509.08022 [pdf, html, other]: Title: MVPBench: A Benchmark and Fine-Tuning Framework for Aligning Large Language Models with Diverse Human Values

Yao Liang, Dongcheng Zhao, Feifei Zhao, Guobin Shen, Yuwei Wang, Dongqi Liang, Yi Zeng

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[87] arXiv:2509.08000 [pdf, html, other]: Title: AntiDote: Bi-level Adversarial Training for Tamper-Resistant LLMs

Debdeep Sanyal, Manodeep Ray, Murari Mandal

Comments: 19 pages

Subjects: Computation and Language (cs.CL)
[88] arXiv:2509.07998 [pdf, html, other]: Title: Bilingual Word Level Language Identification for Omotic Languages

Mesay Gemeda Yigezu, Girma Yohannis Bade, Atnafu Lambebo Tonja, Olga Kolesnikova, Grigori Sidorov, Alexander Gelbukh

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[89] arXiv:2509.08814 (cross-list from cs.LG) [pdf, html, other]: Title: Merge-of-Thought Distillation

Zhanming Shen, Zeyu Qin, Zenan Huang, Hao Chen, Jiaqi Hu, Yihong Zhuang, Guoshan Lu, Gang Chen, Junbo Zhao

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[90] arXiv:2509.08803 (cross-list from cs.SI) [pdf, html, other]: Title: Scaling Truth: The Confidence Paradox in AI Fact-Checking

Ihsan A. Qazi, Zohaib Khan, Abdullah Ghani, Agha A. Raza, Zafar A. Qazi, Wassay Sajjad, Ayesha Ali, Asher Javaid, Muhammad Abdullah Sohail, Abdul H. Azeemi

Comments: 65 pages, 26 figures, 6 tables

Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[91] arXiv:2509.08777 (cross-list from cs.CV) [pdf, html, other]: Title: Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles

Eric Slyman, Mehrab Tanjim, Kushal Kafle, Stefan Lee

Comments: 17 pages, 8 figures, Accepted at ICCV 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[92] arXiv:2509.08755 (cross-list from cs.LG) [pdf, other]: Title: AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Zhiheng Xi, Jixuan Huang, Chenyang Liao, Baodai Huang, Honglin Guo, Jiaqi Liu, Rui Zheng, Junjie Ye, Jiazheng Zhang, Wenxiang Chen, Wei He, Yiwen Ding, Guanyu Li, Zehui Chen, Zhengyin Du, Xuesong Yao, Yufei Xu, Jiecao Chen, Tao Gui, Zuxuan Wu, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang

Comments: preprint, 39 pages, 16 figures. Project: this https URL. Framework and Code: this https URL, this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[93] arXiv:2509.08653 (cross-list from cs.LG) [pdf, html, other]: Title: Generative Data Refinement: Just Ask for Better Data

Minqi Jiang, João G. M. Araújo, Will Ellsworth, Sian Gooding, Edward Grefenstette

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[94] arXiv:2509.08494 (cross-list from cs.CY) [pdf, html, other]: Title: HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants

Benjamin Sturgeon, Daniel Samuelson, Jacob Haimes, Jacy Reese Anthis

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[95] arXiv:2509.08315 (cross-list from cs.LG) [pdf, html, other]: Title: EvolKV: Evolutionary KV Cache Compression for LLM Inference

Bohan Yu, Yekun Chai

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[96] arXiv:2509.08182 (cross-list from cs.PL) [pdf, html, other]: Title: XML Prompting as Grammar-Constrained Interaction: Fixed-Point Semantics, Convergence Guarantees, and Human-AI Protocols

Faruk Alpay, Taylan Alpay

Comments: 7 pages, multiple XML prompts

Subjects: Programming Languages (cs.PL); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[97] arXiv:2509.08010 (cross-list from cs.CY) [pdf, html, other]: Title: Measuring and mitigating overreliance is necessary for building human-compatible AI

Lujain Ibrahim, Katherine M. Collins, Sunnie S. Y. Kim, Anka Reuel, Max Lamparth, Kevin Feng, Lama Ahmad, Prajna Soni, Alia El Kattan, Merlin Stein, Siddharth Swaroop, Ilia Sucholutsky, Andrew Strait, Q. Vera Liao, Umang Bhatt

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)

[98] arXiv:2509.07980 [pdf, other]: Title: Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Tong Zheng, Hongming Zhang, Wenhao Yu, Xiaoyang Wang, Xinyu Yang, Runpeng Dai, Rui Liu, Huiwen Bao, Chengsong Huang, Heng Huang, Dong Yu

Comments: Project website: this https URL

Subjects: Computation and Language (cs.CL)
[99] arXiv:2509.07968 [pdf, html, other]: Title: SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge

Lukas Haas, Gal Yona, Giovanni D'Antonio, Sasha Goldshtein, Dipanjan Das

Subjects: Computation and Language (cs.CL)
[100] arXiv:2509.07925 [pdf, html, other]: Title: GENUINE: Graph Enhanced Multi-level Uncertainty Estimation for Large Language Models

Tuo Wang, Adithya Kulkarni, Tyler Cody, Peter A. Beling, Yujun Yan, Dawei Zhou

Comments: Accepted by EMNLP 2025

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[101] arXiv:2509.07908 [pdf, html, other]: Title: Biased Tales: Cultural and Topic Bias in Generating Children's Stories

Donya Rooein, Vilém Zouhar, Debora Nozza, Dirk Hovy

Subjects: Computation and Language (cs.CL)
[102] arXiv:2509.07889 [pdf, html, other]: Title: From Detection to Mitigation: Addressing Gender Bias in Chinese Texts via Efficient Tuning and Voting-Based Rebalancing

Chengyan Wu, Yiqiang Cai, Yufei Cheng, Yun Xue

Comments: NLPCC 2025

Subjects: Computation and Language (cs.CL)
[103] arXiv:2509.07869 [pdf, html, other]: Title: Are Humans as Brittle as Large Language Models?

Jiahui Li, Sean Papay, Roman Klinger

Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[104] arXiv:2509.07829 [pdf, html, other]: Title: Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost

Mihai Nadas, Laura Diosan, Andreea Tomescu, Andrei Piscoran

Comments: 25 pages, 8 figures, includes datasets and models released on Hugging Face

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[105] arXiv:2509.07817 [pdf, other]: Title: Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems

Xiaolin Chen, Xuemeng Song, Haokun Wen, Weili Guan, Xiangyu Zhao, Liqiang Nie

Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[106] arXiv:2509.07801 [pdf, html, other]: Title: SciNLP: A Domain-Specific Benchmark for Full-Text Scientific Entity and Relation Extraction in NLP

Decheng Duan, Yingyi Zhang, Jitong Peng, Chengzhi Zhang

Comments: EMNLP 2025 Main

Subjects: Computation and Language (cs.CL); Digital Libraries (cs.DL); Information Retrieval (cs.IR)
[107] arXiv:2509.07768 [pdf, html, other]: Title: Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning

Michele Joshua Maggini, Dhia Merzougui, Rabiraj Bandyopadhyay, Gaël Dias, Fabrice Maurel, Pablo Gamallo

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[108] arXiv:2509.07755 [pdf, html, other]: Title: Factuality Beyond Coherence: Evaluating LLM Watermarking Methods for Medical Texts

Rochana Prih Hastuti, Rian Adam Rajagede, Mansour Al Ghanim, Mengxin Zheng, Qian Lou

Comments: Accepted at EMNLP 2025 Findings

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[109] arXiv:2509.07730 [pdf, html, other]: Title: M-BRe: Discovering Training Samples for Relation Extraction from Unlabeled Texts with Large Language Models

Zexuan Li, Hongliang Dai, Piji Li

Comments: Accepted by EMNLP2025 Main Conference

Subjects: Computation and Language (cs.CL)
[110] arXiv:2509.07666 [pdf, html, other]: Title: MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval

Xixi Wu, Yanchao Tan, Nan Hou, Ruiyang Zhang, Hong Cheng

Comments: EMNLP Main 2025

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[111] arXiv:2509.07622 [pdf, html, other]: Title: MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs

Libo Ren, Yee Man Ng, Lifeng Han

Comments: system paper at CLEF 2025

Subjects: Computation and Language (cs.CL)
[112] arXiv:2509.07588 [pdf, html, other]: Title: BALI: Enhancing Biomedical Language Representations through Knowledge Graph and Language Model Alignment

Andrey Sakhovskiy, Elena Tutubalina

Comments: 9 pages, 1 figure, published in "The 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025)"

Journal-ref: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (2025). Association for Computing Machinery, 1152-1164

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[113] arXiv:2509.07555 [pdf, html, other]: Title: Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition

Yi Liu, Xiangrong Zhu, Xiangyu Liu, Wei Wei, Wei Hu

Comments: Accepted in EMNLP Findings 2025

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[114] arXiv:2509.07553 [pdf, html, other]: Title: VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents

Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang, Zhuosheng Zhang

Subjects: Computation and Language (cs.CL)
[115] arXiv:2509.07512 [pdf, html, other]: Title: ALLabel: Three-stage Active Learning for LLM-based Entity Recognition using Demonstration Retrieval

Zihan Chen, Lei Shi, Weize Wu, Qiji Zhou, Yue Zhang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[116] arXiv:2509.07475 [pdf, html, other]: Title: HALT-RAG: A Task-Adaptable Framework for Hallucination Detection with Calibrated NLI Ensembles and Abstention

Saumya Goswami, Siddharth Kurra

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117] arXiv:2509.07471 [pdf, html, other]: Title: From Scarcity to Efficiency: Investigating the Effects of Data Augmentation on African Machine Translation

Mardiyyah Oduwole, Oluwatosin Olajide, Jamiu Suleiman, Faith Hunja, Busayo Awobade, Fatimo Adebanjo, Comfort Akanni, Chinonyelum Igwe, Peace Ododo, Promise Omoigui, Steven Kolawole, Abraham Owodunni

Comments: 8 pages, 3 tables. Exploratory work on Data Augmentation for African Machine Translation

Subjects: Computation and Language (cs.CL)
[118] arXiv:2509.07462 [pdf, other]: Title: Understanding Stigmatizing Language Lexicons: A Comparative Analysis in Clinical Contexts

Yiliang Zhou, Di Hu, Tianchu Lyu, Jasmine Dhillon, Alexandra L. Beck, Gelareh Sadigh, Kai Zheng

Subjects: Computation and Language (cs.CL)
[119] arXiv:2509.07459 [pdf, html, other]: Title: AIxcellent Vibes at GermEval 2025 Shared Task on Candy Speech Detection: Improving Model Performance by Span-Level Training

Christian Rene Thelen, Patrick Gustav Blaneck, Tobias Bornheim, Niklas Grieger, Stephan Bialonski

Comments: 6 pages, 1 figure, 2 tables

Subjects: Computation and Language (cs.CL)
[120] arXiv:2509.07403 [pdf, html, other]: Title: LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Weichu Liu, Jing Xiong, Yuxuan Hu, Zixuan Li, Minghuan Tan, Ningning Mao, Chenyang Zhao, Zhongwei Wan, Chaofan Tao, Wendong Xu, Hui Shen, Chengming Li, Lingpeng Kong, Ngai Wong

Comments: Technical Report

Subjects: Computation and Language (cs.CL)
[121] arXiv:2509.07399 [pdf, html, other]: Title: The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering

Yi-Jie Cheng, Oscar Chew, Yun-Nung Chen

Comments: Extended from ACL 2025 SRW

Subjects: Computation and Language (cs.CL)
[122] arXiv:2509.07389 [pdf, html, other]: Title: Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents

Sankalp Tattwadarshi Swain, Anshika Krishnatray, Dhruv Kumar, Jagat Sesh Challa

Comments: Under review

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[123] arXiv:2509.07370 [pdf, html, other]: Title: PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions

Yixuan Tang, Yi Yang, Ahmed Abbasi

Subjects: Computation and Language (cs.CL)
[124] arXiv:2509.07324 [pdf, html, other]: Title: Mitigating Attention Localization in Small Scale: Self-Attention Refinement via One-step Belief Propagation

Nakyung Lee, Yeongoon Kim, Minhae Oh, Suhwan Kim, Jin Woo Koo, Hyewon Jo, Jungwoo Lee

Comments: Accepted at EMNLP 2025

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[125] arXiv:2509.07311 [pdf, html, other]: Title: Does This Look Familiar to You? Knowledge Analysis via Model Internal Representations

Sihyun Park

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[126] arXiv:2509.07309 [pdf, html, other]: Title: Instance-level Performance Prediction for Long-form Generation Tasks

Chi-Yang Hsu, Alexander Braylan, Yiheng Su, Omar Alonso, Matthew Lease

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[127] arXiv:2509.07308 [pdf, html, other]: Title: Basis Vector Metric: A Method for Robust Open-Ended State Change Detection

David Oprea, Sam Powers

Comments: 24 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[128] arXiv:2509.07301 [pdf, html, other]: Title: Causal Attention with Lookahead Keys

Zhuoqing Song, Peng Sun, Huizhuo Yuan, Quanquan Gu

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[129] arXiv:2509.07274 [pdf, html, other]: Title: LLM Analysis of 150+ years of German Parliamentary Debates on Migration Reveals Shift from Post-War Solidarity to Anti-Solidarity in the Last Decade

Aida Kostikova, Ole Pütz, Steffen Eger, Olga Sabelfeld, Benjamin Paassen

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[130] arXiv:2509.07190 [pdf, html, other]: Title: Rule-Based Moral Principles for Explaining Uncertainty in Natural Language Generation

Zahra Atf, Peter R Lewis

Comments: This paper was accepted for presentation at the 35th IEEE International Conference on Collaborative Advances in Software and Computing. Conference website:this https URL

Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[131] arXiv:2509.07188 [pdf, html, other]: Title: DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge

Zonghai Yao, Michael Sun, Won Seok Jang, Sunjae Kwon, Soie Kwon, Hong Yu

Comments: Equal contribution for the first two authors. To appear in the proceedings of the Main Conference on Empirical Methods in Natural Language Processing (EMNLP) 2025

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[132] arXiv:2509.07177 [pdf, html, other]: Title: Towards EnergyGPT: A Large Language Model Specialized for the Energy Sector

Amal Chebbi, Babajide Kolade

Subjects: Computation and Language (cs.CL)
[133] arXiv:2509.07142 [pdf, html, other]: Title: Toward Purpose-oriented Topic Model Evaluation enabled by Large Language Models

Zhiyin Tan, Jennifer D'Souza

Comments: Accepted for publication in International Journal on Digital Libraries (IJDL)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Digital Libraries (cs.DL)
[134] arXiv:2509.07139 [pdf, html, other]: Title: The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties

William Chen, Chutong Meng, Jiatong Shi, Martijn Bartelds, Shih-Heng Wang, Hsiu-Hsuan Wang, Rafael Mosquera, Sara Hincapie, Dan Jurafsky, Antonis Anastasopoulos, Hung-yi Lee, Karen Livescu, Shinji Watanabe

Comments: Interspeech 2025

Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[135] arXiv:2509.07135 [pdf, html, other]: Title: MedBench-IT: A Comprehensive Benchmark for Evaluating Large Language Models on Italian Medical Entrance Examinations

Ruggero Marino Lazzaroni, Alessandro Angioi, Michelangelo Puliga, Davide Sanna, Roberto Marras

Comments: Accepted as an oral presentation at CLiC-it 2025

Subjects: Computation and Language (cs.CL)
[136] arXiv:2509.07969 (cross-list from cs.CV) [pdf, html, other]: Title: Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Xin Lai, Junyi Li, Wei Li, Tao Liu, Tianjian Li, Hengshuang Zhao

Comments: Code, datasets, models are available at this https URL. Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[137] arXiv:2509.07966 (cross-list from cs.CV) [pdf, html, other]: Title: Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images

Boammani Aser Lompo, Marc Haraoui

Comments: Work in Progress

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[138] arXiv:2509.07909 (cross-list from cs.LG) [pdf, html, other]: Title: Uncovering Scaling Laws for Large Language Models via Inverse Problems

Arun Verma, Zhaoxuan Wu, Zijian Zhou, Xiaoqiang Lin, Zhiliang Chen, Rachael Hwee Ling Sim, Rui Qiao, Jingtan Wang, Nhung Bui, Xinyuan Niu, Wenyang Hu, Gregory Kang Ruey Lau, Zi-Yu Khoo, Zitong Zhao, Xinyi Xu, Apivich Hemachandra, See-Kiong Ng, Bryan Kian Hsiang Low

Comments: Accepted at EMNLP Findings 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[139] arXiv:2509.07526 (cross-list from cs.SD) [pdf, html, other]: Title: Competitive Audio-Language Models with Data-Efficient Single-Stage Training on Public Data

Gokul Karthik Kumar, Rishabh Saraf, Ludovick Lepauloux, Abdul Muneer, Billel Mokeddem, Hakim Hacid

Comments: Accepted at ASRU 2025

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[140] arXiv:2509.07506 (cross-list from cs.DC) [pdf, html, other]: Title: Astra: A Multi-Agent System for GPU Kernel Performance Optimization

Anjiang Wei, Tianran Sun, Yogesh Seenichamy, Hang Song, Anne Ouyang, Azalia Mirhoseini, Ke Wang, Alex Aiken

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Software Engineering (cs.SE)
[141] arXiv:2509.07450 (cross-list from cs.CV) [pdf, html, other]: Title: GLEAM: Learning to Match and Explain in Cross-View Geo-Localization

Xudong Lu, Zhi Zheng, Yi Wan, Yongxiang Yao, Annan Wang, Renrui Zhang, Panwang Xia, Qiong Wu, Qingyun Li, Weifeng Lin, Xiangyu Zhao, Xue Yang, Hongsheng Li

Comments: 18 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[142] arXiv:2509.07414 (cross-list from cs.AI) [pdf, other]: Title: Language Self-Play For Data-Free Training

Jakub Grudzien Kuba, Mengting Gu, Qi Ma, Yuandong Tian, Vijai Mohan

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Science and Game Theory (cs.GT)
[143] arXiv:2509.07282 (cross-list from cs.LG) [pdf, html, other]: Title: ALICE: An Interpretable Neural Architecture for Generalization in Substitution Ciphers

Jeff Shen, Lindsay Smith

Comments: Preprint. Project page at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[144] arXiv:2509.07253 (cross-list from cs.IR) [pdf, html, other]: Title: Benchmarking Information Retrieval Models on Complex Retrieval Tasks

Julian Killingback, Hamed Zamani

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[145] arXiv:2509.07202 (cross-list from cs.HC) [pdf, other]: Title: Neurocognitive Modeling for Text Generation: Deep Learning Architecture for EEG Data

Khushiyant

Comments: 15 pages, 10 figures, 5 tables

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[146] arXiv:2509.07170 (cross-list from cs.AI) [pdf, html, other]: Title: That's So FETCH: Fashioning Ensemble Techniques for LLM Classification in Civil Legal Intake and Referral

Quinten Steenhuis

Comments: Submission to JURIX 2025

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[147] arXiv:2509.07163 (cross-list from cs.IR) [pdf, html, other]: Title: Beyond Sequential Reranking: Reranker-Guided Search Improves Reasoning Intensive Retrieval

Haike Xu, Tong Chen

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[148] arXiv:2509.07149 (cross-list from cs.LG) [pdf, html, other]: Title: Measuring Uncertainty in Transformer Circuits with Effective Information Consistency

Anatoly A. Krasnovsky

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Theory (cs.IT)
[149] arXiv:2509.07122 (cross-list from cs.AI) [pdf, html, other]: Title: Neuro-Symbolic Frameworks: Conceptual Characterization and Empirical Comparative Analysis

Sania Sinha, Tanawan Premsri, Danial Kamali, Parisa Kordjamshidi

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Symbolic Computation (cs.SC)
[150] arXiv:2509.07098 (cross-list from cs.AI) [pdf, html, other]: Title: Instruction Agent: Enhancing Agent with Expert Demonstration

Yinheng Li, Hailey Hultquist, Justin Wagle, Kazuhito Koishida

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[151] arXiv:2509.07017 (cross-list from cs.AI) [pdf, html, other]: Title: From Eigenmodes to Proofs: Integrating Graph Spectral Operators with Symbolic Interpretable Reasoning

Andrew Kiruluta, Priscilla Burity

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[152] arXiv:2509.07006 (cross-list from cs.CY) [pdf, html, other]: Title: ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code

Kapil Madan

Comments: 53 pages, 7 figures, 8 tables. Open-source implementation available at: this https URL. Work explores the integration of policy-as-code for AI alignment, with a case study in culturally-nuanced, ethical AI using Dharmic principles

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[153] arXiv:2509.06994 (cross-list from cs.CV) [pdf, html, other]: Title: VLMs-in-the-Wild: Bridging the Gap Between Academic Benchmarks and Enterprise Reality

Srihari Bandraupalli, Anupam Purwar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[154] arXiv:2509.06982 (cross-list from cs.LG) [pdf, html, other]: Title: CARE: Decoding Time Safety Alignment via Rollback and Introspection Intervention

Xiaomeng Hu, Fei Huang, Chenhan Yuan, Junyang Lin, Tsung-Yi Ho

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

[155] arXiv:2509.06952 [pdf, other]: Title: On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts

Linlu Qiu, Cedegao E. Zhang, Joshua B. Tenenbaum, Yoon Kim, Roger P. Levy

Comments: EMNLP 2025 (Main)

Subjects: Computation and Language (cs.CL)
[156] arXiv:2509.06949 [pdf, other]: Title: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Yinjie Wang, Ling Yang, Bowen Li, Ye Tian, Ke Shen, Mengdi Wang

Comments: Code and Models: this https URL

Subjects: Computation and Language (cs.CL)
[157] arXiv:2509.06948 [pdf, html, other]: Title: Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning

Liang Chen, Xueting Han, Li Shen, Jing Bai, Kam-Fai Wong

Subjects: Computation and Language (cs.CL)
[158] arXiv:2509.06902 [pdf, other]: Title: Proof-Carrying Numbers (PCN): A Protocol for Trustworthy Numeric Answers from LLMs via Claim Verification

Aivin V. Solatorio

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Databases (cs.DB); Machine Learning (cs.LG)
[159] arXiv:2509.06888 [pdf, html, other]: Title: mmBERT: A Modern Multilingual Encoder with Annealed Language Learning

Marc Marone, Orion Weller, William Fleshman, Eugene Yang, Dawn Lawrie, Benjamin Van Durme

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[160] arXiv:2509.06883 [pdf, other]: Title: UNH at CheckThat! 2025: Fine-tuning Vs Prompting in Claim Extraction

Joe Wilder, Nikhil Kadapala, Benji Xu, Mohammed Alsaadi, Aiden Parsons, Mitchell Rogers, Palash Agarwal, Adam Hassick, Laura Dietz

Comments: 16 pages,3 tables, CLEF 2025 Working Notes, 9-12 September 2025, Madrid, Spain

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[161] arXiv:2509.06870 [pdf, other]: Title: The Majority is not always right: RL training for solution aggregation

Wenting Zhao, Pranjal Aggarwal, Swarnadeep Saha, Asli Celikyilmaz, Jason Weston, Ilia Kulikov

Subjects: Computation and Language (cs.CL)
[162] arXiv:2509.06838 [pdf, html, other]: Title: EPT Benchmark: Evaluation of Persian Trustworthiness in Large Language Models

Mohammad Reza Mirbagheri, Mohammad Mahdi Mirkamali, Zahra Motoshaker Arani, Ali Javeri, Amir Mahdi Sadeghzadeh, Rasool Jalili

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[163] arXiv:2509.06836 [pdf, html, other]: Title: COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens

Eugene Kwek, Wenpeng Yin

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[164] arXiv:2509.06813 [pdf, html, other]: Title: A Comparative Benchmark of Large Language Models for Labelling Wind Turbine Maintenance Logs

Max Malyi, Jonathan Shek, Alasdair McDonald, Andre Biscaya

Comments: Associated GitHub repository: this https URL

Subjects: Computation and Language (cs.CL)
[165] arXiv:2509.06809 [pdf, html, other]: Title: Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem

Valentin Quesnel, Damien Sileo

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[166] arXiv:2509.06807 [pdf, html, other]: Title: MoGU V2: Toward a Higher Pareto Frontier Between Model Usability and Security

Yanrui Du, Fenglei Fan, Sendong Zhao, Jiawei Cao, Ting Liu, Bing Qin

Subjects: Computation and Language (cs.CL)
[167] arXiv:2509.06806 [pdf, other]: Title: MachineLearningLM: Scaling Many-shot In-context Learning via Continued Pretraining

Haoyu Dong, Pengkun Zhang, Mingzhe Lu, Yanzhen Shen, Guolin Ke

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[168] arXiv:2509.06795 [pdf, html, other]: Title: Anchoring Refusal Direction: Mitigating Safety Risks in Tuning via Projection Constraint

Yanrui Du, Fenglei Fan, Sendong Zhao, Jiawei Cao, Qika Lin, Kai He, Ting Liu, Bing Qin, Mengling Feng

Subjects: Computation and Language (cs.CL)
[169] arXiv:2509.06704 [pdf, html, other]: Title: Will Annotators Disagree? Identifying Subjectivity in Value-Laden Arguments

Amir Homayounirad, Enrico Liscio, Tong Wang, Catholijn M. Jonker, Luciano C.Siebert

Comments: Accepted at Findings of EMNLP 2025

Subjects: Computation and Language (cs.CL)
[170] arXiv:2509.06675 [pdf, html, other]: Title: ParCzech4Speech: A New Speech Corpus Derived from Czech Parliamentary Data

Vladislav Stankov, Matyáš Kopp, Ondřej Bojar

Journal-ref: In: Proceedings of the 28th International Conference on Text, Speech, and Dialogue (TSD 2025), pp.299-308

Subjects: Computation and Language (cs.CL)
[171] arXiv:2509.06652 [pdf, html, other]: Title: IntrEx: A Dataset for Modeling Engagement in Educational Conversations

Xingwei Tan, Mahathi Parvatham, Chiara Gambi, Gabriele Pergola

Comments: EMNLP 2025 Findings camera-ready, 9+7 pages

Subjects: Computation and Language (cs.CL)

Total of 352 entries : 72-171 101-200 201-300 301-352

Showing up to 100 entries per page: fewer | more | all

Computation and Language

Authors and titles for recent submissions

Thu, 11 Sep 2025 (continued, showing last 26 of 42 entries )

Wed, 10 Sep 2025 (showing 57 of 57 entries )

Tue, 9 Sep 2025 (showing first 17 of 91 entries )