Computation and Language

Authors and titles for August 2025

Total of 1753 entries

[51] arXiv:2508.01096 [pdf, html, other]: Title: Cross-Domain Web Information Extraction at Pinterest

Michael Farag, Patrick Halina, Andrey Zaytsev, Alekhya Munagala, Imtihan Ahmed, Junhao Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[52] arXiv:2508.01159 [pdf, other]: Title: Asking the Right Questions: Benchmarking Large Language Models in the Development of Clinical Consultation Templates

Liam G. McCoy, Fateme Nateghi Haredasht, Kanav Chopra, David Wu, David JH Wu, Abass Conteh, Sarita Khemani, Saloni Kumar Maharaj, Vishnu Ravi, Arth Pahwa, Yingjie Weng, Leah Rosengaus, Lena Giang, Kelvin Zhenghao Li, Olivia Jee, Daniel Shirvani, Ethan Goh, Jonathan H. Chen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[53] arXiv:2508.01161 [pdf, html, other]: Title: CSIRO-LT at SemEval-2025 Task 11: Adapting LLMs for Emotion Recognition for Multiple Languages

Jiyu Chen, Necva Bölücü, Sarvnaz Karimi, Diego Mollá, Cécile L. Paris

Comments: In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), Vienna, Austria. Association for Computational Linguistics

Subjects: Computation and Language (cs.CL)
[54] arXiv:2508.01198 [pdf, html, other]: Title: Adaptive Content Restriction for Large Language Models via Suffix Optimization

Yige Li, Peihai Jiang, Jun Sun, Peng Shu, Tianming Liu, Zhen Xiang

Comments: 19 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[55] arXiv:2508.01213 [pdf, html, other]: Title: Show or Tell? Modeling the evolution of request-making in Human-LLM conversations

Shengqi Zhu, Jeffrey M. Rzeszotarski, David Mimno

Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[56] arXiv:2508.01222 [pdf, html, other]: Title: WebDS: An End-to-End Benchmark for Web-based Data Science

Ethan Hsu, Hong Meng Yam, Ines Bouissou, Aaron Murali John, Raj Thota, Josh Koe, Vivek Sarath Putta, G K Dharesan, Alexander Spangher, Shikhar Murty, Tenghao Huang, Christopher D. Manning

Comments: 14 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[57] arXiv:2508.01245 [pdf, html, other]: Title: WarriorMath: Enhancing the Mathematical Ability of Large Language Models with a Defect-aware Framework

Yue Chen, Minghua He, Fangkai Yang, Pu Zhao, Lu Wang, Yu Kang, Yifei Dong, Yuefeng Zhan, Hao Sun, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

Subjects: Computation and Language (cs.CL)
[58] arXiv:2508.01263 [pdf, html, other]: Title: Bridging LLMs and Symbolic Reasoning in Educational QA Systems: Insights from the XAI Challenge at IJCNN 2025

Long S. T. Nguyen, Khang H. N. Vo, Thu H. A. Nguyen, Tuan C. Bui, Duc Q. Nguyen, Thanh-Tung Tran, Anh D. Nguyen, Minh L. Nguyen, Fabien Baldacci, Thang H. Bui, Emanuel Di Nardo, Angelo Ciaramella, Son H. Le, Ihsan Ullah, Lorenzo Di Rocco, Tho T. Quan

Comments: The XAI Challenge @ TRNS-AI Workshop, IJCNN 2025: Explainable AI for Educational Question Answering. Website: this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
[59] arXiv:2508.01290 [pdf, html, other]: Title: Prompting Large Language Models with Partial Knowledge for Answering Questions with Unseen Entities

Zhichao Yan, Jiapu Wang, Jiaoyan Chen, Yanyan Wang, Hongye Tan, Jiye Liang, Xiaoli Li, Ru Li, Jeff Z. Pan

Subjects: Computation and Language (cs.CL)
[60] arXiv:2508.01302 [pdf, html, other]: Title: Aligning Language Models with Real-time Knowledge Editing

Chenming Tang, Yutong Yang, Kexue Wang, Yunfang Wu

Comments: Pre-print

Subjects: Computation and Language (cs.CL)
[61] arXiv:2508.01309 [pdf, html, other]: Title: D-SCoRE: Document-Centric Segmentation and CoT Reasoning with Structured Export for QA-CoT Data Generation

Weibo Zhou, Lingbo Li, Shangsong Liang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[62] arXiv:2508.01317 [pdf, other]: Title: LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points

Xuemiao Zhang, Can Ren, Chengying Tu, Rongxiang Weng, Hongfei Yan, Jingang Wang, Xunliang Cai

Subjects: Computation and Language (cs.CL)
[63] arXiv:2508.01326 [pdf, html, other]: Title: Large-Scale Diverse Synthesis for Mid-Training

Xuemiao Zhang, Chengying Tu, Can Ren, Rongxiang Weng, Hongfei Yan, Jingang Wang, Xunliang Cai

Subjects: Computation and Language (cs.CL)
[64] arXiv:2508.01370 [pdf, html, other]: Title: MaRGen: Multi-Agent LLM Approach for Self-Directed Market Research and Analysis

Roman Koshkin, Pengyu Dai, Nozomi Fujikawa, Masahito Togami, Marco Visentini-Scarzanella

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[65] arXiv:2508.01401 [pdf, html, other]: Title: MedSynth: Realistic, Synthetic Medical Dialogue-Note Pairs

Ahmad Rezaie Mianroodi, Amirali Rezaie, Niko Grisel Todorov, Cyril Rakovski, Frank Rudzicz

Comments: 7 pages excluding references and appendices

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[66] arXiv:2508.01411 [pdf, other]: Title: ArzEn-MultiGenre: An aligned parallel dataset of Egyptian Arabic song lyrics, novels, and subtitles, with English translations

Rania Al-Sabbagh

Journal-ref: Data in Brief, 54

Subjects: Computation and Language (cs.CL)
[67] arXiv:2508.01412 [pdf, html, other]: Title: Discovering Bias Associations through Open-Ended LLM Generations

Jinhao Pan, Chahat Raj, Ziwei Zhu

Subjects: Computation and Language (cs.CL)
[68] arXiv:2508.01424 [pdf, html, other]: Title: From Query to Logic: Ontology-Driven Multi-Hop Reasoning in LLMs

Haonan Bian, Yutao Qi, Rui Yang, Yuanxi Che, Jiaqian Wang, Heming Xia, Ranran Zhen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[69] arXiv:2508.01450 [pdf, other]: Title: Towards Efficient Medical Reasoning with Minimal Fine-Tuning Data

Xinlin Zhuang, Feilong Tang, Haolin Yang, Ming Hu, Huifa Li, Haochen Xue, Yichen Li, Junjun He, Zongyuan Ge, Ying Qian, Imran Razzak

Comments: preprint, under review

Subjects: Computation and Language (cs.CL)
[70] arXiv:2508.01473 [pdf, html, other]: Title: TreeDiff: AST-Guided Code Generation with Diffusion LLMs

Yiming Zeng, Jinghan Cao, Zexin Li, Yiming Chen, Tao Ren, Dawei Xiang, Xidong Wu, Shangqian Gao, Tingting Yu

Subjects: Computation and Language (cs.CL)
[71] arXiv:2508.01480 [pdf, html, other]: Title: Harnessing Collective Intelligence of LLMs for Robust Biomedical QA: A Multi-Model Approach

Dimitra Panou, Alexandros C. Dimopoulos, Manolis Koubarakis, Martin Reczko

Subjects: Computation and Language (cs.CL)
[72] arXiv:2508.01486 [pdf, html, other]: Title: TeSent: A Benchmark Dataset for Fairness-aware Explainable Sentiment Classification in Telugu

Vallabhaneni Raj Kumar, Ashwin S, Supriya Manna, Niladri Sett, Cheedella V S N M S Hema Harshitha, Kurakula Harshitha, Anand Kumar Sharma, Basina Deepakraj, Tanuj Sarkar, Bondada Navaneeth Krishna, Samanthapudi Shakeer

Comments: work under review

Subjects: Computation and Language (cs.CL)
[73] arXiv:2508.01491 [pdf, html, other]: Title: The Homogenizing Effect of Large Language Models on Human Expression and Thought

Zhivar Sourati, Alireza S. Ziabari, Morteza Dehghani

Subjects: Computation and Language (cs.CL)
[74] arXiv:2508.01503 [pdf, html, other]: Title: A Theory of Adaptive Scaffolding for LLM-Based Pedagogical Agents

Clayton Cohn, Surya Rayala, Namrata Srivastava, Joyce Horn Fonteles, Shruti Jain, Xinying Luo, Divya Mereddy, Naveeduddin Mohammed, Gautam Biswas

Subjects: Computation and Language (cs.CL)
[75] arXiv:2508.01541 [pdf, html, other]: Title: MOPrompt: Multi-objective Semantic Evolution for Prompt Optimization

Sara Câmara, Eduardo Luz, Valéria Carvalho, Ivan Meneghini, Gladston Moreira

Comments: 8 pages

Subjects: Computation and Language (cs.CL)
[76] arXiv:2508.01554 [pdf, other]: Title: Are All Prompt Components Value-Neutral? Understanding the Heterogeneous Adversarial Robustness of Dissected Prompt in Large Language Models

Yujia Zheng, Tianhao Li, Haotian Huang, Tianyu Zeng, Jingyu Lu, Chuangxin Chu, Yuekai Huang, Ziyou Jiang, Qian Xiong, Yuyao Ge, Mingyang Li

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[77] arXiv:2508.01630 [pdf, html, other]: Title: OpenMed NER: Open-Source, Domain-Adapted State-of-the-Art Transformers for Biomedical NER Across 12 Public Datasets

Maziyar Panahi

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[78] arXiv:2508.01656 [pdf, other]: Title: Authorship Attribution in Multilingual Machine-Generated Texts

Lucio La Cava, Dominik Macko, Róbert Móro, Ivan Srba, Andrea Tagarelli

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC); Physics and Society (physics.soc-ph)
[79] arXiv:2508.01674 [pdf, html, other]: Title: CUPID: Evaluating Personalized and Contextualized Alignment of LLMs from Interactions

Tae Soo Kim, Yoonjoo Lee, Yoonah Park, Jiho Kim, Young-Ho Kim, Juho Kim

Comments: Accepted to COLM 2025. Project Website: this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[80] arXiv:2508.01682 [pdf, html, other]: Title: The Bidirectional Process Reward Model

Lingyin Zhang, Jun Gao, Xiaoxue Ren, Ziqiang Cao

Subjects: Computation and Language (cs.CL)
[81] arXiv:2508.01696 [pdf, html, other]: Title: CoCoA: Collaborative Chain-of-Agents for Parametric-Retrieved Knowledge Synergy

Yi Jiang, Sendong Zhao, Jianbo Li, Haochun Wang, Lizhe Zhang, Yan Liu, Bing Qin

Comments: code available at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[82] arXiv:2508.01708 [pdf, html, other]: Title: Am I Blue or Is My Hobby Counting Teardrops? Expression Leakage in Large Language Models as a Symptom of Irrelevancy Disruption

Berkay Köprü, Mehrzad Mashal, Yigit Gurses, Akos Kadar, Maximilian Schmitt, Ditty Mathew, Felix Burkhardt, Florian Eyben, Björn W. Schuller

Subjects: Computation and Language (cs.CL)
[83] arXiv:2508.01710 [pdf, html, other]: Title: CultureGuard: Towards Culturally-Aware Dataset and Guard Model for Multilingual Safety Applications

Raviraj Joshi, Rakesh Paul, Kanishk Singla, Anusha Kamath, Michael Evans, Katherine Luna, Shaona Ghosh, Utkarsh Vaidya, Eileen Long, Sanjay Singh Chauhan, Niranjan Wartikar

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[84] arXiv:2508.01739 [pdf, html, other]: Title: Enhancing the Preference Extractor in Multi-turn Dialogues: From Annotating Disasters to Accurate Preference Extraction

Cheng Wang, Ziru Liu, Pengcheng Tang, Mingyu Zhang, Quanyu Dai, Yue Zhu

Subjects: Computation and Language (cs.CL)
[85] arXiv:2508.01754 [pdf, html, other]: Title: AI-Generated Text is Non-Stationary: Detection via Temporal Tomography

Alva West, Yixuan Weng, Minjun Zhu, Luodan Zhang, Zhen Lin, Guangsheng Bao, Yue Zhang

Subjects: Computation and Language (cs.CL)
[86] arXiv:2508.01781 [pdf, html, other]: Title: A comprehensive taxonomy of hallucinations in Large Language Models

Manuel Cossio

Comments: 55 pages, 16 figures, 3 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[87] arXiv:2508.01812 [pdf, html, other]: Title: HeQ: a Large and Diverse Hebrew Reading Comprehension Benchmark

Amir DN Cohen, Hilla Merhav, Yoav Goldberg, Reut Tsarfaty

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[88] arXiv:2508.01815 [pdf, html, other]: Title: AGENTICT$^2$S: Robust Text-to-SPARQL via Agentic Collaborative Reasoning over Heterogeneous Knowledge Graphs for the Circular Economy

Yang Zhao, Chengxiao Dai, Wei Zhuo, Tan Chuan Fu, Yue Xiu, Dusit Niyato, Jonathan Z. Low, Eugene Ho Hong Zhuang, Daren Zong Loong Tan

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[89] arXiv:2508.01832 [pdf, html, other]: Title: MLP Memory: A Retriever-Pretrained Memory for Large Language Models

Rubin Wei, Jiaqi Cao, Jiarui Wang, Jushi Kai, Qipeng Guo, Bowen Zhou, Zhouhan Lin

Subjects: Computation and Language (cs.CL)
[90] arXiv:2508.01858 [pdf, html, other]: Title: Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents

Yuhan Guo, Cong Guo, Aiwen Sun, Hongliang He, Xinyu Yang, Yue Lu, Yingji Zhang, Xuntao Guo, Dong Zhang, Jianzhuang Liu, Jiang Duan, Yijia Xiao, Liangjian Wen, Hai-Ming Xu, Yong Dai

Comments: Our code and data is open sourced at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[91] arXiv:2508.01862 [pdf, html, other]: Title: Counterfactual Probing for Hallucination Detection and Mitigation in Large Language Models

Yijun Feng

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[92] arXiv:2508.01918 [pdf, other]: Title: Quantum-RAG and PunGPT2: Advancing Low-Resource Language Generation and Retrieval for the Punjabi Language

Jaskaranjeet Singh, Rakesh Thakur

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[93] arXiv:2508.01930 [pdf, html, other]: Title: Word Overuse and Alignment in Large Language Models: The Influence of Learning from Human Feedback

Tom S. Juzek, Zina B. Ward

Comments: Accepted for publication in the Proceedings of the 5th Workshop on Bias and Fairness in AI (BIAS 2025) at ECML PKDD

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[94] arXiv:2508.01943 [pdf, html, other]: Title: ROVER: Recursive Reasoning Over Videos with Vision-Language Models for Embodied Tasks

Philip Schroeder, Ondrej Biza, Thomas Weng, Hongyin Luo, James Glass

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[95] arXiv:2508.01959 [pdf, html, other]: Title: SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension

Junjie Wu, Jiangnan Li, Yuqing Li, Lemao Liu, Liyan Xu, Jiwei Li, Dit-Yan Yeung, Jie Zhou, Mo Yu

Comments: Our trained models can be downloaded from: this https URL

Subjects: Computation and Language (cs.CL)
[96] arXiv:2508.01977 [pdf, html, other]: Title: TIBSTC-CoT: A Multi-Domain Instruction Dataset for Chain-of-Thought Reasoning in Language Models

Fan Gao, Cheng Huang, Nyima Tashi, Yutong Liu, Xiangxiang Wang, Thupten Tsering, Ban Ma-bao, Renzeg Duojie, Gadeng Luosang, Rinchen Dongrub, Dorje Tashi, Xiao Feng, Hao Wang, Yongbin Yu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[97] arXiv:2508.01990 [pdf, html, other]: Title: Contextually Aware E-Commerce Product Question Answering using RAG

Praveen Tangarajan, Anand A. Rajasekar, Manish Rathi, Vinay Rao Dandin, Ozan Ersoy

Comments: 6 pages, 1 figure, 5 tables. Preprint under review

Subjects: Computation and Language (cs.CL)
[98] arXiv:2508.01999 [pdf, html, other]: Title: Prompting Large Language Models to Detect Dementia Family Caregivers

Md Badsha Biswas, Özlem Uzuner

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[99] arXiv:2508.02013 [pdf, html, other]: Title: SpeechRole: A Large-Scale Dataset and Benchmark for Evaluating Speech Role-Playing Agents

Changhao Jiang, Jiajun Sun, Yifei Cao, Jiabao Zhuang, Hui Li, Xiaoran Fan, Ming Zhang, Junjie Ye, Shihan Dou, Zhiheng Xi, Jingqi Tong, Yilong Wu, Baoyu Fan, Zhen Wang, Tao Liang, Zhihui Fei, Mingyang Wan, Guojun Ma, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang

Subjects: Computation and Language (cs.CL)
[100] arXiv:2508.02018 [pdf, html, other]: Title: SpeechR: A Benchmark for Speech Reasoning in Large Audio-Language Models

Wanqi Yang, Yanda Li, Yunchao Wei, Meng Fang, Ling Chen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
