Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CL

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computation and Language

Authors and titles for November 2024

Total of 1311 entries : 1-100 201-300 301-400 401-500 501-600 601-700 701-800 801-900 ... 1301-1311
Showing up to 100 entries per page: fewer | more | all
[501] arXiv:2411.10145 [pdf, other]
Title: An Effective Framework to Help Large Language Models Handle Numeric-involved Long-context Tasks
Yijiong Yu
Subjects: Computation and Language (cs.CL)
[502] arXiv:2411.10163 [pdf, html, other]
Title: Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Yutao Hou, Yajing Luo, Zhiwen Ruan, Hongru Wang, Weifeng Ge, Yun Chen, Guanhua Chen
Subjects: Computation and Language (cs.CL)
[503] arXiv:2411.10172 [pdf, html, other]
Title: Increasing the Accessibility of Causal Domain Knowledge via Causal Information Extraction Methods: A Case Study in the Semiconductor Manufacturing Industry
Houssam Razouk, Leonie Benischke, Daniel Garber, Roman Kern
Comments: 17 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[504] arXiv:2411.10227 [pdf, html, other]
Title: Entropy and type-token ratio in gigaword corpora
Pablo Rosillo-Rodes, Maxi San Miguel, David Sanchez
Comments: 15 pages, 10 figures, 8 tables
Journal-ref: Phys. Rev. Research 7, 033054 (2025)
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Physics and Society (physics.soc-ph)
[505] arXiv:2411.10242 [pdf, other]
Title: Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
Michael Aerni, Javier Rando, Edoardo Debenedetti, Nicholas Carlini, Daphne Ippolito, Florian Tramèr
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[506] arXiv:2411.10298 [pdf, html, other]
Title: Unveiling Topological Structures from Language: A Comprehensive Survey of Topological Data Analysis Applications in NLP
Adaku Uchendu, Thai Le
Subjects: Computation and Language (cs.CL)
[507] arXiv:2411.10328 [pdf, other]
Title: Emotion Detection in Reddit: Comparative Study of Machine Learning and Deep Learning Techniques
Maliheh Alaeddini
Subjects: Computation and Language (cs.CL)
[508] arXiv:2411.10371 [pdf, html, other]
Title: A Survey of Event Causality Identification: Taxonomy, Challenges, Assessment, and Prospects
Qing Cheng, Zefan Zeng, Xingchen Hu, Yuehang Si, Zhong Liu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[509] arXiv:2411.10416 [pdf, html, other]
Title: Towards Automatic Evaluation of Task-Oriented Dialogue Flows
Mehrnoosh Mirtaheri, Nikhil Varghese, Chandra Khatri, Amol Kelkar
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[510] arXiv:2411.10436 [pdf, html, other]
Title: Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization
Yuhan Fu, Ruobing Xie, Xingwu Sun, Zhanhui Kang, Xirong Li
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[511] arXiv:2411.10442 [pdf, html, other]
Title: Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Weiyun Wang, Zhe Chen, Wenhai Wang, Yue Cao, Yangzhou Liu, Zhangwei Gao, Jinguo Zhu, Xizhou Zhu, Lewei Lu, Yu Qiao, Jifeng Dai
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[512] arXiv:2411.10477 [pdf, other]
Title: A Survey on Importance of Homophones Spelling Correction Model for Khmer Authors
Seanghort Born, Madeth May, Claudine Piau-Toffolon, Sébastien Iksal
Subjects: Computation and Language (cs.CL)
[513] arXiv:2411.10533 [pdf, html, other]
Title: On the Compatibility of Generative AI and Generative Linguistics
Eva Portelance, Masoud Jasbi
Subjects: Computation and Language (cs.CL)
[514] arXiv:2411.10541 [pdf, html, other]
Title: Does Prompt Formatting Have Any Impact on LLM Performance?
Jia He, Mukund Rungta, David Koleczek, Arshdeep Sekhon, Franklin X Wang, Sadid Hasan
Comments: Submitted to NAACL 2025
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[515] arXiv:2411.10557 [pdf, html, other]
Title: MLAN: Language-Based Instruction Tuning Preserves and Transfers Knowledge in Multimodal Language Models
Jianhong Tu, Zhuohao Ni, Nicholas Crispino, Zihao Yu, Michael Bendersky, Beliz Gunel, Ruoxi Jia, Xin Liu, Lingjuan Lyu, Dawn Song, Chenguang Wang
Subjects: Computation and Language (cs.CL)
[516] arXiv:2411.10581 [pdf, html, other]
Title: On the Shortcut Learning in Multilingual Neural Machine Translation
Wenxuan Wang, Wenxiang Jiao, Jen-tse Huang, Zhaopeng Tu, Michael R. Lyu
Comments: Accepted by Neurocomputing 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[517] arXiv:2411.10588 [pdf, other]
Title: A dataset of questions on decision-theoretic reasoning in Newcomb-like problems
Caspar Oesterheld, Emery Cooper, Miles Kodama, Linh Chi Nguyen, Ethan Perez
Comments: 48 pages, 15 figures; code and data at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[518] arXiv:2411.10629 [pdf, other]
Title: Leveraging large language models for efficient representation learning for entity resolution
Xiaowei Xu, Bi T. Foua, Xingqiao Wang, Vivek Gunasekaran, John R. Talburt
Comments: 22 pages and 12 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[519] arXiv:2411.10636 [pdf, html, other]
Title: Gender Bias Mitigation for Bangla Classification Tasks
Sajib Kumar Saha Joy, Arman Hassan Mahy, Meherin Sultana, Azizah Mamun Abha, MD Piyal Ahmmed, Yue Dong, G M Shahariar
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[520] arXiv:2411.10666 [pdf, html, other]
Title: SAM Decoding: Speculative Decoding via Suffix Automaton
Yuxuan Hu, Ke Wang, Xiaokang Zhang, Fanjin Zhang, Cuiping Li, Hong Chen, Jing Zhang
Comments: 16 pages, 9 figures, 9 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[521] arXiv:2411.10670 [pdf, html, other]
Title: IntentGPT: Few-shot Intent Discovery with Large Language Models
Juan A. Rodriguez, Nicholas Botzer, David Vazquez, Christopher Pal, Marco Pedersoli, Issam Laradji
Comments: ICLR 2024 Workshop on LLM Agents
Subjects: Computation and Language (cs.CL)
[522] arXiv:2411.10681 [pdf, html, other]
Title: Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines
Yixiang Chen, Xinyu Zhang, Jinran Wang, Xurong Xie, Nan Yan, Hui Chen, Lan Wang
Comments: Accepted to the 16th International Conference on Social Robotic (ICSR 2024)
Subjects: Computation and Language (cs.CL)
[523] arXiv:2411.10724 [pdf, html, other]
Title: HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings
Anton Alekseev, Gulnara Kabaeva
Comments: The translation of the 2023 paper into English
Journal-ref: Herald of KSTU 68(4) (2023)
Subjects: Computation and Language (cs.CL)
[524] arXiv:2411.10730 [pdf, html, other]
Title: Comparison of Multilingual and Bilingual Models for Satirical News Detection of Arabic and English
Omar W. Abdalla, Aditya Joshi, Rahat Masood, Salil S. Kanhere
Comments: ALTA 2024 (Selected for publication)
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[525] arXiv:2411.10761 [pdf, html, other]
Title: Can Generic LLMs Help Analyze Child-adult Interactions Involving Children with Autism in Clinical Observation?
Tiantian Feng, Anfeng Xu, Rimita Lahiri, Helen Tager-Flusberg, So Hyun Kim, Somer Bishop, Catherine Lord, Shrikanth Narayanan
Comments: GenAI for Health Workshop, NeurIPS 2024
Subjects: Computation and Language (cs.CL)
[526] arXiv:2411.10813 [pdf, html, other]
Title: Information Anxiety in Large Language Models
Prasoon Bajpai, Sarah Masud, Tanmoy Chakraborty
Subjects: Computation and Language (cs.CL)
[527] arXiv:2411.10869 [pdf, other]
Title: Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm
Sari Masri, Huthaifa I. Ashqar, Mohammed Elhenawy
Comments: The data and code that support the findings of this study are openly available in Zenodo at this https URL, reference number 14171745
Subjects: Computation and Language (cs.CL); Computational Engineering, Finance, and Science (cs.CE); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)
[528] arXiv:2411.10878 [pdf, html, other]
Title: Empowering Meta-Analysis: Leveraging Large Language Models for Scientific Synthesis
Jawad Ibn Ahad, Rafeed Mohammad Sultan, Abraham Kaikobad, Fuad Rahman, Mohammad Ruhul Amin, Nabeel Mohammed, Shafin Rahman
Comments: Accepted in 2024 IEEE International Conference on Big Data (IEEE BigData)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[529] arXiv:2411.10879 [pdf, html, other]
Title: BanglaDialecto: An End-to-End AI-Powered Regional Speech Standardization
Md. Nazmus Sadat Samin, Jawad Ibn Ahad, Tanjila Ahmed Medha, Fuad Rahman, Mohammad Ruhul Amin, Nabeel Mohammed, Shafin Rahman
Comments: Accepted in 2024 IEEE International Conference on Big Data (IEEE BigData)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[530] arXiv:2411.10912 [pdf, html, other]
Title: SPICA: Retrieving Scenarios for Pluralistic In-Context Alignment
Quan Ze Chen, K.J. Kevin Feng, Chan Young Park, Amy X. Zhang
Subjects: Computation and Language (cs.CL)
[531] arXiv:2411.10914 [pdf, html, other]
Title: BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment
Sizhe Wang, Yongqi Tong, Hengyuan Zhang, Dawei Li, Xin Zhang, Tianlong Chen
Comments: The 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025)- Main Conference
Subjects: Computation and Language (cs.CL)
[532] arXiv:2411.10915 [pdf, html, other]
Title: Bias in Large Language Models: Origin, Evaluation, and Mitigation
Yufei Guo, Muzhe Guo, Juntao Su, Zhou Yang, Mengqiu Zhu, Hongfei Li, Mengyang Qiu, Shuo Shuo Liu
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[533] arXiv:2411.10927 [pdf, other]
Title: Inter-linguistic Phonetic Composition (IPC): A Theoretical and Computational Approach to Enhance Second Language Pronunciation
Jisang Park, Minu Kim, DaYoung Hong, Jongha Lee
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[534] arXiv:2411.10928 [pdf, html, other]
Title: Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning
Wenke Huang, Jian Liang, Zekun Shi, Didi Zhu, Guancheng Wan, He Li, Bo Du, Dacheng Tao, Mang Ye
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[535] arXiv:2411.10934 [pdf, html, other]
Title: Analyzing Pokémon and Mario Streamers' Twitch Chat with LLM-based User Embeddings
Mika Hämäläinen, Jack Rueter, Khalid Alnajjar
Comments: NLP4DH 2024
Subjects: Computation and Language (cs.CL)
[536] arXiv:2411.10950 [pdf, html, other]
Title: Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering
Zeping Yu, Sophia Ananiadou
Comments: preprint
Subjects: Computation and Language (cs.CL)
[537] arXiv:2411.10954 [pdf, html, other]
Title: Dialectal Toxicity Detection: Evaluating LLM-as-a-Judge Consistency Across Language Varieties
Fahim Faisal, Md Mushfiqur Rahman, Antonios Anastasopoulos
Subjects: Computation and Language (cs.CL)
[538] arXiv:2411.10955 [pdf, other]
Title: A Topic-aware Comparable Corpus of Chinese Variations
Da-Chen Lian, Shu-Kai Hsieh
Comments: 4 pages, 4 figures, presented at APCLC2018: ASIA-PACIFIC CORPUS LINGUISTICS CONFERENCE 2018
Subjects: Computation and Language (cs.CL)
[539] arXiv:2411.11027 [pdf, html, other]
Title: BianCang: A Traditional Chinese Medicine Large Language Model
Sibo Wei, Xueping Peng, Yi-fei Wang, Jiasheng Si, Weiyu Zhang, Wenpeng Lu, Xiaoming Wu, Yinglong Wang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[540] arXiv:2411.11053 [pdf, html, other]
Title: SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation
Bin Xu, Yiguan Lin, Yinghao Li, Yang Gao
Comments: Accepted by IJCAI2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[541] arXiv:2411.11055 [pdf, html, other]
Title: FastDraft: How to Train Your Draft
Ofir Zafrir, Igor Margulis, Dorin Shteyman, Shira Guskin, Guy Boudoukh
Comments: Accepted at ACL 2025
Subjects: Computation and Language (cs.CL)
[542] arXiv:2411.11061 [pdf, html, other]
Title: Beyond Human-Like Processing: Large Language Models Perform Equivalently on Forward and Backward Scientific Text
Xiaoliang Luo, Michael Ramscar, Bradley C. Love
Subjects: Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[543] arXiv:2411.11072 [pdf, html, other]
Title: Multilingual Large Language Models: A Systematic Survey
Shaolin Zhu, Supryadi, Shaoyang Xu, Haoran Sun, Leiyu Pan, Menglong Cui, Jiangcun Du, Renren Jin, António Branco, Deyi Xiong
Subjects: Computation and Language (cs.CL)
[544] arXiv:2411.11081 [pdf, html, other]
Title: The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection
Tomas Horych, Christoph Mandl, Terry Ruas, Andre Greiner-Petter, Bela Gipp, Akiko Aizawa, Timo Spinde
Subjects: Computation and Language (cs.CL)
[545] arXiv:2411.11171 [pdf, html, other]
Title: LLäMmlein: Transparent, Compact and Competitive German-Only Language Models from Scratch
Jan Pfister, Julia Wunderle, Andreas Hotho
Comments: camera ready @ACL25; this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[546] arXiv:2411.11206 [pdf, html, other]
Title: Capturing Sparks of Abstraction for the ARC Challenge
Martin Andrews
Comments: Submitted as a paper entry for the 2024 ARC Prize
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[547] arXiv:2411.11235 [pdf, html, other]
Title: MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis
Yingjie Zhou, Zicheng Zhang, Jiezhang Cao, Jun Jia, Yanwei Jiang, Farong Wen, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[548] arXiv:2411.11247 [pdf, html, other]
Title: ZeFaV: Boosting Large Language Models for Zero-shot Fact Verification
Son T. Luu, Hiep Nguyen, Trung Vo, Le-Minh Nguyen
Comments: This pre-print has been published in PRICAI 2024: Trends in Artificial Intelligence. The published version is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[549] arXiv:2411.11260 [pdf, other]
Title: Large corpora and large language models: a replicable method for automating grammatical annotation
Cameron Morin, Matti Marttinen Larsson
Journal-ref: Linguistics Vanguard, 1-10 (2025)
Subjects: Computation and Language (cs.CL)
[550] arXiv:2411.11266 [pdf, html, other]
Title: VersaTune: An Efficient Data Composition Framework for Training Multi-Capability LLMs
Keer Lu, Keshi Zhao, Zhuoran Zhang, Zheng Liang, Da Pan, Shusen Zhang, Xin Wu, Guosheng Dong, Bin Cui, Tengjiao Wang, Wentao Zhang
Subjects: Computation and Language (cs.CL)
[551] arXiv:2411.11289 [pdf, html, other]
Title: LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models
Yungi Kim, Hyunsoo Ha, Seonghoon Yang, Sukyung Lee, Jihoo Kim, Chanjun Park
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[552] arXiv:2411.11295 [pdf, html, other]
Title: Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
Peng Shu, Junhao Chen, Zhengliang Liu, Hui Wang, Zihao Wu, Tianyang Zhong, Yiwei Li, Huaqin Zhao, Hanqi Jiang, Yi Pan, Yifan Zhou, Constance Owl, Xiaoming Zhai, Ninghao Liu, Claudio Saunt, Tianming Liu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[553] arXiv:2411.11344 [pdf, other]
Title: Mitigating Knowledge Conflicts in Language Model-Driven Question Answering
Han Cao, Zhaoyang Zhang, Xiangtian Li, Chufan Wu, Hansong Zhang, Wenqing Zhang
Comments: revised version, more figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[554] arXiv:2411.11371 [pdf, html, other]
Title: Rethinking Thinking Tokens: Understanding Why They Underperform in Practice
Sreeram Vennam, David Valente, David Herel, Ponnurangam Kumaraguru
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[555] arXiv:2411.11424 [pdf, html, other]
Title: Membership Inference Attack against Long-Context Large Language Models
Zixiong Wang, Gaoyang Liu, Yang Yang, Chen Wang
Subjects: Computation and Language (cs.CL)
[556] arXiv:2411.11479 [pdf, html, other]
Title: Value-Spectrum: Quantifying Preferences of Vision-Language Models via Value Decomposition in Social Media Contexts
Jingxuan Li, Yuning Yang, Shengqi Yang, Linfan Zhang, Ying Nian Wu
Comments: ACL 2025 main
Subjects: Computation and Language (cs.CL)
[557] arXiv:2411.11496 [pdf, html, other]
Title: Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models
Chenhang Cui, Gelei Deng, An Zhang, Jingnan Zheng, Yicong Li, Lianli Gao, Tianwei Zhang, Tat-Seng Chua
Subjects: Computation and Language (cs.CL)
[558] arXiv:2411.11531 [pdf, html, other]
Title: Addressing Hallucinations in Language Models with Knowledge Graph Embeddings as an Additional Modality
Viktoriia Chekalina, Anton Razzhigaev, Elizaveta Goncharova, Andrey Kuznetsov
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[559] arXiv:2411.11581 [pdf, html, other]
Title: OASIS: Open Agent Social Interaction Simulations with One Million Agents
Ziyi Yang, Zaibin Zhang, Zirui Zheng, Yuxian Jiang, Ziyue Gan, Zhiyu Wang, Zijian Ling, Jinsong Chen, Martz Ma, Bowen Dong, Prateek Gupta, Shuyue Hu, Zhenfei Yin, Guohao Li, Xu Jia, Lijun Wang, Bernard Ghanem, Huchuan Lu, Chaochao Lu, Wanli Ouyang, Yu Qiao, Philip Torr, Jing Shao
Subjects: Computation and Language (cs.CL)
[560] arXiv:2411.11623 [pdf, html, other]
Title: Federated Incremental Named Entity Recognition
Duzhen Zhang, Yahan Yu, Chenxing Li, Jiahua Dong, Dong Yu
Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing
Subjects: Computation and Language (cs.CL)
[561] arXiv:2411.11635 [pdf, other]
Title: Chapter 7 Review of Data-Driven Generative AI Models for Knowledge Extraction from Scientific Literature in Healthcare
Leon Kopitar, Primoz Kocbek, Lucija Gosak, Gregor Stiglic
Comments: 16 pages, 5 figures, 1 table
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[562] arXiv:2411.11694 [pdf, html, other]
Title: Enhancing LLM Reasoning with Reward-guided Tree Search
Jinhao Jiang, Zhipeng Chen, Yingqian Min, Jie Chen, Xiaoxue Cheng, Jiapeng Wang, Yiru Tang, Haoxiang Sun, Jia Deng, Wayne Xin Zhao, Zheng Liu, Dong Yan, Jian Xie, Zhongyuan Wang, Ji-Rong Wen
Comments: Technical Report on Slow Thinking with LLMs: I
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[563] arXiv:2411.11707 [pdf, html, other]
Title: FedCoLLM: A Parameter-Efficient Federated Co-tuning Framework for Large and Small Language Models
Tao Fan, Yan Kang, Guoqiang Ma, Lixin Fan, Kai Chen, Qiang Yang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[564] arXiv:2411.11731 [pdf, html, other]
Title: Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment
Allison Huang, Yulu Niki Pi, Carlos Mougan
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[565] arXiv:2411.11736 [pdf, html, other]
Title: Advacheck at GenAI Detection Task 1: AI Detection Powered by Domain-Aware Multi-Tasking
German Gritsai, Anastasia Voznyuk, Ildar Khabutdinov, Andrey Grabovoy
Subjects: Computation and Language (cs.CL)
[566] arXiv:2411.11770 [pdf, html, other]
Title: CNMBERT: A Model for Converting Hanyu Pinyin Abbreviations to Chinese Characters
Zishuo Feng, Feng Cao
Comments: 8 pages, 5 figures, 8 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[567] arXiv:2411.11843 [pdf, html, other]
Title: Bi-Mamba: Towards Accurate 1-Bit State Space Models
Shengkun Tang, Liqun Ma, Haonan Li, Mingjie Sun, Zhiqiang Shen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[568] arXiv:2411.11984 [pdf, html, other]
Title: Understanding Chain-of-Thought in LLMs through Information Theory
Jean-Francois Ton, Muhammad Faaiz Taufiq, Yang Liu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[569] arXiv:2411.12000 [pdf, html, other]
Title: ByteScience: Bridging Unstructured Scientific Literature and Structured Data with Auto Fine-tuned Large Language Model in Token Granularity
Tong Xie, Hanzhi Zhang, Shaozhou Wang, Yuwei Wan, Imran Razzak, Chunyu Kit, Wenjie Zhang, Bram Hoex
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[570] arXiv:2411.12056 [pdf, html, other]
Title: Benchmarking pre-trained text embedding models in aligning built asset information
Mehrzad Shahinmoghadam, Ali Motamedi
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[571] arXiv:2411.12074 [pdf, html, other]
Title: Mitigating Gender Bias in Contextual Word Embeddings
Navya Yarrabelly, Vinay Damodaran, Feng-Guang Su
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[572] arXiv:2411.12103 [pdf, html, other]
Title: Does Unlearning Truly Unlearn? A Black Box Evaluation of LLM Unlearning Methods
Jai Doshi, Asa Cooper Stickland
Comments: 9 pages, 2 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[573] arXiv:2411.12142 [pdf, html, other]
Title: A Computational Method for Measuring "Open Codes" in Qualitative Analysis
John Chen, Alexandros Lotsos, Lexie Zhao, Caiyi Wang, Jessica Hullman, Bruce Sherin, Uri Wilensky, Michael Horn
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[574] arXiv:2411.12147 [pdf, html, other]
Title: JuniperLiu at CoMeDi Shared Task: Models as Annotators in Lexical Semantics Disagreements
Zhu Liu, Zhen Hu, Ying Liu
Comments: accepted by CoMeDi workshop in Coling 2025
Subjects: Computation and Language (cs.CL)
[575] arXiv:2411.12156 [pdf, html, other]
Title: HNCSE: Advancing Sentence Embeddings via Hybrid Contrastive Learning with Hard Negatives
Wenxiao Liu, Zihong Yang, Chaozhuo Li, Zijin Hong, Jianfeng Ma, Zhiquan Liu, Litian Zhang, Feiran Huang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[576] arXiv:2411.12157 [pdf, other]
Title: A Combined Encoder and Transformer Approach for Coherent and High-Quality Text Generation
Jiajing Chen, Shuo Wang, Zhen Qi, Zhenhong Zhang, Chihang Wang, Hongye Zheng
Subjects: Computation and Language (cs.CL)
[577] arXiv:2411.12240 [pdf, html, other]
Title: Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages
S. Tamang, D. J. Bora
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[578] arXiv:2411.12254 [pdf, html, other]
Title: Predicting User Intents and Musical Attributes from Music Discovery Conversations
Daeyong Kwon, SeungHeon Doh, Juhan Nam
Comments: 8 pages, 4 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[579] arXiv:2411.12262 [pdf, html, other]
Title: Low-resource Machine Translation: what for? who for? An observational study on a dedicated Tetun language translation service
Raphael Merx, Adérito José Guterres Correia, Hanna Suominen, Ekaterina Vylomova
Comments: to be published in LoResMT 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[580] arXiv:2411.12287 [pdf, html, other]
Title: CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model
Dongyoung Go, Taesun Whang, Chanhee Lee, Hwa-Yeon Kim, Sunghoon Park, Seunghwan Ji, Jinho Kim, Dongchan Kim, Young-Bum Kim
Comments: Preprint. Under review
Subjects: Computation and Language (cs.CL)
[581] arXiv:2411.12307 [pdf, html, other]
Title: Balancing Accuracy and Efficiency in Multi-Turn Intent Classification for LLM-Powered Dialog Systems in Production
Junhua Liu, Yong Keat Tan, Bin Fu, Kwan Hui Lim
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[582] arXiv:2411.12372 [pdf, html, other]
Title: RedPajama: an Open Dataset for Training Large Language Models
Maurice Weber, Daniel Fu, Quentin Anthony, Yonatan Oren, Shane Adams, Anton Alexandrov, Xiaozhong Lyu, Huu Nguyen, Xiaozhe Yao, Virginia Adams, Ben Athiwaratkun, Rahul Chalamala, Kezhen Chen, Max Ryabinin, Tri Dao, Percy Liang, Christopher Ré, Irina Rish, Ce Zhang
Comments: 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[583] arXiv:2411.12395 [pdf, html, other]
Title: Do LLMs Understand Ambiguity in Text? A Case Study in Open-world Question Answering
Aryan Keluskar, Amrita Bhattacharjee, Huan Liu
Comments: Accepted at the REU Symposium at IEEE BigData 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[584] arXiv:2411.12405 [pdf, html, other]
Title: Evaluating the Prompt Steerability of Large Language Models
Erik Miehling, Michael Desmond, Karthikeyan Natesan Ramamurthy, Elizabeth M. Daly, Pierre Dognin, Jesus Rios, Djallel Bouneffouf, Miao Liu
Comments: Short version appeared at the Pluralistic Alignment workshop at NeurIPS 2024; extended version appeared at NAACL 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[585] arXiv:2411.12449 [pdf, html, other]
Title: Neon: News Entity-Interaction Extraction for Enhanced Question Answering
Sneha Singhania, Silviu Cucerzan, Allen Herring, Sujay Kumar Jauhar
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[586] arXiv:2411.12458 [pdf, html, other]
Title: Variation between Credible and Non-Credible News Across Topics
Emilie Francis
Comments: 9 pages, 1 figure
Journal-ref: The First International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS 2024), 86-96 (2024)
Subjects: Computation and Language (cs.CL)
[587] arXiv:2411.12460 [pdf, html, other]
Title: Exploring Iterative Controllable Summarization with Large Language Models
Sangwon Ryu, Heejin Do, Daehee Kim, Hwanjo Yu, Dongwoo Kim, Yunsu Kim, Gary Geunbae Lee, Jungseul Ok
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[588] arXiv:2411.12473 [pdf, html, other]
Title: NMT-Obfuscator Attack: Ignore a sentence in translation with only one word
Sahar Sadrizadeh, César Descalzo, Ljiljana Dolamic, Pascal Frossard
Subjects: Computation and Language (cs.CL)
[589] arXiv:2411.12493 [pdf, other]
Title: Eradicating Social Biases in Sentiment Analysis using Semantic Blinding and Semantic Propagation Graph Neural Networks
Hubert Plisiecki
Subjects: Computation and Language (cs.CL)
[590] arXiv:2411.12580 [pdf, html, other]
Title: Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Laura Ruis, Maximilian Mozes, Juhan Bae, Siddhartha Rao Kamalakara, Dwarak Talupuru, Acyr Locatelli, Robert Kirk, Tim Rocktäschel, Edward Grefenstette, Max Bartolo
Comments: Published at ICLR 2025
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[591] arXiv:2411.12587 [pdf, other]
Title: Whisper Finetuning on Nepali Language
Sanjay Rijal, Shital Adhikari, Manish Dahal, Manish Awale, Vaghawan Ojha
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[592] arXiv:2411.12685 [pdf, html, other]
Title: Enhanced Sign Language Translation between American Sign Language (ASL) and Indian Sign Language (ISL) Using LLMs
Malay Kumar, S. Sarvajit Visagan, Tanish Sarang Mahajan, Anisha Natarajan
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[593] arXiv:2411.12703 [pdf, html, other]
Title: Strengthening False Information Propagation Detection: Leveraging SVM and Sophisticated Text Vectorization Techniques in comparison to BERT
Ahmed Akib Jawad Karim, Kazi Hafiz Md Asad, Aznur Azam
Comments: 6 pages, 3 tables and 6 Figures. Submitted to a conference
Subjects: Computation and Language (cs.CL)
[594] arXiv:2411.12712 [pdf, html, other]
Title: Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs
Ahmed Akib Jawad Karim, Muhammad Zawad Mahmud, Samiha Islam, Aznur Azam
Comments: 7 Pages, 4 tables and 11 figures. Under review in a IEEE conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[595] arXiv:2411.12719 [pdf, html, other]
Title: Rethinking MUSHRA: Addressing Modern Challenges in Text-to-Speech Evaluation
Praveen Srinivasa Varadhan, Amogh Gulati, Ashwin Sankar, Srija Anand, Anirudh Gupta, Anirudh Mukherjee, Shiva Kumar Marepally, Ankur Bhatia, Saloni Jaju, Suvrat Bhooshan, Mitesh M. Khapra
Comments: Accepted in TMLR
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[596] arXiv:2411.12720 [pdf, html, other]
Title: Scaling laws for nonlinear dynamical models of articulatory control
Sam Kirkham
Comments: Updated title and minor changes to text after first round of reviews
Journal-ref: JASA Express Lett. 5, 025201 (2025)
Subjects: Computation and Language (cs.CL)
[597] arXiv:2411.12728 [pdf, other]
Title: Information Theory of Meaningful Communication
Doron Sivan, Misha Tsodyks
Subjects: Computation and Language (cs.CL); Information Theory (cs.IT)
[598] arXiv:2411.12736 [pdf, html, other]
Title: ACING: Actor-Critic for Instruction Learning in Black-Box Large Language Models
Salma Kharrat, Fares Fourati, Marco Canini
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC)
[599] arXiv:2411.12758 [pdf, html, other]
Title: An exploration of the effect of quantisation on energy consumption and inference time of StarCoder2
Pepijn de Reus, Ana Oprescu, Jelle Zuidema
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[600] arXiv:2411.12759 [pdf, other]
Title: A Novel Approach to Eliminating Hallucinations in Large Language Model-Assisted Causal Discovery
Grace Sng, Yanming Zhang, Klaus Mueller
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Total of 1311 entries : 1-100 201-300 301-400 401-500 501-600 601-700 701-800 801-900 ... 1301-1311
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack