Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.IR

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Information Retrieval

Authors and titles for recent submissions

  • Tue, 14 Oct 2025
  • Mon, 13 Oct 2025
  • Fri, 10 Oct 2025
  • Thu, 9 Oct 2025
  • Wed, 8 Oct 2025

See today's new changes

Total of 93 entries : 1-50 51-93 58-93
Showing up to 50 entries per page: fewer | more | all

Fri, 10 Oct 2025 (showing 16 of 16 entries )

[58] arXiv:2510.08281 [pdf, html, other]
Title: Mobile Gamer Lifetime Value Prediction via Objective Decomposition and Reconstruction
Tianwei Li, Yu Zhao, Yunze Li, Sheng Li
Comments: 6 pages, 6 figures
Subjects: Information Retrieval (cs.IR)
[59] arXiv:2510.08252 [pdf, html, other]
Title: ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval
Jianlyu Chen, Junwei Lan, Chaofan Li, Defu Lian, Zheng Liu
Comments: 17 pages, 3 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[60] arXiv:2510.08109 [pdf, html, other]
Title: VersionRAG: Version-Aware Retrieval-Augmented Generation for Evolving Documents
Daniel Huwiler, Kurt Stockinger, Jonathan Fürst
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[61] arXiv:2510.08048 [pdf, html, other]
Title: TaoSR-AGRL: Adaptive Guided Reinforcement Learning Framework for E-commerce Search Relevance
Jianhui Yang, Yiming Jin, Pengkun Jiao, Chenhe Dong, Zerui Huang, Shaowei Yao, Xiaojiang Zhou, Dan Ou, Haihong Tang
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[62] arXiv:2510.07885 [pdf, other]
Title: Generation and annotation of item usage scenarios in e-commerce using large language models
Madoka Hagiri, Kazushi Okamoto, Koki Karube, Kei Harada, Atsushi Shibata
Journal-ref: The 26th International Symposium on Advanced Intelligent Systems (ISIS 2025)
Subjects: Information Retrieval (cs.IR)
[63] arXiv:2510.07784 [pdf, html, other]
Title: PLUM: Adapting Pre-trained Language Models for Industrial-scale Generative Recommendations
Ruining He, Lukasz Heldt, Lichan Hong, Raghunandan Keshavan, Shifan Mao, Nikhil Mehta, Zhengyang Su, Alicia Tsai, Yueqi Wang, Shao-Chuan Wang, Xinyang Yi, Lexi Baugher, Baykal Cakici, Ed Chi, Cristos Goodrow, Ningren Han, He Ma, Romer Rosales, Abby Van Soest, Devansh Tandon, Su-Lin Wu, Weilong Yang, Yilin Zheng
Comments: 11 pages, 6 figures
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[64] arXiv:2510.07728 [pdf, html, other]
Title: Who Stole Your Data? A Method for Detecting Unauthorized RAG Theft
Peiyang Liu, Ziqiang Cui, Di Liang, Wei Ye
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[65] arXiv:2510.07720 [pdf, html, other]
Title: Queries Are Not Alone: Clustering Text Embeddings for Video Search
Peyang Liu, Xi Wang, Ziqiang Cui, Wei Ye
Comments: Accepted by International ACM SIGIR Conference on Research and Development in Information Retrieval 2025
Subjects: Information Retrieval (cs.IR)
[66] arXiv:2510.07644 [pdf, html, other]
Title: ISMIE: A Framework to Characterize Information Seeking in Modern Information Environments
Shuoqi Sun, Danula Hettiachchi, Damiano Spina
Comments: This paper has been accepted to SIGIR-AP 2025
Subjects: Information Retrieval (cs.IR)
[67] arXiv:2510.07621 [pdf, html, other]
Title: Retentive Relevance: Capturing Long-Term User Value in Recommendation Systems
Saeideh Bakhshi, Phuong Mai Nguyen, Robert Schiller, Tiantian Xu, Pawan Kodandapani, Andrew Levine, Cayman Simpson, Qifan Wang
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[68] arXiv:2510.07484 [pdf, html, other]
Title: Reasoning by Exploration: A Unified Approach to Retrieval and Generation over Graphs
Haoyu Han, Kai Guo, Harry Shomer, Yu Wang, Yucheng Chu, Hang Li, Li Ma, Jiliang Tang
Subjects: Information Retrieval (cs.IR)
[69] arXiv:2510.08558 (cross-list from cs.AI) [pdf, other]
Title: Agent Learning via Early Experience
Kai Zhang, Xiangchao Chen, Bo Liu, Tianci Xue, Zeyi Liao, Zhihan Liu, Xiyao Wang, Yuting Ning, Zhaorun Chen, Xiaohan Fu, Jian Xie, Yuxuan Sun, Boyu Gou, Qi Qi, Zihang Meng, Jianwei Yang, Ning Zhang, Xian Li, Ashish Shah, Dat Huynh, Hengduo Li, Zi Yang, Sara Cao, Lawrence Jang, Shuyan Zhou, Jiacheng Zhu, Huan Sun, Jason Weston, Yu Su, Yifan Wu
Comments: Work in progress
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[70] arXiv:2510.08385 (cross-list from cs.CV) [pdf, html, other]
Title: Detecting Legend Items on Historical Maps Using GPT-4o with In-Context Learning
Sofia Kirsanova, Yao-Yi Chiang, Weiwei Duan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[71] arXiv:2510.07796 (cross-list from cs.LG) [pdf, html, other]
Title: HySim-LLM: Embedding-Weighted Fine-Tuning Bounds and Manifold Denoising for Domain-Adapted LLMs
Majid Jaberi-Douraki, Hossein Sholehrasa, Xuan Xu, Remya Ampadi Ramachandran
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR)
[72] arXiv:2510.07489 (cross-list from cs.AI) [pdf, other]
Title: Evaluation of LLMs for Process Model Analysis and Optimization
Akhil Kumar, Jianliang Leon Zhao, Om Dobariya
Comments: 15 pages, 5 tables, 4 figures; full research paper currently under review for the Workshop on Information Technologies and Systems (WITS) 2025. The paper presents a comprehensive evaluation of large language models (LLMs) for business process model analysis and optimization, including error detection, reasoning, and scenario-based redesign
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[73] arXiv:2510.07414 (cross-list from cs.CL) [pdf, html, other]
Title: Haystack Engineering: Context Engineering for Heterogeneous and Agentic Long-Context Evaluation
Mufei Li, Dongqi Fu, Limei Wang, Si Zhang, Hanqing Zeng, Kaan Sancak, Ruizhong Qiu, Haoyu Wang, Xiaoxin He, Xavier Bresson, Yinglong Xia, Chonglin Sun, Pan Li
Comments: Code available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)

Thu, 9 Oct 2025 (showing 11 of 11 entries )

[74] arXiv:2510.06924 [pdf, other]
Title: Ethical AI prompt recommendations in large language models using collaborative filtering
Jordan Nelson, Almas Baimagambetov, Konstantinos Avgerinakis, Nikolaos Polatidis
Comments: This paper has been accepted to by the International Journal of Parallel, Emergent & Distributed Systems (Taylor and Francis) and has an assigned DOI. We have already chose to make this open access using CC BY. The article is not yet available online on the publisher's website. The DOI is: this http URL
Subjects: Information Retrieval (cs.IR)
[75] arXiv:2510.06888 [pdf, html, other]
Title: M3Retrieve: Benchmarking Multimodal Retrieval for Medicine
Arkadeep Acharya, Akash Ghosh, Pradeepika Verma, Kitsuchart Pasupa, Sriparna Saha, Priti Singh
Comments: EMNLP Mains 2025
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[76] arXiv:2510.06838 [pdf, html, other]
Title: Crossing Domains without Labels: Distant Supervision for Term Extraction
Elena Senger, Yuri Campbell, Rob van der Goot, Barbara Plank
Comments: Accepted at EMNLP Industry Track 2025
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[77] arXiv:2510.06728 [pdf, html, other]
Title: Reproducing and Extending Causal Insights Into Term Frequency Computation in Neural Rankers
Cile van Marken, Roxana Petcu
Comments: 10 pages, 6 figures, submitted to SIGIR-AP
Subjects: Information Retrieval (cs.IR)
[78] arXiv:2510.06658 [pdf, html, other]
Title: Can We Hide Machines in the Crowd? Quantifying Equivalence in LLM-in-the-loop Annotation Tasks
Jiaman He, Zikang Leng, Dana McKay, Damiano Spina, Johanne R. Trippas
Comments: Accepted at SIGIR-AP 2025
Subjects: Information Retrieval (cs.IR)
[79] arXiv:2510.06657 [pdf, html, other]
Title: LLM-Powered Nuanced Video Attribute Annotation for Enhanced Recommendations
Boyuan Long, Yueqi Wang, Hiloni Mehta, Mick Zomnir, Omkar Pathak, Changping Meng, Ruolin Jia, Yajun Peng, Dapeng Hong, Xia Wu, Mingyan Gao, Onkar Dalal, Ningren Han
Comments: RecSys 2025 Industry Track
Subjects: Information Retrieval (cs.IR)
[80] arXiv:2510.06999 (cross-list from cs.CL) [pdf, html, other]
Title: Towards Reliable Retrieval in RAG Systems for Large Legal Datasets
Markus Reuter, Tobias Lingenberg, Rūta Liepiņa, Francesca Lagioia, Marco Lippi, Giovanni Sartor, Andrea Passerini, Burcu Sayin
Comments: Accepted for the 7th Natural Legal Language Processing Workshop (NLLP 2025), co-located with EMNLP 2025
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[81] arXiv:2510.06987 (cross-list from cs.LG) [pdf, other]
Title: Spiral Model Technique For Data Science & Machine Learning Lifecycle
Rohith Mahadevan
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Software Engineering (cs.SE)
[82] arXiv:2510.06823 (cross-list from cs.CR) [pdf, html, other]
Title: Exposing Citation Vulnerabilities in Generative Engines
Riku Mochizuki, Shusuke Komatsu, Souta Noguchi, Kazuto Ataka
Comments: 12 pages, under-reviewing at a conference
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[83] arXiv:2510.06805 (cross-list from cs.CL) [pdf, html, other]
Title: Overview of the Plagiarism Detection Task at PAN 2025
André Greiner-Petter, Maik Fröbe, Jan Philip Wahle, Terry Ruas, Bela Gipp, Akiko Aizawa, Martin Potthast
Comments: Working Notes at PAN at CLEF 2025
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[84] arXiv:2510.06732 (cross-list from cs.CL) [pdf, html, other]
Title: Are LLMs Reliable Rankers? Rank Manipulation via Two-Stage Token Optimization
Tiancheng Xing, Jerry Li, Yixuan Du, Xiyang Hu
Comments: 10 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)

Wed, 8 Oct 2025 (showing 9 of 9 entries )

[85] arXiv:2510.05952 [pdf, html, other]
Title: How public datasets constrain the development of diversity-aware news recommender systems, and what law could do about it
Max van Drunen, Sanne Vrijenhoek
Subjects: Information Retrieval (cs.IR)
[86] arXiv:2510.05624 [pdf, html, other]
Title: Limitations of Current Evaluation Practices for Conversational Recommender Systems and the Potential of User Simulation
Nolwenn Bernard, Krisztian Balog
Comments: Proceedings of the 2025 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region (SIGIR-AP 2025), December 7--10, 2025, Xi'an, China
Subjects: Information Retrieval (cs.IR)
[87] arXiv:2510.05598 [pdf, html, other]
Title: AgentDR Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents
Mingdai Yang, Nurendra Choudhary, Jiangshu Du, Edward W.Huang, Philip S.Yu, Karthik Subbian, Danai Kourta
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[88] arXiv:2510.05495 [pdf, html, other]
Title: Automated Research Article Classification and Recommendation Using NLP and ML
Shadikur Rahman, Hasibul Karim Shanto, Umme Ayman Koana, Syed Muhammad Danish
Comments: 8 pages, 4 figures, Accepted in Foundation and Large Language Models (FLLM2025)
Subjects: Information Retrieval (cs.IR)
[89] arXiv:2510.05396 [pdf, html, other]
Title: Scalable In-context Ranking with Generative Models
Nilesh Gupta, Chong You, Srinadh Bhojanapalli, Sanjiv Kumar, Inderjit Dhillon, Felix Yu
Journal-ref: Neurips 2025
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[90] arXiv:2510.06198 (cross-list from cs.CL) [pdf, html, other]
Title: Peeking inside the Black-Box: Reinforcement Learning for Explainable and Accurate Relation Extraction
Xinyu Guo, Zhengliang Shi, Minglai Yang, Mahdi Rahimi, Mihai Surdeanu
Comments: Working in process
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[91] arXiv:2510.06002 (cross-list from cs.AI) [pdf, html, other]
Title: Deterministic Legal Retrieval: An Action API for Querying the SAT-Graph RAG
Hudson de Martim
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[92] arXiv:2510.05524 (cross-list from cs.CL) [pdf, html, other]
Title: KEO: Knowledge Extraction on OMIn via Knowledge Graphs and RAG for Safety-Critical Aviation Maintenance
Kuangshi Ai, Jonathan A. Karr Jr, Meng Jiang, Nitesh V. Chawla, Chaoli Wang
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[93] arXiv:2510.05121 (cross-list from cs.CL) [pdf, html, other]
Title: Towards Structured Knowledge: Advancing Triple Extraction from Regional Trade Agreements using Large Language Models
Durgesh Nandini, Rebekka Koch, Mirco Schoenfeld
Subjects: Computation and Language (cs.CL); Computational Engineering, Finance, and Science (cs.CE); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Total of 93 entries : 1-50 51-93 58-93
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack