Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.IR

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Information Retrieval

Authors and titles for recent submissions

  • Fri, 3 Oct 2025
  • Thu, 2 Oct 2025
  • Wed, 1 Oct 2025
  • Tue, 30 Sep 2025
  • Mon, 29 Sep 2025

See today's new changes

Total of 96 entries : 1-50 51-96
Showing up to 50 entries per page: fewer | more | all

Tue, 30 Sep 2025 (showing 28 of 28 entries )

[51] arXiv:2509.24869 [pdf, html, other]
Title: Retro*: Optimizing LLMs for Reasoning-Intensive Document Retrieval
Junwei Lan, Jianlyu Chen, Zheng Liu, Chaofan Li, Siqi Bao, Defu Lian
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[52] arXiv:2509.24632 [pdf, html, other]
Title: UniDex: Rethinking Search Inverted Indexing with Unified Semantic Modeling
Zan Li, Jiahui Chen, Yuan Chai, Xiaoze Jiang, Xiaohua Qi, Zhiheng Qin, Runbin Zhou, Shun Zuo, Guangchao Hao, Kefeng Wang, Jingshan Lv, Yupeng Huang, Xiao Liang, Han Li
Comments: 11 pages, 6 figures and 5 tables
Subjects: Information Retrieval (cs.IR)
[53] arXiv:2509.24424 [pdf, html, other]
Title: Multi-Item-Query Attention for Stable Sequential Recommendation
Mingshi Xu, Haoren Zhu, Wilfred Siu Hung Ng
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[54] arXiv:2509.23874 [pdf, html, other]
Title: Multi-Value-Product Retrieval-Augmented Generation for Industrial Product Attribute Value Identification
Huike Zou, Haiyang Yang, Yindu Su, Liyu Chen, Chengbao Lian, Qingheng Zhang, Shuguang Han, Jufeng Chen
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[55] arXiv:2509.23861 [pdf, html, other]
Title: Investigating Multi-layer Representations for Dense Passage Retrieval
Zhongbin Xie, Thomas Lukasiewicz
Comments: Accepted to Findings of EMNLP 2025
Subjects: Information Retrieval (cs.IR)
[56] arXiv:2509.23860 [pdf, html, other]
Title: GSID: Generative Semantic Indexing for E-Commerce Product Understanding
Haiyang Yang, Qinye Xie, Qingheng Zhang, Liyu Chen, Huike Zou, Chengbao Lian, Shuguang Han, Fei Huang, Jufeng Chen, Bo Zheng
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[57] arXiv:2509.23776 [pdf, html, other]
Title: Semantic Representation of Processes with Ontology Design Patterns
Ebrahim Norouzi, Sven Hertling, Jörg Waitelonis, Harald Sack
Subjects: Information Retrieval (cs.IR); Information Theory (cs.IT)
[58] arXiv:2509.23771 [pdf, other]
Title: Constructing Opera Seria in the Iberian Courts: Metastasian Repertoire for Spain and Portugal
Ana Llorens, Alvaro Torrente
Journal-ref: Anuario Musical, 76 (2021), pp. 73-110
Subjects: Information Retrieval (cs.IR)
[59] arXiv:2509.23649 [pdf, html, other]
Title: From Past To Path: Masked History Learning for Next-Item Prediction in Generative Recommendation
KaiWen Wei, Kejun He, Xiaomian Kang, Jie Zhang, Yuming Yang, Jiang Zhong, He Bai, Junnan Zhu
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[60] arXiv:2509.23175 [pdf, html, other]
Title: WARBERT: A Hierarchical BERT-based Model for Web API Recommendation
Zishuo Xu, Yuhong Gu, Dezhong Yao
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[61] arXiv:2509.22807 [pdf, html, other]
Title: MTRec: Learning to Align with User Preferences via Mental Reward Models
Mengchen Zhao, Yifan Gao, Yaqing Hou, Xiangyang Li, Pengjie Gu, Zhenhua Dong, Ruiming Tang, Yi Cai
Journal-ref: Proceedings of the 39th Conference on Neural Information Processing Systems (NeurIPS 2025)
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[62] arXiv:2509.22661 [pdf, other]
Title: Next Point-of-interest (POI) Recommendation Model Based on Multi-modal Spatio-temporal Context Feature Embedding
Lingyu Zhang, Guobin Wu, Yan Wang, Pengfei Xu, Jian Liang, Xuan Song, Yunhai Wang
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[63] arXiv:2509.22660 [pdf, html, other]
Title: Fairness for niche users and providers: algorithmic choice and profile portability
Elizabeth McKinnie, Anas Buhayh, Clement Canel, Robin Burke
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[64] arXiv:2509.22659 [pdf, html, other]
Title: Federated Consistency- and Complementarity-aware Consensus-enhanced Recommendation
Yunqi Mi, Boyang Yan, Guoshuai Zhao, Jialie Shen, Xueming Qian
Subjects: Information Retrieval (cs.IR)
[65] arXiv:2509.22658 [pdf, html, other]
Title: How good are LLMs at Retrieving Documents in a Specific Domain?
Nafis Tanveer Islam, Zhiming Zhao
Comments: Accepted at FAIEMA Conference 2025. DOI will be provided once the conference publishes the paper
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[66] arXiv:2509.25106 (cross-list from cs.CL) [pdf, html, other]
Title: Towards Personalized Deep Research: Benchmarks and Evaluations
Yuan Liang, Jiaxian Li, Yuqing Wang, Piaohong Wang, Motong Tian, Pai Liu, Shuofei Qiao, Runnan Fang, He Zhu, Ge Zhang, Minghao Liu, Yuchen Eleanor Jiang, Ningyu Zhang, Wangchunshu Zhou
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[67] arXiv:2509.25085 (cross-list from cs.CL) [pdf, html, other]
Title: jina-reranker-v3: Last but Not Late Interaction for Document Reranking
Feng Wang, Yuqing Li, Han Xiao
Comments: early draft, CodeIR table needs to be updated (qwen baselines are missing)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[68] arXiv:2509.25084 (cross-list from cs.CL) [pdf, html, other]
Title: Scaling Generalist Data-Analytic Agents
Shuofei Qiao, Yanqiu Zhao, Zhisong Qiu, Xiaobin Wang, Jintian Zhang, Zhao Bin, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[69] arXiv:2509.24815 (cross-list from cs.DS) [pdf, html, other]
Title: Efficient Sketching and Nearest Neighbor Search Algorithms for Sparse Vector Sets
Sebastian Bruch, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini
Subjects: Data Structures and Algorithms (cs.DS); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[70] arXiv:2509.24405 (cross-list from cs.CL) [pdf, html, other]
Title: Multilingual Text-to-SQL: Benchmarking the Limits of Language Models with Collaborative Language Agents
Khanh Trinh Pham, Thu Huong Nguyen, Jun Jo, Quoc Viet Hung Nguyen, Thanh Tam Nguyen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Emerging Technologies (cs.ET); Information Retrieval (cs.IR)
[71] arXiv:2509.24193 (cross-list from cs.CL) [pdf, html, other]
Title: AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
Ran Xu, Yuchen Zhuang, Zihan Dong, Jonathan Wang, Yue Yu, Joyce C. Ho, Linjun Zhang, Haoyu Wang, Wenqi Shi, Carl Yang
Comments: Accepted to NeurIPS 2025 (Spotlight)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[72] arXiv:2509.23883 (cross-list from cs.CL) [pdf, html, other]
Title: DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning
Yibo Yan, Guangwei Xu, Xin Zou, Shuliang Liu, James Kwok, Xuming Hu
Comments: Under review
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[73] arXiv:2509.23742 (cross-list from cs.LG) [pdf, html, other]
Title: GBSK: Skeleton Clustering via Granular-ball Computing and Multi-Sampling for Large-Scale Data
Yewang Chen, Junfeng Li, Shuyin Xia, Qinghong Lai, Xinbo Gao, Guoyin Wang, Dongdong Cheng, Yi Liu, Yi Wang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[74] arXiv:2509.23577 (cross-list from cs.DB) [pdf, html, other]
Title: ML-Asset Management: Curation, Discovery, and Utilization
Mengying Wang, Moming Duan, Yicong Huang, Chen Li, Bingsheng He, Yinghui Wu
Comments: Tutorial, VLDB 2025. Project page: this https URL
Journal-ref: PVLDB, 18(12): 5493 - 5498, 2025
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[75] arXiv:2509.23471 (cross-list from cs.LG) [pdf, html, other]
Title: Drift-Adapter: A Practical Approach to Near Zero-Downtime Embedding Model Upgrades in Vector Databases
Harshil Vejendla
Comments: EMNLP 2025 Main 12 pages, 6 figures
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR)
[76] arXiv:2509.23338 (cross-list from cs.DB) [pdf, other]
Title: PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation
Wei Zhou, Guoliang Li, Haoyu Wang, Yuxing Han, Xufei Wu, Fan Wu, Xuanhe Zhou
Comments: To appear in NeurIPS 2025. Welcome your submission to challenge our leaderboard at: this https URL. Also visit our code repository at: this https URL
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[77] arXiv:2509.22991 (cross-list from cs.CL) [pdf, html, other]
Title: ADAM: A Diverse Archive of Mankind for Evaluating and Enhancing LLMs in Biographical Reasoning
Jasin Cekinmez, Omid Ghahroodi, Saad Fowad Chandle, Dhiman Gupta, Ehsaneddin Asgari
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[78] arXiv:2509.22845 (cross-list from cs.CL) [pdf, html, other]
Title: Learning to Detect Relevant Contexts and Knowledge for Response Selection in Retrieval-based Dialogue Systems
Kai Hua, Zhiyuan Feng, Chongyang Tao, Rui Yan, Lu Zhang
Comments: 10 pages, 4 figures, accepted by CIKM 2020
Journal-ref: Proc. CIKM 20, pp. 525-534, 2020
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)

Mon, 29 Sep 2025 (showing 18 of 18 entries )

[79] arXiv:2509.22486 [pdf, html, other]
Title: Your RAG is Unfair: Exposing Fairness Vulnerabilities in Retrieval-Augmented Generation via Backdoor Attacks
Gaurav Bagwe, Saket S. Chaturvedi, Xiaolong Ma, Xiaoyong Yuan, Kuang-Ching Wang, Lan Zhang
Comments: Accepted by EMNLP 2025
Subjects: Information Retrieval (cs.IR); Cryptography and Security (cs.CR)
[80] arXiv:2509.22325 [pdf, html, other]
Title: Can Synthetic Query Rewrites Capture User Intent Better than Humans in Retrieval-Augmented Generation?
JiaYing Zheng, HaiNan Zhang, Liang Pang, YongXin Tong, ZhiMing Zheng
Comments: 10 pages, 6 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[81] arXiv:2509.22116 [pdf, html, other]
Title: Does Generative Retrieval Overcome the Limitations of Dense Retrieval?
Yingchen Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng
Subjects: Information Retrieval (cs.IR)
[82] arXiv:2509.22046 [pdf, html, other]
Title: GoalRank: Group-Relative Optimization for a Large Ranking Model
Kaike Zhang, Xiaobei Wang, Shuchang Liu, Hailan Yang, Xiang Li, Lantao Hu, Han Li, Qi Cao, Fei Sun, Kun Gai
Subjects: Information Retrieval (cs.IR)
[83] arXiv:2509.21966 [pdf, html, other]
Title: Effect of Model Merging in Domain-Specific Ad-hoc Retrieval
Taiga Sasaki, Takehiro Yamamoto, Hiroaki Ohshima, Sumio Fujita
Comments: Accepted at CIKM 2025, 5 pages
Subjects: Information Retrieval (cs.IR)
[84] arXiv:2509.21391 [pdf, html, other]
Title: MIXRAG : Mixture-of-Experts Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering
Lihui Liu, Carl J. Yang
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[85] arXiv:2509.21371 [pdf, html, other]
Title: ReGeS: Reciprocal Retrieval-Generation Synergy for Conversational Recommender Systems
Dayu Yang, Hui Fang
Comments: Accepted by WISE 2025: 26th International Web Information Systems Engineering conference. Our code is publicly available at the link: this https URL
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[86] arXiv:2509.21339 [pdf, html, other]
Title: Cross-Modal Retrieval with Cauchy-Schwarz Divergence
Jiahao Zhang, Wenzhe Yin, Shujian Yu
Comments: Accepted by ACMMM-25
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[87] arXiv:2509.21336 [pdf, html, other]
Title: HetaRAG: Hybrid Deep Retrieval-Augmented Generation across Heterogeneous Data Stores
Guohang Yan, Yue Zhang, Pinlong Cai, Ding Wang, Song Mao, Hongwei Zhang, Yaoze Zhang, Hairong Zhang, Xinyu Cai, Botian Shi
Comments: 15 pages, 4 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[88] arXiv:2509.21325 [pdf, html, other]
Title: PIR-RAG: A System for Private Information Retrieval in Retrieval-Augmented Generation
Baiqiang Wang, Qian Lou, Mengxin Zheng, Dongfang Zhao
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[89] arXiv:2509.21324 [pdf, html, other]
Title: From Search to Reasoning: A Five-Level RAG Capability Framework for Enterprise Data
Gurbinder Gill, Ritvik Gupta, Denis Lusson, Anand Chandrashekar, Donald Nguyen
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[90] arXiv:2509.21323 [pdf, html, other]
Title: SPELUNKER: Item Similarity Search Using Large Language Models and Custom K-Nearest Neighbors
Ana Rodrigues, João Mata, Rui Rego
Comments: 6 pages, 4 figures
Subjects: Information Retrieval (cs.IR)
[91] arXiv:2509.22565 (cross-list from cs.CL) [pdf, other]
Title: Retrieval-Augmented Guardrails for AI-Drafted Patient-Portal Messages: Error Taxonomy Construction and Large-Scale Evaluation
Wenyuan Chen, Fateme Nateghi Haredasht, Kameron C. Black, Francois Grolleau, Emily Alsentzer, Jonathan H. Chen, Stephen P. Ma
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[92] arXiv:2509.22493 (cross-list from cs.RO) [pdf, html, other]
Title: Ontological foundations for contrastive explanatory narration of robot plans
Alberto Olivares-Alarcos, Sergi Foix, Júlia Borràs, Gerard Canal, Guillem Alenyà
Comments: This version was submitted to the journal Information Sciences and is under review since October 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Logic in Computer Science (cs.LO)
[93] arXiv:2509.22275 (cross-list from stat.AP) [pdf, html, other]
Title: Chronic Stress, Immune Suppression, and Cancer Occurrence: Unveiling the Connection using Survey Data and Predictive Models
Teddy Lazebnik, Vered Aharonson
Subjects: Applications (stat.AP); Computers and Society (cs.CY); Information Retrieval (cs.IR)
[94] arXiv:2509.22162 (cross-list from cs.DB) [pdf, other]
Title: The system of processing and analysis of customer tracking data for customer journey research on the base of RFID technology
Marina Kholod
Comments: 20 pages, in Russian language, 5 figures
Subjects: Databases (cs.DB); Information Retrieval (cs.IR)
[95] arXiv:2509.22150 (cross-list from cs.CV) [pdf, html, other]
Title: Joint graph entropy knowledge distillation for point cloud classification and robustness against corruptions
Zhiqiang Tian, Weigang Li, Junwei Hu, Chunhua Deng
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[96] arXiv:2509.22125 (cross-list from cs.CL) [pdf, html, other]
Title: FoodSEM: Large Language Model Specialized in Food Named-Entity Linking
Ana Gjorgjevikj, Matej Martinc, Gjorgjina Cenikj, Sašo Džeroski, Barbara Koroušić Seljak, Tome Eftimov
Comments: To appear in the Proceedings of the 28th International Conference on Discovery Science (DS 2025)
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
Total of 96 entries : 1-50 51-96
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack