Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.IR

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Information Retrieval

Authors and titles for recent submissions

  • Fri, 3 Oct 2025
  • Thu, 2 Oct 2025
  • Wed, 1 Oct 2025
  • Tue, 30 Sep 2025
  • Mon, 29 Sep 2025

See today's new changes

Total of 96 entries : 29-78 51-96
Showing up to 50 entries per page: fewer | more | all

Wed, 1 Oct 2025 (showing 22 of 22 entries )

[29] arXiv:2509.26448 [pdf, other]
Title: Informed Dataset Selection
Abdullah Abbas, Michael Heep, Theodor Sperle
Comments: 45 pages, 4 figures
Subjects: Information Retrieval (cs.IR)
[30] arXiv:2509.26378 [pdf, other]
Title: MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval
Junjie Zhou, Ze Liu, Lei Xiong, Jin-Ge Yao, Yueze Wang, Shitao Xiao, Fenfen Lin, Miguel Hu Chen, Zhicheng Dou, Siqi Bao, Defu Lian, Yongping Xiong, Zheng Liu
Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV)
[31] arXiv:2509.26262 [pdf, html, other]
Title: Analyzing BEV Suitability and Charging Strategies Using Italian Driving Data
Homa Jamalof, Luca Vassio, Danilo Giordano, Marco Mellia, Claudio De Tommasi
Comments: Accepted at 2025 IEEE Transportation Electrification Conference and Expo, Asia-Pacific (ITEC-AP 2025)
Subjects: Information Retrieval (cs.IR); Computational Engineering, Finance, and Science (cs.CE)
[32] arXiv:2509.26203 [pdf, other]
Title: Self-supervised learning for phase retrieval
Victor Sechaud (Phys-ENS), Patrice Abry (Phys-ENS), Laurent Jacques (ICTEAM), Julián Tachella (Phys-ENS, CNRS)
Comments: in French language. GRETSI, Aug 2025, Strasboug, France
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[33] arXiv:2509.26184 [pdf, html, other]
Title: Auto-ARGUE: LLM-Based Report Generation Evaluation
William Walden, Marc Mason, Orion Weller, Laura Dietz, Hannah Recknor, Bryan Li, Gabrielle Kaili-May Liu, Yu Hou, James Mayfield, Eugene Yang
Comments: ECIR 2025 demo format
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[34] arXiv:2509.26172 [pdf, html, other]
Title: Leveraging Scene Context with Dual Networks for Sequential User Behavior Modeling
Xu Chen, Yunmeng Shu, Yuangang Pan, Jinsong Lan, Xiaoyong Zhu, Shuai Xiao, Haojin Zhu, Ivor W. Tsang, Bo Zheng
Comments: 12pages
Subjects: Information Retrieval (cs.IR)
[35] arXiv:2509.26107 [pdf, html, other]
Title: Items Proxy Bridging: Enabling Frictionless Critiquing in Knowledge Graph Recommendations
Huanyu Zhang, Xiaoxuan Shen, Yu Lei, Baolin Yi, Jianfang Liu, Yinao xie
Subjects: Information Retrieval (cs.IR)
[36] arXiv:2509.26063 [pdf, html, other]
Title: Fading to Grow: Growing Preference Ratios via Preference Fading Discrete Diffusion for Recommendation
Guoqing Hu, An Zhang. Shuchang Liu, Wenyu Mao, Jiancan Wu, Xun Yang, Xiang Li, Lantao Hu, Han Li, Kun Gai, Xiang Wang
Journal-ref: NeurIPS 2025
Subjects: Information Retrieval (cs.IR)
[37] arXiv:2509.25839 [pdf, html, other]
Title: RAE: A Neural Network Dimensionality Reduction Method for Nearest Neighbors Preservation in Vector Search
Han Zhang, Dongfang Zhao
Comments: submitted to ICLR 2026
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Databases (cs.DB)
[38] arXiv:2509.25803 [pdf, html, other]
Title: Better with Less: Small Proprietary Models Surpass Large Language Models in Financial Transaction Understanding
Wanying Ding, Savinay Narendra, Xiran Shi, Adwait Ratnaparkhi, Chengrui Yang, Nikoo Sabzevar, Ziyan Yin
Comments: 9 pages, 5 figures
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[39] arXiv:2509.25755 [pdf, html, other]
Title: HiFIRec: Towards High-Frequency yet Low-Intention Behaviors for Multi-Behavior Recommendation
Ruiqi Luo, Ran Jin, Zhenglong Li, Kaixi Hu, Xiaohui Tao, Lin Li
Subjects: Information Retrieval (cs.IR); Social and Information Networks (cs.SI)
[40] arXiv:2509.25602 [pdf, html, other]
Title: TRUE: A Reproducible Framework for LLM-Driven Relevance Judgment in Information Retrieval
Mouly Dewan, Jiqun Liu, Chirag Shah
Subjects: Information Retrieval (cs.IR)
[41] arXiv:2509.25494 [pdf, html, other]
Title: On-Premise AI for the Newsroom: Evaluating Small Language Models for Investigative Document Search
Nick Hagar, Nicholas Diakopoulos, Jeremy Gilbert
Comments: Accepted to Computation + Journalism Symposium 2025
Subjects: Information Retrieval (cs.IR)
[42] arXiv:2509.26584 (cross-list from cs.AI) [pdf, html, other]
Title: Fairness Testing in Retrieval-Augmented Generation: How Small Perturbations Reveal Bias in Small Language Models
Matheus Vinicius da Silva de Oliveira, Jonathan de Andrade Silva, Awdren de Lima Fontao
Subjects: Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG); Software Engineering (cs.SE)
[43] arXiv:2509.26330 (cross-list from cs.CV) [pdf, html, other]
Title: SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval
Ren-Di Wu, Yu-Yen Lin, Huei-Fang Yang
Comments: 20 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[44] arXiv:2509.26094 (cross-list from cs.DS) [pdf, html, other]
Title: On Computing Top-$k$ Simple Shortest Paths from a Single Source
Mattia D'Emidio, Gabriele Di Stefano
Comments: 21 pages, 2 figures, to be published in ALENEX 2026
Subjects: Data Structures and Algorithms (cs.DS); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Networking and Internet Architecture (cs.NI)
[45] arXiv:2509.26014 (cross-list from cs.SE) [pdf, html, other]
Title: Using GPT to build a Project Management assistant for Jira environments
Joel Garcia-Escribano, Arkaitz Carbajo, Mikel Egaña Aranguren, Unai Lopez-Novoa
Subjects: Software Engineering (cs.SE); Information Retrieval (cs.IR)
[46] arXiv:2509.25992 (cross-list from cs.SI) [pdf, html, other]
Title: MHINDR -- a DSM5 based mental health diagnosis and recommendation framework using LLM
Vaishali Agarwal, Sachin Thukral, Arnab Chatterjee
Comments: 7 pages, 1 figure, 4 tables
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[47] arXiv:2509.25716 (cross-list from cs.SE) [pdf, html, other]
Title: DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation
Esakkivel Esakkiraja, Denis Akhiyarov, Aditya Shanmugham, Chitra Ganapathy
Comments: Retrieval-Augmented Generation, API Prediction, Context-Aware Code Generation, Enterprise Code Completion, Reinforcement Learning, ServiceNow, Real-Time Code Search, Query Enhancement, Fine-Tuning, Embedding, Reranker
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[48] arXiv:2509.25593 (cross-list from cs.AI) [pdf, html, other]
Title: Causal Autoencoder-like Generation of Feedback Fuzzy Cognitive Maps with an LLM Agent
Akash Kumar Panda, Olaoluwa Adigun, Bart Kosko
Comments: 8 pages, 4 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR)
[49] arXiv:2509.25487 (cross-list from cs.LG) [pdf, html, other]
Title: Scalable Disk-Based Approximate Nearest Neighbor Search with Page-Aligned Graph
Dingyi Kang, Dongming Jiang, Hanshen Yang, Hang Liu, Bingzhe Li
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Information Retrieval (cs.IR)
[50] arXiv:2509.25257 (cross-list from cs.SE) [pdf, html, other]
Title: RANGER -- Repository-Level Agent for Graph-Enhanced Retrieval
Pratik Shah, Rajat Ghosh, Aryan Singhal, Debojyoti Dutta
Comments: 24 pages, 4 figures
Subjects: Software Engineering (cs.SE); Information Retrieval (cs.IR); Machine Learning (cs.LG)

Tue, 30 Sep 2025 (showing 28 of 28 entries )

[51] arXiv:2509.24869 [pdf, html, other]
Title: Retro*: Optimizing LLMs for Reasoning-Intensive Document Retrieval
Junwei Lan, Jianlyu Chen, Zheng Liu, Chaofan Li, Siqi Bao, Defu Lian
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[52] arXiv:2509.24632 [pdf, html, other]
Title: UniDex: Rethinking Search Inverted Indexing with Unified Semantic Modeling
Zan Li, Jiahui Chen, Yuan Chai, Xiaoze Jiang, Xiaohua Qi, Zhiheng Qin, Runbin Zhou, Shun Zuo, Guangchao Hao, Kefeng Wang, Jingshan Lv, Yupeng Huang, Xiao Liang, Han Li
Comments: 11 pages, 6 figures and 5 tables
Subjects: Information Retrieval (cs.IR)
[53] arXiv:2509.24424 [pdf, html, other]
Title: Multi-Item-Query Attention for Stable Sequential Recommendation
Mingshi Xu, Haoren Zhu, Wilfred Siu Hung Ng
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[54] arXiv:2509.23874 [pdf, html, other]
Title: Multi-Value-Product Retrieval-Augmented Generation for Industrial Product Attribute Value Identification
Huike Zou, Haiyang Yang, Yindu Su, Liyu Chen, Chengbao Lian, Qingheng Zhang, Shuguang Han, Jufeng Chen
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[55] arXiv:2509.23861 [pdf, html, other]
Title: Investigating Multi-layer Representations for Dense Passage Retrieval
Zhongbin Xie, Thomas Lukasiewicz
Comments: Accepted to Findings of EMNLP 2025
Subjects: Information Retrieval (cs.IR)
[56] arXiv:2509.23860 [pdf, html, other]
Title: GSID: Generative Semantic Indexing for E-Commerce Product Understanding
Haiyang Yang, Qinye Xie, Qingheng Zhang, Liyu Chen, Huike Zou, Chengbao Lian, Shuguang Han, Fei Huang, Jufeng Chen, Bo Zheng
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[57] arXiv:2509.23776 [pdf, html, other]
Title: Semantic Representation of Processes with Ontology Design Patterns
Ebrahim Norouzi, Sven Hertling, Jörg Waitelonis, Harald Sack
Subjects: Information Retrieval (cs.IR); Information Theory (cs.IT)
[58] arXiv:2509.23771 [pdf, other]
Title: Constructing Opera Seria in the Iberian Courts: Metastasian Repertoire for Spain and Portugal
Ana Llorens, Alvaro Torrente
Journal-ref: Anuario Musical, 76 (2021), pp. 73-110
Subjects: Information Retrieval (cs.IR)
[59] arXiv:2509.23649 [pdf, html, other]
Title: From Past To Path: Masked History Learning for Next-Item Prediction in Generative Recommendation
KaiWen Wei, Kejun He, Xiaomian Kang, Jie Zhang, Yuming Yang, Jiang Zhong, He Bai, Junnan Zhu
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[60] arXiv:2509.23175 [pdf, html, other]
Title: WARBERT: A Hierarchical BERT-based Model for Web API Recommendation
Zishuo Xu, Yuhong Gu, Dezhong Yao
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[61] arXiv:2509.22807 [pdf, html, other]
Title: MTRec: Learning to Align with User Preferences via Mental Reward Models
Mengchen Zhao, Yifan Gao, Yaqing Hou, Xiangyang Li, Pengjie Gu, Zhenhua Dong, Ruiming Tang, Yi Cai
Journal-ref: Proceedings of the 39th Conference on Neural Information Processing Systems (NeurIPS 2025)
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[62] arXiv:2509.22661 [pdf, other]
Title: Next Point-of-interest (POI) Recommendation Model Based on Multi-modal Spatio-temporal Context Feature Embedding
Lingyu Zhang, Guobin Wu, Yan Wang, Pengfei Xu, Jian Liang, Xuan Song, Yunhai Wang
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[63] arXiv:2509.22660 [pdf, html, other]
Title: Fairness for niche users and providers: algorithmic choice and profile portability
Elizabeth McKinnie, Anas Buhayh, Clement Canel, Robin Burke
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[64] arXiv:2509.22659 [pdf, html, other]
Title: Federated Consistency- and Complementarity-aware Consensus-enhanced Recommendation
Yunqi Mi, Boyang Yan, Guoshuai Zhao, Jialie Shen, Xueming Qian
Subjects: Information Retrieval (cs.IR)
[65] arXiv:2509.22658 [pdf, html, other]
Title: How good are LLMs at Retrieving Documents in a Specific Domain?
Nafis Tanveer Islam, Zhiming Zhao
Comments: Accepted at FAIEMA Conference 2025. DOI will be provided once the conference publishes the paper
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[66] arXiv:2509.25106 (cross-list from cs.CL) [pdf, html, other]
Title: Towards Personalized Deep Research: Benchmarks and Evaluations
Yuan Liang, Jiaxian Li, Yuqing Wang, Piaohong Wang, Motong Tian, Pai Liu, Shuofei Qiao, Runnan Fang, He Zhu, Ge Zhang, Minghao Liu, Yuchen Eleanor Jiang, Ningyu Zhang, Wangchunshu Zhou
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[67] arXiv:2509.25085 (cross-list from cs.CL) [pdf, html, other]
Title: jina-reranker-v3: Last but Not Late Interaction for Document Reranking
Feng Wang, Yuqing Li, Han Xiao
Comments: early draft, CodeIR table needs to be updated (qwen baselines are missing)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[68] arXiv:2509.25084 (cross-list from cs.CL) [pdf, html, other]
Title: Scaling Generalist Data-Analytic Agents
Shuofei Qiao, Yanqiu Zhao, Zhisong Qiu, Xiaobin Wang, Jintian Zhang, Zhao Bin, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[69] arXiv:2509.24815 (cross-list from cs.DS) [pdf, html, other]
Title: Efficient Sketching and Nearest Neighbor Search Algorithms for Sparse Vector Sets
Sebastian Bruch, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini
Subjects: Data Structures and Algorithms (cs.DS); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[70] arXiv:2509.24405 (cross-list from cs.CL) [pdf, html, other]
Title: Multilingual Text-to-SQL: Benchmarking the Limits of Language Models with Collaborative Language Agents
Khanh Trinh Pham, Thu Huong Nguyen, Jun Jo, Quoc Viet Hung Nguyen, Thanh Tam Nguyen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Emerging Technologies (cs.ET); Information Retrieval (cs.IR)
[71] arXiv:2509.24193 (cross-list from cs.CL) [pdf, html, other]
Title: AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
Ran Xu, Yuchen Zhuang, Zihan Dong, Jonathan Wang, Yue Yu, Joyce C. Ho, Linjun Zhang, Haoyu Wang, Wenqi Shi, Carl Yang
Comments: Accepted to NeurIPS 2025 (Spotlight)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[72] arXiv:2509.23883 (cross-list from cs.CL) [pdf, html, other]
Title: DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning
Yibo Yan, Guangwei Xu, Xin Zou, Shuliang Liu, James Kwok, Xuming Hu
Comments: Under review
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[73] arXiv:2509.23742 (cross-list from cs.LG) [pdf, html, other]
Title: GBSK: Skeleton Clustering via Granular-ball Computing and Multi-Sampling for Large-Scale Data
Yewang Chen, Junfeng Li, Shuyin Xia, Qinghong Lai, Xinbo Gao, Guoyin Wang, Dongdong Cheng, Yi Liu, Yi Wang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[74] arXiv:2509.23577 (cross-list from cs.DB) [pdf, html, other]
Title: ML-Asset Management: Curation, Discovery, and Utilization
Mengying Wang, Moming Duan, Yicong Huang, Chen Li, Bingsheng He, Yinghui Wu
Comments: Tutorial, VLDB 2025. Project page: this https URL
Journal-ref: PVLDB, 18(12): 5493 - 5498, 2025
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[75] arXiv:2509.23471 (cross-list from cs.LG) [pdf, html, other]
Title: Drift-Adapter: A Practical Approach to Near Zero-Downtime Embedding Model Upgrades in Vector Databases
Harshil Vejendla
Comments: EMNLP 2025 Main 12 pages, 6 figures
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR)
[76] arXiv:2509.23338 (cross-list from cs.DB) [pdf, other]
Title: PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation
Wei Zhou, Guoliang Li, Haoyu Wang, Yuxing Han, Xufei Wu, Fan Wu, Xuanhe Zhou
Comments: To appear in NeurIPS 2025. Welcome your submission to challenge our leaderboard at: this https URL. Also visit our code repository at: this https URL
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[77] arXiv:2509.22991 (cross-list from cs.CL) [pdf, html, other]
Title: ADAM: A Diverse Archive of Mankind for Evaluating and Enhancing LLMs in Biographical Reasoning
Jasin Cekinmez, Omid Ghahroodi, Saad Fowad Chandle, Dhiman Gupta, Ehsaneddin Asgari
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[78] arXiv:2509.22845 (cross-list from cs.CL) [pdf, html, other]
Title: Learning to Detect Relevant Contexts and Knowledge for Response Selection in Retrieval-based Dialogue Systems
Kai Hua, Zhiyuan Feng, Chongyang Tao, Rui Yan, Lu Zhang
Comments: 10 pages, 4 figures, accepted by CIKM 2020
Journal-ref: Proc. CIKM 20, pp. 525-534, 2020
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Total of 96 entries : 29-78 51-96
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack