Information Retrieval

Authors and titles for recent submissions

  • Mon, 6 Oct 2025
  • Fri, 3 Oct 2025
  • Thu, 2 Oct 2025
  • Wed, 1 Oct 2025
  • Tue, 30 Sep 2025


Total of 89 entries

Mon, 6 Oct 2025 (showing 11 of 11 entries)

[1] arXiv:2510.03203 [pdf, other]
Title: OpenZL: A Graph-Based Model for Compression
Yann Collet, Nick Terrell, W. Felix Handte, Danielle Rozenblit, Victor Zhang, Kevin Zhang, Yaelle Goldschlag, Jennifer Lee, Daniel Riegel, Stan Angelov, Nadav Rotem
Subjects: Information Retrieval (cs.IR); Databases (cs.DB)
[2] arXiv:2510.02668 [pdf, html, other]
Title: AgenticRAG: Tool-Augmented Foundation Models for Zero-Shot Explainable Recommender Systems
Bo Ma, Hang Li, ZeHua Hu, XiaoFan Gui, LuYao Liu, Simon Liu
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[3] arXiv:2510.02657 [pdf, html, other]
Title: Less LLM, More Documents: Searching for Improved RAG
Jingjie Ning, Yibo Kong, Yunfan Long, Jamie Callan
Comments: 16 pages. Submitted to ECIR 2026
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[4] arXiv:2510.02656 [pdf, html, other]
Title: A Simple but Effective Elaborative Query Reformulation Approach for Natural Language Recommendation
Qianfeng Wen, Yifan Liu, Justin Cui, Joshua Zhang, Anton Korikov, George-Kirollos Saad, Scott Sanner
Comments: 11 pages, 5 figures
Subjects: Information Retrieval (cs.IR)
[5] arXiv:2510.02512 [pdf, html, other]
Title: Revisiting Query Variants: The Advantage of Retrieval Over Generation of Query Variants for Effective QPP
Fangzheng Tian, Debasis Ganguly, Craig Macdonald
Comments: 11 pages, 4 figures
Subjects: Information Retrieval (cs.IR)
[6] arXiv:2510.03038 (cross-list from cs.LG) [pdf, html, other]
Title: CHORD: Customizing Hybrid-precision On-device Model for Sequential Recommendation with Device-cloud Collaboration
Tianqi Liu, Kairui Fu, Shengyu Zhang, Wenyan Fan, Zhaocheng Du, Jieming Zhu, Fan Wu, Fei Wu
Comments: accepted by ACM MM'25
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[7] arXiv:2510.02967 (cross-list from cs.CL) [pdf, html, other]
Title: Grounding Large Language Models in Clinical Evidence: A Retrieval-Augmented Generation System for Querying UK NICE Clinical Guidelines
Matthew Lewis, Samuel Thio, Richard JB Dobson, Spiros Denaxas
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[8] arXiv:2510.02827 (cross-list from cs.CL) [pdf, html, other]
Title: StepChain GraphRAG: Reasoning Over Knowledge Graphs for Multi-Hop Question Answering
Tengjun Ni, Xin Yuan, Shenghong Li, Kai Wu, Ren Ping Liu, Wei Ni, Wenjie Zhang
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[9] arXiv:2510.02669 (cross-list from cs.AI) [pdf, html, other]
Title: AutoMaAS: Self-Evolving Multi-Agent Architecture Search for Large Language Models
Bo Ma, Hang Li, ZeHua Hu, XiaoFan Gui, LuYao Liu, Simon Liu
Subjects: Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR)
[10] arXiv:2510.02653 (cross-list from cs.AI) [pdf, html, other]
Title: Geolog-IA: Conversational System for Academic Theses
Micaela Fuel Pozo, Andrea Guatumillo Saltos, Yeseña Tipan Llumiquinga, Kelly Lascano Aguirre, Marilyn Castillo Jara, Christian Mejia-Escobar
Comments: 17 pages, in Spanish language
Subjects: Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[11] arXiv:2510.02539 (cross-list from cs.CL) [pdf, html, other]
Title: Hierarchical Semantic Retrieval with Cobweb
Anant Gupta, Karthik Singaravadivelan, Zekun Wang
Comments: 20 pages, 7 tables, 4 figures
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)

Fri, 3 Oct 2025 (showing 15 of 15 entries)

[12] arXiv:2510.02241 [pdf, html, other]
Title: Study on LLMs for Promptagator-Style Dense Retriever Training
Daniel Gwon, Nour Jedidi, Jimmy Lin
Comments: CIKM 2025 short research paper
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[13] arXiv:2510.02219 [pdf, html, other]
Title: Contrastive Retrieval Heads Improve Attention-Based Re-Ranking
Linh Tran, Yulong Li, Radu Florian, Wei Sun
Subjects: Information Retrieval (cs.IR)
[14] arXiv:2510.01871 [pdf, html, other]
Title: Ranking Items from Discrete Ratings: The Cost of Unknown User Thresholds
Oscar Villemaud, Suryanarayana Sankagiri, Matthias Grossglauser
Comments: 12 pages, 4 figures
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[15] arXiv:2510.01698 [pdf, html, other]
Title: TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling
Seungheon Doh, Keunwoo Choi, Juhan Nam
Comments: Accepted for publication at The Workshop on AI for Music, Neural Information Processing Systems (NeurIPS-AI4Music)
Subjects: Information Retrieval (cs.IR); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[16] arXiv:2510.01622 [pdf, html, other]
Title: LLM4Rec: Large Language Models for Multimodal Generative Recommendation with Causal Debiasing
Bo Ma, Hang Li, ZeHua Hu, XiaoFan Gui, LuYao Liu, Simon Lau
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[17] arXiv:2510.01606 [pdf, html, other]
Title: Bridging Collaborative Filtering and Large Language Models with Dynamic Alignment, Multimodal Fusion and Evidence-grounded Explanations
Bo Ma, LuYao Liu, Simon Lau, Chandler Yuan, XueY Cui, Rosie Zhang
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[18] arXiv:2510.01574 [pdf, html, other]
Title: Synthetic Prefixes to Mitigate Bias in Real-Time Neural Query Autocomplete
Adithya Rajan, Xiaoyu Liu, Prateek Verma, Vibhu Arora
Comments: Accepted to the Proceedings of the ACM SIGIR Asia Pacific Conference on Information Retrieval (SIGIR-AP 2025), December 7-10, 2025, Xi'an, China
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[19] arXiv:2510.01553 [pdf, html, other]
Title: IoDResearch: Deep Research on Private Heterogeneous Data via the Internet of Data
Zhuofan Shi, Zijie Guo, Xinjian Ma, Gang Huang, Yun Ma, Xiang Jing
Comments: 8 pages, 4 figures
Subjects: Information Retrieval (cs.IR)
[20] arXiv:2510.01523 [pdf, html, other]
Title: MetaSynth: Multi-Agent Metadata Generation from Implicit Feedback in Black-Box Systems
Shreeranjani Srirangamsridharan, Ali Abavisani, Reza Yousefi Maragheh, Ramin Giahi, Kai Zhao, Jason Cho, Sushant Kumar
Comments: NeurIPS Workshop LAW
Subjects: Information Retrieval (cs.IR)
[21] arXiv:2510.01198 [pdf, html, other]
Title: Optimal signals assignment for eBay View Item page
Matan Mandelbrod, Biwei Jiang, Giald Wagner, Tal Franji, Guy Feigenblat
Comments: Accepted at the CONSEQUENCES 2025 workshop, co-located with ACM RecSys 2025
Subjects: Information Retrieval (cs.IR)
[22] arXiv:2510.01197 [pdf, html, other]
Title: Are LLMs ready to help non-expert users to make charts of official statistics data?
Gadir Suleymanli, Alexander Rogiers, Lucas Lageweg, Jefrey Lijffijt
Subjects: Information Retrieval (cs.IR)
[23] arXiv:2510.01196 [pdf, html, other]
Title: Location Matters: Leveraging Multi-Resolution Geo-Embeddings for Housing Search
Ivo Silva, Pedro Nogueira, Guilherme Bonaldo (QuintoAndar)
Comments: Accepted to RecSys 2025 (industry track)
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[24] arXiv:2510.01792 (cross-list from cs.CL) [pdf, html, other]
Title: Comparison of Unsupervised Metrics for Evaluating Judicial Decision Extraction
Ivan Leonidovich Litvak, Anton Kostin, Fedor Lashkin, Tatiana Maksiyan, Sergey Lagutin
Comments: 28 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[25] arXiv:2510.01513 (cross-list from cs.CV) [pdf, html, other]
Title: From Videos to Indexed Knowledge Graphs -- Framework to Marry Methods for Multimodal Content Analysis and Understanding
Basem Rizk, Joel Walsh, Mark Core, Benjamin Nye
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[26] arXiv:2510.01285 (cross-list from cs.MA) [pdf, html, other]
Title: LLM-based Multi-Agent Blackboard System for Information Discovery in Data Science
Alireza Salemi, Mihir Parmar, Palash Goyal, Yiwen Song, Jinsung Yoon, Hamed Zamani, Hamid Palangi, Tomas Pfister
Subjects: Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)

Thu, 2 Oct 2025 (showing 13 of 13 entries)

[27] arXiv:2510.01149 [pdf, html, other]
Title: ModernVBERT: Towards Smaller Visual Document Retrievers
Paul Teiletche, Quentin Macé, Max Conti, Antonio Loison, Gautier Viaud, Pierre Colombo, Manuel Faysse
Subjects: Information Retrieval (cs.IR)
[28] arXiv:2510.00966 [pdf, other]
Title: Deep Learning-Based Approach for Improving Relational Aggregated Search
Sara Saad Soliman, Ahmed Younes, Islam Elkabani, Ashraf Elsayed
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[29] arXiv:2510.00908 [pdf, html, other]
Title: Bridging Language Gaps: Advances in Cross-Lingual Information Retrieval with Multilingual LLMs
Roksana Goworek, Olivia Macmillan-Scott, Eda B. Özyiğit
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[30] arXiv:2510.00887 [pdf, html, other]
Title: On Listwise Reranking for Corpus Feedback
Soyoung Yoon, Jongho Kim, Daeyong Kwon, Avishek Anand, Seung-won Hwang
Comments: Under review
Subjects: Information Retrieval (cs.IR)
[31] arXiv:2510.00671 [pdf, html, other]
Title: Milco: Learned Sparse Retrieval Across Languages via a Multilingual Connector
Thong Nguyen, Yibin Lei, Jia-Huei Ju, Eugene Yang, Andrew Yates
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[32] arXiv:2510.00165 [pdf, html, other]
Title: Privacy-Preserving Learning-Augmented Data Structures
Prabhav Goyal, Vinesh Sridhar, Wilson Zheng
Comments: 6 pages, 2 figures
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS)
[33] arXiv:2510.00143 [pdf, html, other]
Title: HLTCOE at TREC 2024 NeuCLIR Track
Eugene Yang, Dawn Lawrie, Orion Weller, James Mayfield
Comments: TREC 2024 System Paper; 6 pages; 7 tables
Subjects: Information Retrieval (cs.IR)
[34] arXiv:2510.00137 [pdf, html, other]
Title: Optimizing What Matters: AUC-Driven Learning for Robust Neural Retrieval
Nima Sheikholeslami, Erfan Hosseini, Patrice Bechard, Srivatsava Daruru, Sai Rajeswar
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[35] arXiv:2510.00861 (cross-list from cs.CL) [pdf, other]
Title: Erase to Improve: Erasable Reinforcement Learning for Search-Augmented LLMs
Ziliang Wang, Kang An, Xuhui Zheng, Faqiang Qian, Weikun Zhang, Cijun Ouyang, Jialu Cai, Yuhang Wang, Yichao Wu
Comments: 10 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[36] arXiv:2510.00706 (cross-list from cs.AI) [pdf, html, other]
Title: AttentionDep: Domain-Aware Attention for Explainable Depression Severity Assessment
Yusif Ibrahimov, Tarique Anwar, Tommy Yuan, Turan Mutallimov, Elgun Hasanov
Subjects: Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[37] arXiv:2510.00694 (cross-list from cs.CL) [pdf, html, other]
Title: ALARB: An Arabic Legal Argument Reasoning Benchmark
Harethah Abu Shairah, Somayah AlHarbi, Abdulaziz AlHussein, Sameer Alsabea, Omar Shaqaqi, Hebah AlShamlan, Omar Knio, George Turkiyyah
Comments: Accepted paper at ArabicNLP 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[38] arXiv:2510.00324 (cross-list from cs.SE) [pdf, html, other]
Title: Which Programming Language and Model Work Best With LLM-as-a-Judge For Code Retrieval?
Lucas Roberts, Denisa Roberts
Comments: Accepted as a full paper at SIGIR-AP 2025
Subjects: Software Engineering (cs.SE); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[39] arXiv:2510.00039 (cross-list from cs.DB) [pdf, html, other]
Title: AutoPK: Leveraging LLMs and a Hybrid Similarity Metric for Advanced Retrieval of Pharmacokinetic Data from Complex Tables and Documents
Hossein Sholehrasa, Amirhossein Ghanaatian, Doina Caragea, Lisa A. Tell, Jim E. Riviere, Majid Jaberi-Douraki
Comments: Accepted at the 2025 IEEE 37th ICTAI
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)

Wed, 1 Oct 2025 (showing 22 of 22 entries)

[40] arXiv:2509.26448 [pdf, other]
Title: Informed Dataset Selection
Abdullah Abbas, Michael Heep, Theodor Sperle
Comments: 45 pages, 4 figures
Subjects: Information Retrieval (cs.IR)
[41] arXiv:2509.26378 [pdf, other]
Title: MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval
Junjie Zhou, Ze Liu, Lei Xiong, Jin-Ge Yao, Yueze Wang, Shitao Xiao, Fenfen Lin, Miguel Hu Chen, Zhicheng Dou, Siqi Bao, Defu Lian, Yongping Xiong, Zheng Liu
Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV)
[42] arXiv:2509.26262 [pdf, html, other]
Title: Analyzing BEV Suitability and Charging Strategies Using Italian Driving Data
Homa Jamalof, Luca Vassio, Danilo Giordano, Marco Mellia, Claudio De Tommasi
Comments: Accepted at 2025 IEEE Transportation Electrification Conference and Expo, Asia-Pacific (ITEC-AP 2025)
Subjects: Information Retrieval (cs.IR); Computational Engineering, Finance, and Science (cs.CE)
[43] arXiv:2509.26203 [pdf, other]
Title: Self-supervised learning for phase retrieval
Victor Sechaud (Phys-ENS), Patrice Abry (Phys-ENS), Laurent Jacques (ICTEAM), Julián Tachella (Phys-ENS, CNRS)
Comments: in French language. GRETSI, Aug 2025, Strasbourg, France
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[44] arXiv:2509.26184 [pdf, html, other]
Title: Auto-ARGUE: LLM-Based Report Generation Evaluation
William Walden, Marc Mason, Orion Weller, Laura Dietz, Hannah Recknor, Bryan Li, Gabrielle Kaili-May Liu, Yu Hou, James Mayfield, Eugene Yang
Comments: ECIR 2025 demo format
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[45] arXiv:2509.26172 [pdf, html, other]
Title: Leveraging Scene Context with Dual Networks for Sequential User Behavior Modeling
Xu Chen, Yunmeng Shu, Yuangang Pan, Jinsong Lan, Xiaoyong Zhu, Shuai Xiao, Haojin Zhu, Ivor W. Tsang, Bo Zheng
Comments: 12 pages
Subjects: Information Retrieval (cs.IR)
[46] arXiv:2509.26107 [pdf, html, other]
Title: Items Proxy Bridging: Enabling Frictionless Critiquing in Knowledge Graph Recommendations
Huanyu Zhang, Xiaoxuan Shen, Yu Lei, Baolin Yi, Jianfang Liu, Yinao Xie
Subjects: Information Retrieval (cs.IR)
[47] arXiv:2509.26063 [pdf, html, other]
Title: Fading to Grow: Growing Preference Ratios via Preference Fading Discrete Diffusion for Recommendation
Guoqing Hu, An Zhang, Shuchang Liu, Wenyu Mao, Jiancan Wu, Xun Yang, Xiang Li, Lantao Hu, Han Li, Kun Gai, Xiang Wang
Journal-ref: NeurIPS 2025
Subjects: Information Retrieval (cs.IR)
[48] arXiv:2509.25839 [pdf, html, other]
Title: RAE: A Neural Network Dimensionality Reduction Method for Nearest Neighbors Preservation in Vector Search
Han Zhang, Dongfang Zhao
Comments: submitted to ICLR 2026
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Databases (cs.DB)
[49] arXiv:2509.25803 [pdf, html, other]
Title: Better with Less: Small Proprietary Models Surpass Large Language Models in Financial Transaction Understanding
Wanying Ding, Savinay Narendra, Xiran Shi, Adwait Ratnaparkhi, Chengrui Yang, Nikoo Sabzevar, Ziyan Yin
Comments: 9 pages, 5 figures
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[50] arXiv:2509.25755 [pdf, html, other]
Title: HiFIRec: Towards High-Frequency yet Low-Intention Behaviors for Multi-Behavior Recommendation
Ruiqi Luo, Ran Jin, Zhenglong Li, Kaixi Hu, Xiaohui Tao, Lin Li
Subjects: Information Retrieval (cs.IR); Social and Information Networks (cs.SI)
[51] arXiv:2509.25602 [pdf, html, other]
Title: TRUE: A Reproducible Framework for LLM-Driven Relevance Judgment in Information Retrieval
Mouly Dewan, Jiqun Liu, Chirag Shah
Subjects: Information Retrieval (cs.IR)
[52] arXiv:2509.25494 [pdf, html, other]
Title: On-Premise AI for the Newsroom: Evaluating Small Language Models for Investigative Document Search
Nick Hagar, Nicholas Diakopoulos, Jeremy Gilbert
Comments: Accepted to Computation + Journalism Symposium 2025
Subjects: Information Retrieval (cs.IR)
[53] arXiv:2509.26584 (cross-list from cs.AI) [pdf, html, other]
Title: Fairness Testing in Retrieval-Augmented Generation: How Small Perturbations Reveal Bias in Small Language Models
Matheus Vinicius da Silva de Oliveira, Jonathan de Andrade Silva, Awdren de Lima Fontao
Subjects: Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG); Software Engineering (cs.SE)
[54] arXiv:2509.26330 (cross-list from cs.CV) [pdf, html, other]
Title: SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval
Ren-Di Wu, Yu-Yen Lin, Huei-Fang Yang
Comments: 20 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[55] arXiv:2509.26094 (cross-list from cs.DS) [pdf, html, other]
Title: On Computing Top-$k$ Simple Shortest Paths from a Single Source
Mattia D'Emidio, Gabriele Di Stefano
Comments: 21 pages, 2 figures, to be published in ALENEX 2026
Subjects: Data Structures and Algorithms (cs.DS); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Networking and Internet Architecture (cs.NI)
[56] arXiv:2509.26014 (cross-list from cs.SE) [pdf, html, other]
Title: Using GPT to build a Project Management assistant for Jira environments
Joel Garcia-Escribano, Arkaitz Carbajo, Mikel Egaña Aranguren, Unai Lopez-Novoa
Subjects: Software Engineering (cs.SE); Information Retrieval (cs.IR)
[57] arXiv:2509.25992 (cross-list from cs.SI) [pdf, html, other]
Title: MHINDR -- a DSM5 based mental health diagnosis and recommendation framework using LLM
Vaishali Agarwal, Sachin Thukral, Arnab Chatterjee
Comments: 7 pages, 1 figure, 4 tables
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[58] arXiv:2509.25716 (cross-list from cs.SE) [pdf, html, other]
Title: DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation
Esakkivel Esakkiraja, Denis Akhiyarov, Aditya Shanmugham, Chitra Ganapathy
Comments: Retrieval-Augmented Generation, API Prediction, Context-Aware Code Generation, Enterprise Code Completion, Reinforcement Learning, ServiceNow, Real-Time Code Search, Query Enhancement, Fine-Tuning, Embedding, Reranker
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[59] arXiv:2509.25593 (cross-list from cs.AI) [pdf, html, other]
Title: Causal Autoencoder-like Generation of Feedback Fuzzy Cognitive Maps with an LLM Agent
Akash Kumar Panda, Olaoluwa Adigun, Bart Kosko
Comments: 8 pages, 4 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR)
[60] arXiv:2509.25487 (cross-list from cs.LG) [pdf, html, other]
Title: Scalable Disk-Based Approximate Nearest Neighbor Search with Page-Aligned Graph
Dingyi Kang, Dongming Jiang, Hanshen Yang, Hang Liu, Bingzhe Li
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Information Retrieval (cs.IR)
[61] arXiv:2509.25257 (cross-list from cs.SE) [pdf, html, other]
Title: RANGER -- Repository-Level Agent for Graph-Enhanced Retrieval
Pratik Shah, Rajat Ghosh, Aryan Singhal, Debojyoti Dutta
Comments: 24 pages, 4 figures
Subjects: Software Engineering (cs.SE); Information Retrieval (cs.IR); Machine Learning (cs.LG)

Tue, 30 Sep 2025 (showing 28 of 28 entries)

[62] arXiv:2509.24869 [pdf, html, other]
Title: Retro*: Optimizing LLMs for Reasoning-Intensive Document Retrieval
Junwei Lan, Jianlyu Chen, Zheng Liu, Chaofan Li, Siqi Bao, Defu Lian
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[63] arXiv:2509.24632 [pdf, html, other]
Title: UniDex: Rethinking Search Inverted Indexing with Unified Semantic Modeling
Zan Li, Jiahui Chen, Yuan Chai, Xiaoze Jiang, Xiaohua Qi, Zhiheng Qin, Runbin Zhou, Shun Zuo, Guangchao Hao, Kefeng Wang, Jingshan Lv, Yupeng Huang, Xiao Liang, Han Li
Comments: 11 pages, 6 figures and 5 tables
Subjects: Information Retrieval (cs.IR)
[64] arXiv:2509.24424 [pdf, html, other]
Title: Multi-Item-Query Attention for Stable Sequential Recommendation
Mingshi Xu, Haoren Zhu, Wilfred Siu Hung Ng
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[65] arXiv:2509.23874 [pdf, html, other]
Title: Multi-Value-Product Retrieval-Augmented Generation for Industrial Product Attribute Value Identification
Huike Zou, Haiyang Yang, Yindu Su, Liyu Chen, Chengbao Lian, Qingheng Zhang, Shuguang Han, Jufeng Chen
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[66] arXiv:2509.23861 [pdf, html, other]
Title: Investigating Multi-layer Representations for Dense Passage Retrieval
Zhongbin Xie, Thomas Lukasiewicz
Comments: Accepted to Findings of EMNLP 2025
Subjects: Information Retrieval (cs.IR)
[67] arXiv:2509.23860 [pdf, html, other]
Title: GSID: Generative Semantic Indexing for E-Commerce Product Understanding
Haiyang Yang, Qinye Xie, Qingheng Zhang, Liyu Chen, Huike Zou, Chengbao Lian, Shuguang Han, Fei Huang, Jufeng Chen, Bo Zheng
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[68] arXiv:2509.23776 [pdf, html, other]
Title: Semantic Representation of Processes with Ontology Design Patterns
Ebrahim Norouzi, Sven Hertling, Jörg Waitelonis, Harald Sack
Subjects: Information Retrieval (cs.IR); Information Theory (cs.IT)
[69] arXiv:2509.23771 [pdf, other]
Title: Constructing Opera Seria in the Iberian Courts: Metastasian Repertoire for Spain and Portugal
Ana Llorens, Alvaro Torrente
Journal-ref: Anuario Musical, 76 (2021), pp. 73-110
Subjects: Information Retrieval (cs.IR)
[70] arXiv:2509.23649 [pdf, html, other]
Title: From Past To Path: Masked History Learning for Next-Item Prediction in Generative Recommendation
KaiWen Wei, Kejun He, Xiaomian Kang, Jie Zhang, Yuming Yang, Jiang Zhong, He Bai, Junnan Zhu
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[71] arXiv:2509.23175 [pdf, html, other]
Title: WARBERT: A Hierarchical BERT-based Model for Web API Recommendation
Zishuo Xu, Yuhong Gu, Dezhong Yao
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[72] arXiv:2509.22807 [pdf, html, other]
Title: MTRec: Learning to Align with User Preferences via Mental Reward Models
Mengchen Zhao, Yifan Gao, Yaqing Hou, Xiangyang Li, Pengjie Gu, Zhenhua Dong, Ruiming Tang, Yi Cai
Journal-ref: Proceedings of the 39th Conference on Neural Information Processing Systems (NeurIPS 2025)
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[73] arXiv:2509.22661 [pdf, other]
Title: Next Point-of-interest (POI) Recommendation Model Based on Multi-modal Spatio-temporal Context Feature Embedding
Lingyu Zhang, Guobin Wu, Yan Wang, Pengfei Xu, Jian Liang, Xuan Song, Yunhai Wang
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[74] arXiv:2509.22660 [pdf, html, other]
Title: Fairness for niche users and providers: algorithmic choice and profile portability
Elizabeth McKinnie, Anas Buhayh, Clement Canel, Robin Burke
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[75] arXiv:2509.22659 [pdf, html, other]
Title: Federated Consistency- and Complementarity-aware Consensus-enhanced Recommendation
Yunqi Mi, Boyang Yan, Guoshuai Zhao, Jialie Shen, Xueming Qian
Subjects: Information Retrieval (cs.IR)
[76] arXiv:2509.22658 [pdf, html, other]
Title: How good are LLMs at Retrieving Documents in a Specific Domain?
Nafis Tanveer Islam, Zhiming Zhao
Comments: Accepted at FAIEMA Conference 2025. DOI will be provided once the conference publishes the paper
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[77] arXiv:2509.25106 (cross-list from cs.CL) [pdf, html, other]
Title: Towards Personalized Deep Research: Benchmarks and Evaluations
Yuan Liang, Jiaxian Li, Yuqing Wang, Piaohong Wang, Motong Tian, Pai Liu, Shuofei Qiao, Runnan Fang, He Zhu, Ge Zhang, Minghao Liu, Yuchen Eleanor Jiang, Ningyu Zhang, Wangchunshu Zhou
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[78] arXiv:2509.25085 (cross-list from cs.CL) [pdf, html, other]
Title: jina-reranker-v3: Last but Not Late Interaction for Listwise Document Reranking
Feng Wang, Yuqing Li, Han Xiao
Comments: early draft, CodeIR table needs to be updated (qwen baselines are missing)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[79] arXiv:2509.25084 (cross-list from cs.CL) [pdf, html, other]
Title: Scaling Generalist Data-Analytic Agents
Shuofei Qiao, Yanqiu Zhao, Zhisong Qiu, Xiaobin Wang, Jintian Zhang, Zhao Bin, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[80] arXiv:2509.24815 (cross-list from cs.DS) [pdf, html, other]
Title: Efficient Sketching and Nearest Neighbor Search Algorithms for Sparse Vector Sets
Sebastian Bruch, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini
Subjects: Data Structures and Algorithms (cs.DS); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[81] arXiv:2509.24405 (cross-list from cs.CL) [pdf, html, other]
Title: Multilingual Text-to-SQL: Benchmarking the Limits of Language Models with Collaborative Language Agents
Khanh Trinh Pham, Thu Huong Nguyen, Jun Jo, Quoc Viet Hung Nguyen, Thanh Tam Nguyen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Emerging Technologies (cs.ET); Information Retrieval (cs.IR)
[82] arXiv:2509.24193 (cross-list from cs.CL) [pdf, html, other]
Title: AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
Ran Xu, Yuchen Zhuang, Zihan Dong, Jonathan Wang, Yue Yu, Joyce C. Ho, Linjun Zhang, Haoyu Wang, Wenqi Shi, Carl Yang
Comments: Accepted to NeurIPS 2025 (Spotlight)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[83] arXiv:2509.23883 (cross-list from cs.CL) [pdf, html, other]
Title: DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning
Yibo Yan, Guangwei Xu, Xin Zou, Shuliang Liu, James Kwok, Xuming Hu
Comments: Under review
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[84] arXiv:2509.23742 (cross-list from cs.LG) [pdf, html, other]
Title: GBSK: Skeleton Clustering via Granular-ball Computing and Multi-Sampling for Large-Scale Data
Yewang Chen, Junfeng Li, Shuyin Xia, Qinghong Lai, Xinbo Gao, Guoyin Wang, Dongdong Cheng, Yi Liu, Yi Wang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[85] arXiv:2509.23577 (cross-list from cs.DB) [pdf, html, other]
Title: ML-Asset Management: Curation, Discovery, and Utilization
Mengying Wang, Moming Duan, Yicong Huang, Chen Li, Bingsheng He, Yinghui Wu
Comments: Tutorial, VLDB 2025. Project page: this https URL
Journal-ref: PVLDB, 18(12): 5493 - 5498, 2025
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[86] arXiv:2509.23471 (cross-list from cs.LG) [pdf, html, other]
Title: Drift-Adapter: A Practical Approach to Near Zero-Downtime Embedding Model Upgrades in Vector Databases
Harshil Vejendla
Comments: EMNLP 2025 Main 12 pages, 6 figures
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR)
[87] arXiv:2509.23338 (cross-list from cs.DB) [pdf, other]
Title: PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation
Wei Zhou, Guoliang Li, Haoyu Wang, Yuxing Han, Xufei Wu, Fan Wu, Xuanhe Zhou
Comments: To appear in NeurIPS 2025. Welcome your submission to challenge our leaderboard at: this https URL. Also visit our code repository at: this https URL
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[88] arXiv:2509.22991 (cross-list from cs.CL) [pdf, html, other]
Title: ADAM: A Diverse Archive of Mankind for Evaluating and Enhancing LLMs in Biographical Reasoning
Jasin Cekinmez, Omid Ghahroodi, Saad Fowad Chandle, Dhiman Gupta, Ehsaneddin Asgari
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[89] arXiv:2509.22845 (cross-list from cs.CL) [pdf, html, other]
Title: Learning to Detect Relevant Contexts and Knowledge for Response Selection in Retrieval-based Dialogue Systems
Kai Hua, Zhiyuan Feng, Chongyang Tao, Rui Yan, Lu Zhang
Comments: 10 pages, 4 figures, accepted by CIKM 2020
Journal-ref: Proc. CIKM 20, pp. 525-534, 2020
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)