RAE: A Neural Network Dimensionality Reduction Method for Nearest Neighbors Preservation in Vector Search

Zhang, Han; Zhao, Dongfang

Computer Science > Information Retrieval

arXiv:2509.25839 (cs)

[Submitted on 30 Sep 2025]

Title:RAE: A Neural Network Dimensionality Reduction Method for Nearest Neighbors Preservation in Vector Search

Authors:Han Zhang, Dongfang Zhao

View PDF HTML (experimental)

Abstract:While high-dimensional embedding vectors are being increasingly employed in various tasks like Retrieval-Augmented Generation and Recommendation Systems, popular dimensionality reduction (DR) methods such as PCA and UMAP have rarely been adopted for accelerating the retrieval process due to their inability of preserving the nearest neighbor (NN) relationship among vectors. Empowered by neural networks' optimization capability and the bounding effect of Rayleigh quotient, we propose a Regularized Auto-Encoder (RAE) for k-NN preserving dimensionality reduction. RAE constrains the network parameter variation through regularization terms, adjusting singular values to control embedding magnitude changes during reduction, thus preserving k-NN relationships. We provide a rigorous mathematical analysis demonstrating that regularization establishes an upper bound on the norm distortion rate of transformed vectors, thereby offering provable guarantees for k-NN preservation. With modest training overhead, RAE achieves superior k-NN recall compared to existing DR approaches while maintaining fast retrieval efficiency.

Comments:	submitted to ICLR 2026
Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Databases (cs.DB)
Cite as:	arXiv:2509.25839 [cs.IR]
	(or arXiv:2509.25839v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2509.25839

Submission history

From: Han Zhang [view email]
[v1] Tue, 30 Sep 2025 06:25:38 UTC (394 KB)

Computer Science > Information Retrieval

Title:RAE: A Neural Network Dimensionality Reduction Method for Nearest Neighbors Preservation in Vector Search

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:RAE: A Neural Network Dimensionality Reduction Method for Nearest Neighbors Preservation in Vector Search

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators