Co-design Hardware and Algorithm for Vector Search

Jiang, Wenqi; Li, Shigang; Zhu, Yu; Licht, Johannes de Fine; He, Zhenhao; Shi, Runbin; Renggli, Cedric; Zhang, Shuai; Rekatsinas, Theodoros; Hoefler, Torsten; Alonso, Gustavo

Computer Science > Machine Learning

arXiv:2306.11182 (cs)

[Submitted on 19 Jun 2023 (v1), last revised 6 Jul 2023 (this version, v3)]

Title:Co-design Hardware and Algorithm for Vector Search

Authors:Wenqi Jiang, Shigang Li, Yu Zhu, Johannes de Fine Licht, Zhenhao He, Runbin Shi, Cedric Renggli, Shuai Zhang, Theodoros Rekatsinas, Torsten Hoefler, Gustavo Alonso

View PDF

Abstract:Vector search has emerged as the foundation for large-scale information retrieval and machine learning systems, with search engines like Google and Bing processing tens of thousands of queries per second on petabyte-scale document datasets by evaluating vector similarities between encoded query texts and web documents. As performance demands for vector search systems surge, accelerated hardware offers a promising solution in the post-Moore's Law era. We introduce \textit{FANNS}, an end-to-end and scalable vector search framework on FPGAs. Given a user-provided recall requirement on a dataset and a hardware resource budget, \textit{FANNS} automatically co-designs hardware and algorithm, subsequently generating the corresponding accelerator. The framework also supports scale-out by incorporating a hardware TCP/IP stack in the accelerator. \textit{FANNS} attains up to 23.0$\times$ and 37.2$\times$ speedup compared to FPGA and CPU baselines, respectively, and demonstrates superior scalability to GPUs, achieving 5.5$\times$ and 7.6$\times$ speedup in median and 95\textsuperscript{th} percentile (P95) latency within an eight-accelerator configuration. The remarkable performance of \textit{FANNS} lays a robust groundwork for future FPGA integration in data centers and AI supercomputers.

Comments:	11 pages
Subjects:	Machine Learning (cs.LG); Databases (cs.DB); Information Retrieval (cs.IR)
Cite as:	arXiv:2306.11182 [cs.LG]
	(or arXiv:2306.11182v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.11182

Submission history

From: Wenqi Jiang [view email]
[v1] Mon, 19 Jun 2023 22:12:37 UTC (23,270 KB)
[v2] Tue, 27 Jun 2023 10:37:34 UTC (23,924 KB)
[v3] Thu, 6 Jul 2023 13:52:31 UTC (23,925 KB)

Computer Science > Machine Learning

Title:Co-design Hardware and Algorithm for Vector Search

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Co-design Hardware and Algorithm for Vector Search

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators