Search | arXiv e-print repository

ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring

Authors: Kaili Huang, Thejas Venkatesh, Uma Dingankar, Antonio Mallia, Daniel Campos, Jian Jiao, Christopher Potts, Matei Zaharia, Kwabena Boahen, Omar Khattab, Saarthak Sarup, Keshav Santhanam

Abstract: We study serving retrieval models, specifically late interaction models like ColBERT, to many concurrent users at once and under a small budget, in which the index may not fit in memory. We present ColBERT-serve, a novel serving system that applies a memory-mapping strategy to the ColBERT index, reducing RAM usage by 90% and permitting its deployment on cheap servers, and incorporates a multi-stag… ▽ More We study serving retrieval models, specifically late interaction models like ColBERT, to many concurrent users at once and under a small budget, in which the index may not fit in memory. We present ColBERT-serve, a novel serving system that applies a memory-mapping strategy to the ColBERT index, reducing RAM usage by 90% and permitting its deployment on cheap servers, and incorporates a multi-stage architecture with hybrid scoring, reducing ColBERT's query latency and supporting many concurrent queries in parallel. △ Less

Submitted 21 April, 2025; originally announced April 2025.

Comments: Accepted by ECIR 2025

arXiv:2210.08340 [pdf]

Toward Next-Generation Artificial Intelligence: Catalyzing the NeuroAI Revolution

Authors: Anthony Zador, Sean Escola, Blake Richards, Bence Ölveczky, Yoshua Bengio, Kwabena Boahen, Matthew Botvinick, Dmitri Chklovskii, Anne Churchland, Claudia Clopath, James DiCarlo, Surya Ganguli, Jeff Hawkins, Konrad Koerding, Alexei Koulakov, Yann LeCun, Timothy Lillicrap, Adam Marblestone, Bruno Olshausen, Alexandre Pouget, Cristina Savin, Terrence Sejnowski, Eero Simoncelli, Sara Solla, David Sussillo , et al. (2 additional authors not shown)

Abstract: Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts… ▽ More Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts the focus from those capabilities like game playing and language that are especially well-developed or uniquely human to those capabilities, inherited from over 500 million years of evolution, that are shared with all animals. Building models that can pass the embodied Turing test will provide a roadmap for the next generation of AI. △ Less

Submitted 22 February, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

Comments: White paper, 10 pages + 8 pages of references, 1 figures

arXiv:2108.11820 [pdf, ps, other]

Large Large deviations for spatial telecommunication systems: The boolean model

Authors: A. K. Boahen, T. Katsekpor, K. Doku-Amponsah

Abstract: Spatial telecommunication systems have evolved along the years, leading to some concerns that telecommunication companies are facing today. The main inquietude is the ability to provide quality service to customers or users in a dense regime. Therefore, questions such as : what is the best possible configurations of base stations and users that maximizes quality service? Is it possible to estimate… ▽ More Spatial telecommunication systems have evolved along the years, leading to some concerns that telecommunication companies are facing today. The main inquietude is the ability to provide quality service to customers or users in a dense regime. Therefore, questions such as : what is the best possible configurations of base stations and users that maximizes quality service? Is it possible to estimate and control the probability of bad service, which may be seen as a rare event? and many more arise. These questions often involve estimating the tail distribution of events, which falls under the scope of large deviation principles. In this article, we associate with the Boolean model, the empirical marked measure which will serve as a statistic for the intensity measure of the Marked Poisson Point Process of devices or users and the empirical connectivity measure which will serve as a statistic for coverage probability density of the spatial telecommunication area. For these empirical measures, prove large deviation principle (LDP) for well-defined empirical measures. △ Less

Submitted 22 October, 2022; v1 submitted 26 August, 2021; originally announced August 2021.

Comments: 17 pages

MSC Class: 60F10; 05C80; 68Q87

arXiv:1703.02629 [pdf, ps, other]

Online Learning Without Prior Information

Authors: Ashok Cutkosky, Kwabena Boahen

Abstract: The vast majority of optimization and online learning algorithms today require some prior information about the data (often in the form of bounds on gradients or on the optimal parameter value). When this information is not available, these algorithms require laborious manual tuning of various hyperparameters, motivating the search for algorithms that can adapt to the data with no prior informatio… ▽ More The vast majority of optimization and online learning algorithms today require some prior information about the data (often in the form of bounds on gradients or on the optimal parameter value). When this information is not available, these algorithms require laborious manual tuning of various hyperparameters, motivating the search for algorithms that can adapt to the data with no prior information. We describe a frontier of new lower bounds on the performance of such algorithms, reflecting a tradeoff between a term that depends on the optimal parameter value and a term that depends on the gradients' rate of growth. Further, we construct a family of algorithms whose performance matches any desired point on this frontier, which no previous algorithm reaches. △ Less

Submitted 5 June, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

Comments: 12 pages main text; 35 pages total; COLT 2017

arXiv:1703.02622 [pdf, other]

Online Convex Optimization with Unconstrained Domains and Losses

Authors: Ashok Cutkosky, Kwabena Boahen

Abstract: We propose an online convex optimization algorithm (RescaledExp) that achieves optimal regret in the unconstrained setting without prior knowledge of any bounds on the loss functions. We prove a lower bound showing an exponential separation between the regret of existing algorithms that require a known bound on the loss functions and any algorithm that does not require such knowledge. RescaledExp… ▽ More We propose an online convex optimization algorithm (RescaledExp) that achieves optimal regret in the unconstrained setting without prior knowledge of any bounds on the loss functions. We prove a lower bound showing an exponential separation between the regret of existing algorithms that require a known bound on the loss functions and any algorithm that does not require such knowledge. RescaledExp matches this lower bound asymptotically in the number of iterations. RescaledExp is naturally hyperparameter-free and we demonstrate empirically that it matches prior optimization algorithms that require hyperparameter optimization. △ Less

Submitted 7 March, 2017; originally announced March 2017.

Journal ref: Advances in Neural Information Processing Systems 29 (2016) 748-756

Showing 1–5 of 5 results for author: Boahen, K