-
ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring
Authors:
Kaili Huang,
Thejas Venkatesh,
Uma Dingankar,
Antonio Mallia,
Daniel Campos,
Jian Jiao,
Christopher Potts,
Matei Zaharia,
Kwabena Boahen,
Omar Khattab,
Saarthak Sarup,
Keshav Santhanam
Abstract:
We study serving retrieval models, specifically late interaction models like ColBERT, to many concurrent users at once and under a small budget, in which the index may not fit in memory. We present ColBERT-serve, a novel serving system that applies a memory-mapping strategy to the ColBERT index, reducing RAM usage by 90% and permitting its deployment on cheap servers, and incorporates a multi-stage architecture with hybrid scoring, reducing ColBERT's query latency and supporting many concurrent queries in parallel.
Submitted 21 April, 2025;
originally announced April 2025.
-
Toward Next-Generation Artificial Intelligence: Catalyzing the NeuroAI Revolution
Authors:
Anthony Zador,
Sean Escola,
Blake Richards,
Bence Ölveczky,
Yoshua Bengio,
Kwabena Boahen,
Matthew Botvinick,
Dmitri Chklovskii,
Anne Churchland,
Claudia Clopath,
James DiCarlo,
Surya Ganguli,
Jeff Hawkins,
Konrad Koerding,
Alexei Koulakov,
Yann LeCun,
Timothy Lillicrap,
Adam Marblestone,
Bruno Olshausen,
Alexandre Pouget,
Cristina Savin,
Terrence Sejnowski,
Eero Simoncelli,
Sara Solla,
David Sussillo, et al. (2 additional authors not shown)
Abstract:
Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts the focus from those capabilities like game playing and language that are especially well-developed or uniquely human to those capabilities, inherited from over 500 million years of evolution, that are shared with all animals. Building models that can pass the embodied Turing test will provide a roadmap for the next generation of AI.
Submitted 22 February, 2023; v1 submitted 15 October, 2022;
originally announced October 2022.
-
Large deviations for spatial telecommunication systems: The boolean model
Authors:
A. K. Boahen,
T. Katsekpor,
K. Doku-Amponsah
Abstract:
Spatial telecommunication systems have evolved over the years, raising concerns that telecommunication companies face today. The main concern is the ability to provide quality service to customers or users in a dense regime. This raises questions such as: What is the best possible configuration of base stations and users that maximizes quality of service? Is it possible to estimate and control the probability of bad service, which may be seen as a rare event? These questions often involve estimating the tail distribution of events, which falls under the scope of large deviation principles. In this article, we associate with the Boolean model two statistics: the empirical marked measure, which serves as a statistic for the intensity measure of the marked Poisson point process of devices or users, and the empirical connectivity measure, which serves as a statistic for the coverage probability density of the spatial telecommunication area. For these empirical measures, we prove large deviation principles (LDPs).
Submitted 22 October, 2022; v1 submitted 26 August, 2021;
originally announced August 2021.
-
Online Learning Without Prior Information
Authors:
Ashok Cutkosky,
Kwabena Boahen
Abstract:
The vast majority of optimization and online learning algorithms today require some prior information about the data (often in the form of bounds on gradients or on the optimal parameter value). When this information is not available, these algorithms require laborious manual tuning of various hyperparameters, motivating the search for algorithms that can adapt to the data with no prior information. We describe a frontier of new lower bounds on the performance of such algorithms, reflecting a tradeoff between a term that depends on the optimal parameter value and a term that depends on the gradients' rate of growth. Further, we construct a family of algorithms whose performance matches any desired point on this frontier, which no previous algorithm reaches.
Submitted 5 June, 2017; v1 submitted 7 March, 2017;
originally announced March 2017.
-
Online Convex Optimization with Unconstrained Domains and Losses
Authors:
Ashok Cutkosky,
Kwabena Boahen
Abstract:
We propose an online convex optimization algorithm (RescaledExp) that achieves optimal regret in the unconstrained setting without prior knowledge of any bounds on the loss functions. We prove a lower bound showing an exponential separation between the regret of existing algorithms that require a known bound on the loss functions and any algorithm that does not require such knowledge. RescaledExp matches this lower bound asymptotically in the number of iterations. RescaledExp is naturally hyperparameter-free and we demonstrate empirically that it matches prior optimization algorithms that require hyperparameter optimization.
Submitted 7 March, 2017;
originally announced March 2017.