Skip to main content

Showing 1–9 of 9 results for author: Louis, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.07109  [pdf, other

    cs.IR cs.AI cs.CL

    OSCAR: Online Soft Compression And Reranking

    Authors: Maxime Louis, Thibault Formal, Hervé Dejean, Stéphane Clinchant

    Abstract: Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by integrating external knowledge, leading to improved accuracy and relevance. However, scaling RAG pipelines remains computationally expensive as retrieval sizes grow. To address this, we introduce OSCAR, a novel query-dependent online soft compression method that reduces computational overhead while preserving performance… ▽ More

    Submitted 17 March, 2025; originally announced April 2025.

  2. arXiv:2504.02411  [pdf, other

    cs.CL

    Adapting Large Language Models for Multi-Domain Retrieval-Augmented-Generation

    Authors: Alexandre Misrahi, Nadezhda Chirkova, Maxime Louis, Vassilina Nikoulina

    Abstract: Retrieval-Augmented Generation (RAG) enhances LLM factuality, but multi-domain applications face challenges like lack of diverse benchmarks and poor out-of-domain generalization. The first contribution of this work is to introduce a diverse benchmark comprising a variety of question-answering tasks from 8 sources and covering 13 domains. Our second contribution consists in systematically testing o… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: 25 pages, 8 figures, 21 tables

  3. arXiv:2501.16075  [pdf, other

    cs.CL cs.AI cs.IR

    PISCO: Pretty Simple Compression for Retrieval-Augmented Generation

    Authors: Maxime Louis, Hervé Déjean, Stéphane Clinchant

    Abstract: Retrieval-Augmented Generation (RAG) pipelines enhance Large Language Models (LLMs) by retrieving relevant documents, but they face scalability issues due to high inference costs and limited context size. Document compression is a practical solution, but current soft compression methods suffer from accuracy losses and require extensive pretraining. In this paper, we introduce PISCO, a novel method… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  4. arXiv:2310.10312  [pdf, other

    cs.AI cs.LG q-bio.QM

    End-to-end Offline Reinforcement Learning for Glycemia Control

    Authors: Tristan Beolet, Alice Adenis, Erik Huneker, Maxime Louis

    Abstract: The development of closed-loop systems for glycemia control in type I diabetes relies heavily on simulated patients. Improving the performances and adaptability of these close-loops raises the risk of over-fitting the simulator. This may have dire consequences, especially in unusual cases which were not faithfully-if at all-captured by the simulator. To address this, we propose to use offline RL a… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  5. arXiv:2201.12027  [pdf, other

    cs.AR cs.LG cs.PF

    Puppeteer: A Random Forest-based Manager for Hardware Prefetchers across the Memory Hierarchy

    Authors: Furkan Eris, Marcia S. Louis, Kubra Eris, Jose L. Abellan, Ajay Joshi

    Abstract: Over the years, processor throughput has steadily increased. However, the memory throughput has not increased at the same rate, which has led to the memory wall problem in turn increasing the gap between effective and theoretical peak processor performance. To cope with this, there has been an abundance of work in the area of data/instruction prefetcher designs. Broadly, prefetchers predict future… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  6. arXiv:2101.10674  [pdf, other

    eess.IV cs.CV cs.LG

    Leveraging 3D Information in Unsupervised Brain MRI Segmentation

    Authors: Benjamin Lambert, Maxime Louis, Senan Doyle, Florence Forbes, Michel Dojat, Alan Tucholka

    Abstract: Automatic segmentation of brain abnormalities is challenging, as they vary considerably from one pathology to another. Current methods are supervised and require numerous annotated images for each pathology, a strenuous task. To tackle anatomical variability, Unsupervised Anomaly Detection (UAD) methods are proposed, detecting anomalies as outliers of a healthy model learned using a Variational Au… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: Accepted for presentation at IEEE International Symposium on Biomedical Imaging 2021

  7. arXiv:1711.08725  [pdf, other

    cs.CV math.DG stat.ML

    Parallel transport in shape analysis: a scalable numerical scheme

    Authors: Maxime Louis, Alexandre Bône, Benjamin Charlier, Stanley Durrleman

    Abstract: The analysis of manifold-valued data requires efficient tools from Riemannian geometry to cope with the computational complexity at stake. This complexity arises from the always-increasing dimension of the data, and the absence of closed-form expressions to basic operations such as the Riemannian logarithm. In this paper, we adapt a generic numerical scheme recently introduced for computing parall… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

  8. Prediction of the progression of subcortical brain structures in Alzheimer's disease from baseline

    Authors: Alexandre Bône, Maxime Louis, Alexandre Routier, Jorge Samper, Michael Bacci, Benjamin Charlier, Olivier Colliot, Stanley Durrleman

    Abstract: We propose a method to predict the subject-specific longitudinal progression of brain structures extracted from baseline MRI, and evaluate its performance on Alzheimer's disease data. The disease progression is modeled as a trajectory on a group of diffeomorphisms in the context of large deformation diffeomorphic metric mapping (LDDMM). We first exhibit the limited predictive abilities of geodesic… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

  9. arXiv:1704.02978  [pdf, other

    cs.DC stat.ML

    Field of Groves: An Energy-Efficient Random Forest

    Authors: Zafar Takhirov, Joseph Wang, Marcia S. Louis, Venkatesh Saligrama, Ajay Joshi

    Abstract: Machine Learning (ML) algorithms, like Convolutional Neural Networks (CNN), Support Vector Machines (SVM), etc. have become widespread and can achieve high statistical performance. However their accuracy decreases significantly in energy-constrained mobile and embedded systems space, where all computations need to be completed under a tight energy budget. In this work, we present a field of groves… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: Submitted as Work in Progress to DAC'17