Skip to main content

Showing 1–50 of 164 results for author: Thomas, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.18885  [pdf, ps, other

    cs.RO cs.CV

    GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM

    Authors: Annika Thomas, Aneesa Sonawalla, Alex Rose, Jonathan P. How

    Abstract: 3D Gaussian splatting has emerged as an expressive scene representation for RGB-D visual SLAM, but its application to large-scale, multi-agent outdoor environments remains unexplored. Multi-agent Gaussian SLAM is a promising approach to rapid exploration and reconstruction of environments, offering scalable environment representations, but existing approaches are limited to small-scale, indoor env… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  2. arXiv:2506.16940  [pdf, ps, other

    cs.CV

    LunarLoc: Segment-Based Global Localization on the Moon

    Authors: Annika Thomas, Robaire Galliath, Aleksander Garbuz, Luke Anger, Cormac O'Neill, Trevor Johst, Dami Thomas, George Lordos, Jonathan P. How

    Abstract: Global localization is necessary for autonomous operations on the lunar surface where traditional Earth-based navigation infrastructure, such as GPS, is unavailable. As NASA advances toward sustained lunar presence under the Artemis program, autonomous operations will be an essential component of tasks such as robotic exploration and infrastructure deployment. Tasks such as excavation and transpor… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  3. arXiv:2506.14111  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Essential-Web v1.0: 24T tokens of organized web data

    Authors: Essential AI, :, Andrew Hojel, Michael Pust, Tim Romanski, Yash Vanjani, Ritvik Kapila, Mohit Parmar, Adarsh Chaluvaraju, Alok Tripathy, Anil Thomas, Ashish Tanwer, Darsh J Shah, Ishaan Shah, Karl Stratos, Khoi Nguyen, Kurt Smith, Michael Callahan, Peter Rushton, Philip Monk, Platon Mazarakis, Saad Jamal, Saurabh Srivastava, Somanshu Singla, Ashish Vaswani

    Abstract: Data plays the most prominent role in how language models acquire skills and knowledge. The lack of massive, well-organized pre-training datasets results in costly and inaccessible data pipelines. We present Essential-Web v1.0, a 24-trillion-token dataset in which every document is annotated with a twelve-category taxonomy covering topic, format, content complexity, and quality. Taxonomy labels ar… ▽ More

    Submitted 19 June, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

    Comments: include MegaMath-Web-Pro

  4. arXiv:2505.24223  [pdf, ps, other

    cs.CL

    Automated Structured Radiology Report Generation

    Authors: Jean-Benoit Delbrouck, Justin Xu, Johannes Moll, Alois Thomas, Zhihong Chen, Sophie Ostmeier, Asfandyar Azhar, Kelvin Zhenghao Li, Andrew Johnston, Christian Bluethgen, Eduardo Reis, Mohamed Muneer, Maya Varma, Curtis Langlotz

    Abstract: Automated radiology report generation from chest X-ray (CXR) images has the potential to improve clinical efficiency and reduce radiologists' workload. However, most datasets, including the publicly available MIMIC-CXR and CheXpert Plus, consist entirely of free-form reports, which are inherently variable and unstructured. This variability poses challenges for both generation and evaluation: exist… ▽ More

    Submitted 2 June, 2025; v1 submitted 30 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL Main 2025

  5. arXiv:2505.17576  [pdf, ps, other

    cs.RO

    CU-Multi: A Dataset for Multi-Robot Data Association

    Authors: Doncey Albin, Miles Mena, Annika Thomas, Harel Biggie, Xuefei Sun, Dusty Woods, Steve McGuire, Christoffer Heckman

    Abstract: Multi-robot systems (MRSs) are valuable for tasks such as search and rescue due to their ability to coordinate over shared observations. A central challenge in these systems is aligning independently collected perception data across space and time, i.e., multi-robot data association. While recent advances in collaborative SLAM (C-SLAM), map merging, and inter-robot loop closure detection have sign… ▽ More

    Submitted 2 July, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

    Comments: 8 pages, 6 figures, 4 tables

  6. arXiv:2505.07141  [pdf, ps, other

    cs.RO

    Terrain-aware Low Altitude Path Planning

    Authors: Yixuan Jia, Andrea Tagliabue, Annika Thomas, Navid Dadkhah Tehrani, Jonathan P. How

    Abstract: In this paper, we study the problem of generating low-altitude path plans for nap-of-the-earth (NOE) flight in real time with only RGB images from onboard cameras and the vehicle pose. We propose a novel training method that combines behavior cloning and self-supervised learning, where the self-supervision component allows the learned policy to refine the paths generated by the expert planner. Sim… ▽ More

    Submitted 23 June, 2025; v1 submitted 11 May, 2025; originally announced May 2025.

  7. arXiv:2505.02222  [pdf, other

    cs.LG stat.ML

    Practical Efficiency of Muon for Pretraining

    Authors: Essential AI, :, Ishaan Shah, Anthony M. Polloreno, Karl Stratos, Philip Monk, Adarsh Chaluvaraju, Andrew Hojel, Andrew Ma, Anil Thomas, Ashish Tanwer, Darsh J Shah, Khoi Nguyen, Kurt Smith, Michael Callahan, Michael Pust, Mohit Parmar, Peter Rushton, Platon Mazarakis, Ritvik Kapila, Saurabh Srivastava, Somanshu Singla, Tim Romanski, Yash Vanjani, Ashish Vaswani

    Abstract: We demonstrate that Muon, the simplest instantiation of a second-order optimizer, explicitly expands the Pareto frontier over AdamW on the compute-time tradeoff. We find that Muon is more effective than AdamW in retaining data efficiency at large batch sizes, far beyond the so-called critical batch size, while remaining computationally efficient, thus enabling more economical training. We study th… ▽ More

    Submitted 19 May, 2025; v1 submitted 4 May, 2025; originally announced May 2025.

  8. arXiv:2504.19561  [pdf, other

    cs.LG

    Quantifying Memory Utilization with Effective State-Size

    Authors: Rom N. Parnichkun, Neehal Tumma, Armin W. Thomas, Alessandro Moro, Qi An, Taiji Suzuki, Atsushi Yamashita, Michael Poli, Stefano Massaroli

    Abstract: The need to develop a general framework for architecture analysis is becoming increasingly important, given the expanding design space of sequence models. To this end, we draw insights from classical signal processing and control theory, to develop a quantitative measure of \textit{memory utilization}: the internal mechanisms through which a model stores past information to produce future outputs.… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  9. arXiv:2504.17690  [pdf, other

    quant-ph cs.LG

    On the Generalization of Adversarially Trained Quantum Classifiers

    Authors: Petros Georgiou, Aaron Mark Thomas, Sharu Theresa Jose, Osvaldo Simeone

    Abstract: Quantum classifiers are vulnerable to adversarial attacks that manipulate their input classical or quantum data. A promising countermeasure is adversarial training, where quantum classifiers are trained by using an attack-aware, adversarial loss function. This work establishes novel bounds on the generalization error of adversarially trained quantum classifiers when tested in the presence of pertu… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 22 pages, 6 figures

  10. arXiv:2504.07310  [pdf, other

    cs.SE

    Dependency Update Adoption Patterns in the Maven Software Ecosystem

    Authors: Baltasar Berretta, Augustus Thomas, Heather Guarnera

    Abstract: Regular dependency updates protect dependent software components from upstream bugs, security vulnerabilities, and poor code quality. Measures of dependency updates across software ecosystems involve two key dimensions: the time span during which a release is being newly adopted (adoption lifespan) and the extent of adoption across the ecosystem (adoption reach). We examine correlations between ad… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: Pre-print for MSR 2025, see https://2025.msrconf.org/details/msr-2025-mining-challenge/19/Dependency-Update-Adoption-Patterns-in-the-Maven-Software-Ecosystem

  11. arXiv:2504.04022  [pdf, other

    cs.CL cs.AI

    Rethinking Reflection in Pre-Training

    Authors: Essential AI, :, Darsh J Shah, Peter Rushton, Somanshu Singla, Mohit Parmar, Kurt Smith, Yash Vanjani, Ashish Vaswani, Adarsh Chaluvaraju, Andrew Hojel, Andrew Ma, Anil Thomas, Anthony Polloreno, Ashish Tanwer, Burhan Drak Sibai, Divya S Mansingka, Divya Shivaprasad, Ishaan Shah, Karl Stratos, Khoi Nguyen, Michael Callahan, Michael Pust, Mrinal Iyer, Philip Monk , et al. (4 additional authors not shown)

    Abstract: A language model's ability to reflect on its own reasoning provides a key advantage for solving complex problems. While most recent research has focused on how this ability develops during reinforcement learning, we show that it actually begins to emerge much earlier - during the model's pre-training. To study this, we introduce deliberate errors into chains-of-thought and test whether the model c… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  12. arXiv:2504.03623  [pdf, other

    cs.CV

    Quantifying the uncertainty of model-based synthetic image quality metrics

    Authors: Ciaran Bench, Spencer A. Thomas

    Abstract: The quality of synthetically generated images (e.g. those produced by diffusion models) are often evaluated using information about image contents encoded by pretrained auxiliary models. For example, the Fréchet Inception Distance (FID) uses embeddings from an InceptionV3 model pretrained to classify ImageNet. The effectiveness of this feature embedding model has considerable impact on the trustwo… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  13. arXiv:2504.03486  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej

    Authors: Shubham Kumar Nigam, Balaramamahanthi Deepak Patnaik, Ajay Varghese Thomas, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: Automating legal document drafting can significantly enhance efficiency, reduce manual effort, and streamline legal workflows. While prior research has explored tasks such as judgment prediction and case summarization, the structured generation of private legal documents in the Indian legal domain remains largely unaddressed. To bridge this gap, we introduce VidhikDastaavej, a novel, anonymized da… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  14. arXiv:2503.08817  [pdf, other

    cs.RO

    Geometric Data-Driven Multi-Jet Locomotion Inspired by Salps

    Authors: Yanhao Yang, Nina L. Hecht, Yousef Salaman-Maclara, Nathan Justus, Zachary A. Thomas, Farhan Rozaidi, Ross L. Hatton

    Abstract: Salps are marine animals consisting of chains of jellyfish-like units. Their capacity for effective underwater undulatory locomotion through coordinating multi-jet propulsion has aroused significant interest in the field of robotics and inspired extensive research including design, modeling, and control. In this paper, we conduct a comprehensive analysis of the locomotion of salp-like systems usin… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 17 pages, 13 figures

  15. arXiv:2503.08719  [pdf, other

    eess.IV cs.CV cs.LG

    QuantU-Net: Efficient Wearable Medical Imaging Using Bitwidth as a Trainable Parameter

    Authors: Christiaan Boerkamp, Akhil John Thomas

    Abstract: Medical image segmentation, particularly tumor segmentation, is a critical task in medical imaging, with U-Net being a widely adopted convolutional neural network (CNN) architecture for this purpose. However, U-Net's high computational and memory requirements pose challenges for deployment on resource-constrained devices such as wearable medical systems. This paper addresses these challenges by in… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  16. arXiv:2503.07700  [pdf, other

    cs.RO cs.AI

    A Task and Motion Planning Framework Using Iteratively Deepened AND/OR Graph Networks

    Authors: Hossein Karami, Antony Thomas, Fulvio Mastrogiovanni

    Abstract: In this paper, we present an approach for integrated task and motion planning based on an AND/OR graph network, which is used to represent task-level states and actions, and we leverage it to implement different classes of task and motion planning problems (TAMP). Several problems that fall under task and motion planning do not have a predetermined number of sub-tasks to achieve a goal. For exampl… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Journal ref: Robotics and Autonomous Systems, Volume 189, July 2025, 104943

  17. arXiv:2503.04734  [pdf, ps, other

    cs.CY cs.AI cs.CL

    What can large language models do for sustainable food?

    Authors: Anna T. Thomas, Adam Yee, Andrew Mayne, Maya B. Mathur, Dan Jurafsky, Kristina Gligorić

    Abstract: Food systems are responsible for a third of human-caused greenhouse gas emissions. We investigate what Large Language Models (LLMs) can contribute to reducing the environmental impacts of food production. We define a typology of design and prediction tasks based on the sustainable food literature and collaboration with domain experts, and evaluate six LLMs on four tasks in our typology. For exampl… ▽ More

    Submitted 28 June, 2025; v1 submitted 2 February, 2025; originally announced March 2025.

    Comments: ICML camera ready version

  18. arXiv:2503.02112  [pdf, other

    cs.LG astro-ph.IM

    Building Machine Learning Challenges for Anomaly Detection in Science

    Authors: Elizabeth G. Campolongo, Yuan-Tang Chou, Ekaterina Govorkova, Wahid Bhimji, Wei-Lun Chao, Chris Harris, Shih-Chieh Hsu, Hilmar Lapp, Mark S. Neubauer, Josephine Namayanja, Aneesh Subramanian, Philip Harris, Advaith Anand, David E. Carlyn, Subhankar Ghosh, Christopher Lawrence, Eric Moreno, Ryan Raikman, Jiaman Wu, Ziheng Zhang, Bayu Adhi, Mohammad Ahmadi Gharehtoragh, Saúl Alonso Monsalve, Marta Babicz, Furqan Baig , et al. (125 additional authors not shown)

    Abstract: Scientific discoveries are often made by finding a pattern or object that was not predicted by the known rules of science. Oftentimes, these anomalous events or objects that do not conform to the norms are an indication that the rules of science governing the data are incomplete, and something new needs to be present to explain these unexpected outliers. The challenge of finding anomalies can be c… ▽ More

    Submitted 29 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: 17 pages 6 figures to be submitted to Nature Communications

  19. arXiv:2502.15425  [pdf, other

    cs.AI cs.LG eess.SY

    TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning

    Authors: Giuseppe Paolo, Abdelhakim Benechehab, Hamza Cherkaoui, Albert Thomas, Balázs Kégl

    Abstract: Hierarchical organization is fundamental to biological systems and human societies, yet artificial intelligence systems often rely on monolithic architectures that limit adaptability and scalability. Current hierarchical reinforcement learning (HRL) approaches typically restrict hierarchies to two levels or require centralized training, which limits their practical applicability. We introduce TAME… ▽ More

    Submitted 5 March, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

  20. arXiv:2502.10235  [pdf, other

    stat.ML cs.LG

    AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting

    Authors: Abdelhakim Benechehab, Vasilii Feofanov, Giuseppe Paolo, Albert Thomas, Maurizio Filippone, Balázs Kégl

    Abstract: Pre-trained foundation models (FMs) have shown exceptional performance in univariate time series forecasting tasks. However, several practical challenges persist, including managing intricate dependencies among features and quantifying uncertainty in predictions. This study aims to tackle these critical limitations by introducing adapters; feature-space transformations that facilitate the effectiv… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  21. arXiv:2502.02475  [pdf, other

    eess.IV cs.CV physics.med-ph

    Style transfer as data augmentation: evaluating unpaired image-to-image translation models in mammography

    Authors: Emir Ahmed, Spencer A. Thomas, Ciaran Bench

    Abstract: Several studies indicate that deep learning models can learn to detect breast cancer from mammograms (X-ray images of the breasts). However, challenges with overfitting and poor generalisability prevent their routine use in the clinic. Models trained on data from one patient population may not perform well on another due to differences in their data domains, emerging due to variations in scanning… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

  22. arXiv:2501.17570  [pdf, other

    eess.IV cs.CV physics.med-ph

    Trustworthy image-to-image translation: evaluating uncertainty calibration in unpaired training scenarios

    Authors: Ciaran Bench, Emir Ahmed, Spencer A. Thomas

    Abstract: Mammographic screening is an effective method for detecting breast cancer, facilitating early diagnosis. However, the current need to manually inspect images places a heavy burden on healthcare systems, spurring a desire for automated diagnostic protocols. Techniques based on deep neural networks have been shown effective in some studies, but their tendency to overfit leaves considerable risk for… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

  23. arXiv:2501.11434  [pdf, other

    cs.RO

    An Incremental Sampling and Segmentation-Based Approach for Motion Planning Infeasibility

    Authors: Antony Thomas, Fulvio Mastrogiovanni, Marco Baglietto

    Abstract: We present a simple and easy-to-implement algorithm to detect plan infeasibility in kinematic motion planning. Our method involves approximating the robot's configuration space to a discrete space, where each degree of freedom has a finite set of values. The obstacle region separates the free configuration space into different connected regions. For a path to exist between the start and goal confi… ▽ More

    Submitted 28 April, 2025; v1 submitted 20 January, 2025; originally announced January 2025.

  24. arXiv:2501.09878  [pdf, other

    cs.CV cs.AI

    ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction

    Authors: Izzeddin Teeti, Aniket Thomas, Munish Monga, Sachin Kumar, Uddeshya Singh, Andrew Bradley, Biplab Banerjee, Fabio Cuzzolin

    Abstract: We present ASTRA (A} Scene-aware TRAnsformer-based model for trajectory prediction), a light-weight pedestrian trajectory forecasting model that integrates the scene context, spatial dynamics, social inter-agent interactions and temporal progressions for precise forecasting. We utilised a U-Net-based feature extractor, via its latent vector representation, to capture scene representations and a gr… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  25. arXiv:2501.06076  [pdf, other

    cs.LG

    A monthly sub-national Harmonized Food Insecurity Dataset for comprehensive analysis and predictive modeling

    Authors: Mélissande Machefer, Michele Ronco, Anne-Claire Thomas, Michael Assouline, Melanie Rabier, Christina Corbane, Felix Rembold

    Abstract: Food security is a complex, multidimensional concept challenging to measure comprehensively. Effective anticipation, monitoring, and mitigation of food crises require timely and comprehensive global data. This paper introduces the Harmonized Food Insecurity Dataset (HFID), an open-source resource consolidating four key data sources: the Integrated Food Security Phase Classification (IPC)/Cadre Har… ▽ More

    Submitted 13 January, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

    Comments: The authors Melissande Machefer and Michele Ronco have contributed equally as both first authors to this work. This work is currently being reviewed in a peer-reviewed journal

  26. arXiv:2412.13488  [pdf, ps, other

    cs.CL cs.AI

    Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models

    Authors: Xinxin Liu, Aaron Thomas, Cheng Zhang, Jianyi Cheng, Yiren Zhao, Xitong Gao

    Abstract: Parameter-Efficient Fine-Tuning (PEFT) has gained prominence through low-rank adaptation methods like LoRA. In this paper, we focus on sparsity-based PEFT (SPEFT), which introduces trainable sparse adaptations to the weight matrices in the model, offering greater flexibility in selecting fine-tuned parameters compared to low-rank methods. We conduct the first systematic evaluation of salience metr… ▽ More

    Submitted 27 June, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: ACL 2025

  27. arXiv:2411.17800  [pdf, other

    cs.LG cs.AI cs.NE

    STAR: Synthesis of Tailored Architectures

    Authors: Armin W. Thomas, Rom Parnichkun, Alexander Amini, Stefano Massaroli, Michael Poli

    Abstract: Iterative improvement of model architectures is fundamental to deep learning: Transformers first enabled scaling, and recent advances in model hybridization have pushed the quality-efficiency frontier. However, optimizing architectures remains challenging and expensive. Current automated or manual approaches fall short, largely due to limited progress in the design of search spaces and due to the… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

  28. arXiv:2411.15266  [pdf, other

    astro-ph.IM cond-mat.dis-nn cond-mat.mtrl-sci cs.RO physics.class-ph

    Continuous Design and Reprogramming of Totimorphic Structures for Space Applications

    Authors: Dominik Dold, Amy Thomas, Nicole Rosi, Jai Grover, Dario Izzo

    Abstract: Recently, a class of mechanical lattices with reconfigurable, zero-stiffness structures has been proposed, called Totimorphic structures. In this work, we introduce a computational framework that allows continuous reprogramming of a Totimorphic lattice's effective properties, such as mechanical and optical properties, via continuous geometric changes alone. Our approach is differentiable and guara… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

  29. arXiv:2411.05270  [pdf

    cs.CL cs.AI

    Seeing Through the Fog: A Cost-Effectiveness Analysis of Hallucination Detection Systems

    Authors: Alexander Thomas, Seth Rosen, Vishnu Vettrivel

    Abstract: This paper presents a comparative analysis of hallucination detection systems for AI, focusing on automatic summarization and question answering tasks for Large Language Models (LLMs). We evaluate different hallucination detection systems using the diagnostic odds ratio (DOR) and cost-effectiveness metrics. Our results indicate that although advanced models can perform better they come at a much h… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: 18 pags, 13 figures, 2 tables

    ACM Class: I.2.7

  30. arXiv:2411.03562  [pdf, other

    cs.LG cs.AI

    Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

    Authors: Antoine Grosnit, Alexandre Maraval, James Doran, Giuseppe Paolo, Albert Thomas, Refinath Shahul Hameed Nabeezath Beevi, Jonas Gonzalez, Khyati Khandelwal, Ignacio Iacobacci, Abdelhakim Benechehab, Hamza Cherkaoui, Youssef Attia El-Hili, Kun Shao, Jianye Hao, Jun Yao, Balazs Kegl, Haitham Bou-Ammar, Jun Wang

    Abstract: We introduce Agent K v1.0, an end-to-end autonomous data science agent designed to automate, optimise, and generalise across diverse data science tasks. Fully automated, Agent K v1.0 manages the entire data science life cycle by learning from experience. It leverages a highly flexible structured reasoning framework to enable it to dynamically process memory in a nested structure, effectively learn… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

  31. arXiv:2410.20004  [pdf, other

    cs.CR cs.DC

    Lightweight, Secure and Stateful Serverless Computing with PSL

    Authors: Alexander Thomas, Shubham Mishra, Kaiyuan Chen, John Kubiatowicz

    Abstract: We present PSL, a lightweight, secure and stateful Function-as-a-Serivce (FaaS) framework for Trusted Execution Environments (TEEs). The framework provides rich programming language support on heterogeneous TEE hardware for statically compiled binaries and/or WebAssembly (WASM) bytecodes, with a familiar Key-Value Store (KVS) interface to secure, performant, network-embedded storage. It achieves n… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  32. arXiv:2410.11711  [pdf, other

    stat.ML cs.LG

    Zero-shot Model-based Reinforcement Learning using Large Language Models

    Authors: Abdelhakim Benechehab, Youssef Attia El Hili, Ambroise Odonnat, Oussama Zekri, Albert Thomas, Giuseppe Paolo, Maurizio Filippone, Ievgen Redko, Balázs Kégl

    Abstract: The emerging zero-shot capabilities of Large Language Models (LLMs) have led to their applications in areas extending well beyond natural language processing tasks. In reinforcement learning, while LLMs have been extensively used in text-based environments, their integration with continuous state spaces remains understudied. In this paper, we investigate how pre-trained LLMs can be leveraged to pr… ▽ More

    Submitted 13 February, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Journal ref: The Thirteenth International Conference on Learning Representations (ICLR 2025)

  33. arXiv:2410.08262  [pdf, other

    cs.RO

    ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization

    Authors: Mason B. Peterson, Yixuan Jia, Yulun Tian, Annika Thomas, Jonathan P. How

    Abstract: Global localization is a fundamental capability required for long-term and drift-free robot navigation. However, current methods fail to relocalize when faced with significantly different viewpoints. We present ROMAN (Robust Object Map Alignment Anywhere), a global localization method capable of localizing in challenging and diverse environments by creating and aligning maps of open-set and view-i… ▽ More

    Submitted 28 April, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: 11 pages, 5 figures, accepted to Robotics: Science and Systems (RSS) 2025

  34. arXiv:2409.10339  [pdf

    quant-ph cs.CV cs.LG

    VAE-QWGAN: Addressing Mode Collapse in Quantum GANs via Autoencoding Priors

    Authors: Aaron Mark Thomas, Harry Youel, Sharu Theresa Jose

    Abstract: Recent proposals for quantum generative adversarial networks (GANs) suffer from the issue of mode collapse, analogous to classical GANs, wherein the distribution learnt by the GAN fails to capture the high mode complexities of the target distribution. Mode collapse can arise due to the use of uninformed prior distributions in the generative learning task. To alleviate the issue of mode collapse fo… ▽ More

    Submitted 21 May, 2025; v1 submitted 16 September, 2024; originally announced September 2024.

    Comments: 30 pages, 13 figures

  35. arXiv:2408.04661  [pdf, other

    cs.CL cond-mat.mtrl-sci

    MaterioMiner -- An ontology-based text mining dataset for extraction of process-structure-property entities

    Authors: Ali Riza Durmaz, Akhil Thomas, Lokesh Mishra, Rachana Niranjan Murthy, Thomas Straub

    Abstract: While large language models learn sound statistical representations of the language and information therein, ontologies are symbolic knowledge representations that can complement the former ideally. Research at this critical intersection relies on datasets that intertwine ontologies and text corpora to enable training and comprehensive benchmarking of neurosymbolic models. We present the MaterioMi… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  36. arXiv:2407.05786  [pdf, ps, other

    cs.CL cs.AI

    Large Language Models for Judicial Entity Extraction: A Comparative Study

    Authors: Atin Sakkeer Hussain, Anu Thomas

    Abstract: Domain-specific Entity Recognition holds significant importance in legal contexts, serving as a fundamental task that supports various applications such as question-answering systems, text summarization, machine translation, sentiment analysis, and information retrieval specifically within case law documents. Recent advancements have highlighted the efficacy of Large Language Models in natural lan… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    ACM Class: I.2.1

  37. arXiv:2406.18808  [pdf, other

    q-bio.NC cs.NE

    Binding in hippocampal-entorhinal circuits enables compositionality in cognitive maps

    Authors: Christopher J. Kymn, Sonia Mazelet, Anthony Thomas, Denis Kleyko, E. Paxon Frady, Friedrich T. Sommer, Bruno A. Olshausen

    Abstract: We propose a normative model for spatial representation in the hippocampal formation that combines optimality principles, such as maximizing coding range and spatial information per neuron, with an algebraic framework for computing in distributed representation. Spatial position is encoded in a residue number system, with individual residues represented by high-dimensional, complex-valued vectors.… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 23 pages, 12 figures

  38. Evaluating Tenant-Landlord Tensions Using Generative AI on Online Tenant Forums

    Authors: Xin Chen, Cheng Ren, Timothy A Thomas

    Abstract: Tenant-landlord relationships exhibit a power asymmetry where landlords' power to evict the tenants at a low-cost results in their dominating status in such relationships. Tenant concerns are thus often unspoken, unresolved, or ignored and this could lead to blatant conflicts as suppressed tenant concerns accumulate. Modern machine learning methods and Large Language Models (LLM) have demonstrated… ▽ More

    Submitted 11 March, 2025; v1 submitted 17 April, 2024; originally announced April 2024.

    Journal ref: J Comput Soc Sc 8, 50 (2025)

  39. arXiv:2403.17844  [pdf, other

    cs.LG

    Mechanistic Design and Scaling of Hybrid Architectures

    Authors: Michael Poli, Armin W Thomas, Eric Nguyen, Pragaash Ponnusamy, Björn Deiseroth, Kristian Kersting, Taiji Suzuki, Brian Hie, Stefano Ermon, Christopher Ré, Ce Zhang, Stefano Massaroli

    Abstract: The development of deep learning architectures is a resource-demanding process, due to a vast design space, long prototyping times, and high compute costs associated with at-scale model training and evaluation. We set out to simplify this process by grounding it in an end-to-end mechanistic architecture design (MAD) pipeline, encompassing small-scale capability unit tests predictive of scaling law… ▽ More

    Submitted 19 August, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  40. arXiv:2403.10459  [pdf, other

    cs.LG cs.CV stat.ML

    Understanding the Double Descent Phenomenon in Deep Learning

    Authors: Marc Lafon, Alexandre Thomas

    Abstract: Combining empirical risk minimization with capacity control is a classical strategy in machine learning when trying to control the generalization gap and avoid overfitting, as the model class capacity gets larger. Yet, in modern deep learning practice, very large over-parameterized models (e.g. neural networks) are optimized to fit perfectly the training data and still obtain great generalization… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  41. arXiv:2403.04759  [pdf, other

    cs.LG cs.NE

    Lifelong Intelligence Beyond the Edge using Hyperdimensional Computing

    Authors: Xiaofan Yu, Anthony Thomas, Ivannia Gomez Moreno, Louis Gutierrez, Tajana Rosing

    Abstract: On-device learning has emerged as a prevailing trend that avoids the slow response time and costly communication of cloud-based learning. The ability to learn continuously and indefinitely in a changing environment, and with resource constraints, is critical for real sensor deployments. However, existing designs are inadequate for practical scenarios with (i) streaming data input, (ii) lack of sup… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted by IPSN'24

  42. arXiv:2402.12008  [pdf, other

    cs.LG cs.AI stat.ML

    Cluster Metric Sensitivity to Irrelevant Features

    Authors: Miles McCrory, Spencer A. Thomas

    Abstract: Clustering algorithms are used extensively in data analysis for data exploration and discovery. Technological advancements lead to continually growth of data in terms of volume, dimensionality and complexity. This provides great opportunities in data analytics as the data can be interrogated for many different purposes. This however leads challenges, such as identification of relevant features for… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  43. arXiv:2402.05525  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Differentially Private Deep Model-Based Reinforcement Learning

    Authors: Alexandre Rio, Merwan Barlier, Igor Colin, Albert Thomas

    Abstract: We address private deep offline reinforcement learning (RL), where the goal is to train a policy on standard control tasks that is differentially private (DP) with respect to individual trajectories in the dataset. To achieve this, we introduce PriMORL, a model-based RL algorithm with formal differential privacy guarantees. PriMORL first learns an ensemble of trajectory-level DP models of the envi… ▽ More

    Submitted 9 October, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  44. arXiv:2402.03146  [pdf, other

    cs.LG stat.ML

    A Multi-step Loss Function for Robust Learning of the Dynamics in Model-based Reinforcement Learning

    Authors: Abdelhakim Benechehab, Albert Thomas, Giuseppe Paolo, Maurizio Filippone, Balázs Kégl

    Abstract: In model-based reinforcement learning, most algorithms rely on simulating trajectories from one-step models of the dynamics learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as the length of the trajectory grows. In this paper we tackle this issue by using a multi-step objective to train one-step models. Our objective is a weighted sum of the m… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  45. arXiv:2402.02858  [pdf, other

    cs.LG stat.ML

    Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning

    Authors: Abdelhakim Benechehab, Albert Thomas, Balázs Kégl

    Abstract: We consider the problem of offline reinforcement learning where only a set of system transitions is made available for policy optimization. Following recent advances in the field, we consider a model-based reinforcement learning algorithm that infers the system dynamics from the available data and performs policy optimization on imaginary model rollouts. This approach is vulnerable to exploiting m… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  46. arXiv:2401.04791  [pdf, other

    cs.RO cs.CV

    SOS-Match: Segmentation for Open-Set Robust Correspondence Search and Robot Localization in Unstructured Environments

    Authors: Annika Thomas, Jouko Kinnari, Parker Lusk, Kota Kondo, Jonathan P. How

    Abstract: We present SOS-Match, a novel framework for detecting and matching objects in unstructured environments. Our system consists of 1) a front-end mapping pipeline using a zero-shot segmentation model to extract object masks from images and track them across frames and 2) a frame alignment pipeline that uses the geometric consistency of object relationships to efficiently localize across a variety of… ▽ More

    Submitted 26 November, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: 8 pages, 7 figures

  47. arXiv:2311.14116  [pdf, other

    cs.IT

    Hierarchical Coded Gradient Aggregation Based on Layered MDS Codes

    Authors: M. Nikhil Krishnan, Anoop Thomas, Birenjith Sasidharan

    Abstract: The growing privacy concerns and the communication costs associated with transmitting raw data have resulted in techniques like federated learning, where the machine learning models are trained at the edge nodes, and the parameter updates are shared with a central server. Because communications from the edge nodes are often unreliable, a hierarchical setup involving intermediate helper nodes is co… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Presented at 2023 IEEE International Symposium on Information Theory (ISIT)

  48. arXiv:2311.03655  [pdf, other

    cs.RO cs.MA

    PUMA: Fully Decentralized Uncertainty-aware Multiagent Trajectory Planner with Real-time Image Segmentation-based Frame Alignment

    Authors: Kota Kondo, Claudius T. Tewari, Mason B. Peterson, Annika Thomas, Jouko Kinnari, Andrea Tagliabue, Jonathan P. How

    Abstract: Fully decentralized, multiagent trajectory planners enable complex tasks like search and rescue or package delivery by ensuring safe navigation in unknown environments. However, deconflicting trajectories with other agents and ensuring collision-free paths in a fully decentralized setting is complicated by dynamic elements and localization uncertainty. To this end, this paper presents (1) an uncer… ▽ More

    Submitted 6 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 7 pages, 13 figures, conference paper

  49. arXiv:2310.12109  [pdf, other

    cs.LG

    Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

    Authors: Daniel Y. Fu, Simran Arora, Jessica Grogan, Isys Johnson, Sabri Eyuboglu, Armin W. Thomas, Benjamin Spector, Michael Poli, Atri Rudra, Christopher Ré

    Abstract: Machine learning models are increasingly being scaled in both sequence length and model dimension to reach longer contexts and better performance. However, existing architectures such as Transformers scale quadratically along both these axes. We ask: are there performant architectures that can scale sub-quadratically along sequence length and model dimension? We introduce Monarch Mixer (M2), a new… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 (Oral)

  50. arXiv:2310.05672  [pdf, other

    cs.LG stat.ML

    Multi-timestep models for Model-based Reinforcement Learning

    Authors: Abdelhakim Benechehab, Giuseppe Paolo, Albert Thomas, Maurizio Filippone, Balázs Kégl

    Abstract: In model-based reinforcement learning (MBRL), most algorithms rely on simulating trajectories from one-step dynamics models learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as length of the trajectory grows. In this paper we tackle this issue by using a multi-timestep objective to train one-step models. Our objective is a weighted sum of a los… ▽ More

    Submitted 11 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.