-
ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction
Authors:
Adeela Islam,
Stefano Fiorini,
Stuart James,
Pietro Morerio,
Alessio Del Bue
Abstract:
The task of reassembly is a significant challenge across multiple domains, including archaeology, genomics, and molecular docking, requiring the precise placement and orientation of elements to reconstruct an original structure. In this work, we address key limitations in state-of-the-art Deep Learning methods for reassembly, namely i) scalability; ii) multimodality; and iii) real-world applicability: beyond square or simple geometric shapes, realistic and complex erosion, or other real-world problems. We propose ReassembleNet, a method that reduces complexity by representing each input piece as a set of contour keypoints and learning to select the most informative ones using techniques inspired by Graph Neural Network pooling. ReassembleNet effectively lowers computational complexity while enabling the integration of features from multiple modalities, including both geometric and texture data. The model is further enhanced through pretraining on a semi-synthetic dataset. We then apply diffusion-based pose estimation to recover the original structure. We improve on prior methods by 55% and 86% for RMSE Rotation and Translation, respectively.
Submitted 29 May, 2025; v1 submitted 27 May, 2025;
originally announced May 2025.
-
Photomultiplier Requirements and Pre-Calibration for the SABRE South Liquid Scintillator Veto
Authors:
L. J. Milligan,
P. Urquijo,
E. Barberio,
V. U. Bashu,
L. J. Bignell,
I. Bolognino,
S. S. Chhun,
F. Dastgiri,
T. Fruth,
G. Fu,
G. C. Hill,
Y. Hua,
R. S. James,
K. Janssens,
S. Kapoor,
G. J. Lane,
K. T. Leaver,
P. McGee,
L. J. McKie,
J. McKenzie,
P. C. McNamara,
W. J. D. Melbourne,
M. Mews,
W. H. Ng,
K. J. Rule
, et al. (10 additional authors not shown)
Abstract:
We present a study of the oil-proof base Hamamatsu R5912 photomultiplier tubes that will be used in the SABRE South linear-alkylbenzene liquid scintillator veto. SABRE South is a dark matter direct detection experiment at the Stawell Underground Physics Laboratory, aiming to test the DAMA/LIBRA dark matter annual modulation signal. We discuss the requirements of the liquid scintillator system and its photomultipliers, outline the methods and analysis used for the characterisation measurements, and results from initial tests. We discuss the impact of these measurements on the performance of the active veto system and explore analysis methods to allow for low threshold operation. Finally, we include results from a small scale liquid scintillator detector prototype used to assess the future performance of pulse shape discrimination in the liquid scintillator veto, and how well accommodated it is by the R5912 PMTs.
Submitted 19 May, 2025; v1 submitted 15 May, 2025;
originally announced May 2025.
-
Characterisation of Hamamatsu R11065-20 PMTs for use in the SABRE South NaI(Tl) Crystal Detectors
Authors:
O. Stanley,
W. J. D. Melbourne,
P. Urquijo,
E. Barberio,
V. U. Bashu,
L. J. Bignell,
I. Bolognino,
G. Brooks,
S. S. Chhun,
F. Dastgiri,
M. B. Froehlich,
T. Fruth,
G. Fu,
G. C. Hill,
R. S. James,
K. Janssens,
S. Kapoor,
G. J. Lane,
K. T. Leaver,
P. McGee,
P. C. McNamara,
J. McKenzie,
L. J. McKie,
M. Mews,
L. J. Milligan
, et al. (9 additional authors not shown)
Abstract:
The SABRE Experiment is a direct detection dark matter experiment using a target composed of multiple NaI(Tl) crystals. The experiment aims to be an independent check of the DAMA/LIBRA results with a detector in the Northern (Laboratori Nazionali Del Gran Sasso, LNGS) and Southern (Stawell Underground Physics Laboratory, SUPL) hemispheres. The SABRE South photomultiplier tubes (PMTs) will be used near the low energy noise threshold and require a detailed calibration of their performance and contributions to the background in the NaI(Tl) dark matter search, prior to installation. We present the development of the pre-calibration procedures for the R11065-20 Hamamatsu PMTs. These PMTs are directly coupled to the NaI(Tl) crystals within the SABRE South experiment. In this paper we present methodologies to characterise the gain, dark rate, and timing properties of the PMTs. We develop a method for in-situ calibration without a light injection source. Additionally we explore the application of machine learning techniques using a Boosted Decision Tree (BDT) trained on the response of single PMTs to understand the information available for background rejection. Finally, we briefly present the simulation tool used to generate digitised PMT data from optical Monte Carlo simulations.
Submitted 15 June, 2025; v1 submitted 23 April, 2025;
originally announced April 2025.
-
New constraints on cosmic ray-boosted dark matter from the LUX-ZEPLIN experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araujo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
E. E. Barillier,
K. Beattie,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer,
C. A. J. Brew
, et al. (179 additional authors not shown)
Abstract:
While dual-phase xenon time projection chambers (TPCs) have driven the sensitivity towards weakly interacting massive particles (WIMPs) at the GeV/c^2 to TeV/c^2 mass scale, the scope for sub-GeV/c^2 dark matter particles is hindered by a limited nuclear recoil energy detection threshold. One approach to probe for lighter candidates is to consider cases where they have been boosted by collisions with cosmic rays in the Milky Way, such that the additional kinetic energy lifts their induced signatures above the nominal threshold. In this Letter, we report first results of a search for cosmic ray-boosted dark matter (CRDM) with a combined 4.2 tonne-year exposure from the LUX-ZEPLIN (LZ) experiment. We observe no excess above the expected backgrounds and establish world-leading constraints on the spin-independent CRDM-nucleon cross section as small as 3.9 * 10^{-33} cm^2 at 90% confidence level for sub-GeV/c^2 masses.
Submitted 2 June, 2025; v1 submitted 23 March, 2025;
originally announced March 2025.
-
Word2Minecraft: Generating 3D Game Levels through Large Language Models
Authors:
Shuo Huang,
Muhammad Umair Nasir,
Steven James,
Julian Togelius
Abstract:
We present Word2Minecraft, a system that leverages large language models to generate playable game levels in Minecraft based on structured stories. The system transforms narrative elements, such as protagonist goals, antagonist challenges, and environmental settings, into game levels with both spatial and gameplay constraints. We introduce a flexible framework that allows for the customization of story complexity, enabling dynamic level generation. The system employs a scaling algorithm to maintain spatial consistency while adapting key game elements. We evaluate Word2Minecraft using both metric-based and human-based methods. Our results show that GPT-4-Turbo outperforms GPT-4o-Mini in most areas, including story coherence and objective enjoyment, while the latter excels in aesthetic appeal. We also demonstrate the system's ability to generate levels with high map enjoyment, offering a promising step forward in the intersection of story generation and game design. We open-source the code at https://github.com/JMZ-kk/Word2Minecraft/tree/word2mc_v0
Submitted 18 March, 2025;
originally announced March 2025.
-
Measurements and models of enhanced recombination following inner-shell vacancies in liquid xenon
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
E. E. Barillier,
D. Bauer,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger
, et al. (193 additional authors not shown)
Abstract:
Electron-capture decays of $^{125}$Xe and $^{127}$Xe, and double-electron-capture decays of $^{124}$Xe, are backgrounds in searches for weakly interacting massive particles (WIMPs) conducted by dual-phase xenon time projection chambers such as LUX-ZEPLIN (LZ). These decays produce signals with more light and less charge than equivalent-energy $β$ decays, and correspondingly overlap more with WIMP signals. We measure three electron-capture charge yields in LZ: the 1.1 keV M-shell, 5.2 keV L-shell, and 33.2 keV K-shell at drift fields of 193 and 96.5 V/cm. The LL double-electron-capture decay of $^{124}$Xe exhibits even more pronounced shifts in charge and light. We provide a first model of double-electron-capture charge yields using the link between ionization density and electron-ion recombination, and identify a need for more accurate calculations. Finally, we discuss the implications of the reduced charge yield of these decays and other interactions creating inner-shell vacancies for future dark matter searches.
Submitted 17 June, 2025; v1 submitted 7 March, 2025;
originally announced March 2025.
-
Robust detection of overlapping bioacoustic sound events
Authors:
Louis Mahon,
Benjamin Hoffman,
Logan S James,
Maddie Cusimano,
Masato Hagiwara,
Sarah C Woolley,
Olivier Pietquin
Abstract:
We propose a method for accurately detecting bioacoustic sound events that is robust to overlapping events, a common issue in domains such as ethology, ecology and conservation. While standard methods employ a frame-based, multi-label approach, we introduce an onset-based detection method which we name Voxaboxen. It takes inspiration from object detection methods in computer vision, but simultaneously takes advantage of recent advances in self-supervised audio encoders. For each time window, Voxaboxen predicts whether it contains the start of a vocalization and how long the vocalization is. It also does the same in reverse, predicting whether each window contains the end of a vocalization, and how long ago it started. The two resulting sets of bounding boxes are then fused using a graph-matching algorithm. We also release a new dataset designed to measure performance on detecting overlapping vocalizations. This consists of recordings of zebra finches annotated with temporally-strong labels and showing frequent overlaps. We test Voxaboxen on seven existing data sets and on our new data set. We compare Voxaboxen to natural baselines and existing sound event detection methods and demonstrate SotA results. Further experiments show that improvements are robust to frequent vocalization overlap.
Submitted 4 March, 2025;
originally announced March 2025.
-
Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback
Authors:
Harry Emerson,
Sam Gordon James,
Matthew Guy,
Ryan McConville
Abstract:
Reinforcement learning (RL) has demonstrated success in automating insulin dosing in simulated type 1 diabetes (T1D) patients but is currently unable to incorporate patient expertise and preference. This work introduces PAINT (Preference Adaptation for INsulin control in T1D), an original RL framework for learning flexible insulin dosing policies from patient records. PAINT employs a sketch-based approach for reward learning, where past data is annotated with a continuous reward signal to reflect patient's desired outcomes. Labelled data trains a reward model, informing the actions of a novel safety-constrained offline RL algorithm, designed to restrict actions to a safe strategy and enable preference tuning via a sliding scale. In-silico evaluation shows PAINT achieves common glucose goals through simple labelling of desired states, reducing glycaemic risk by 15% over a commercial benchmark. Action labelling can also be used to incorporate patient expertise, demonstrating an ability to pre-empt meals (+10% time-in-range post-meal) and address certain device errors (-1.6% variance post-error) with patient guidance. These results hold under realistic conditions, including limited samples, labelling errors, and intra-patient variability. This work illustrates PAINT's potential in real-world T1D management and more broadly any tasks requiring rapid and precise preference learning under safety constraints.
Submitted 27 January, 2025;
originally announced January 2025.
-
Compositional Instruction Following with Language Models and Reinforcement Learning
Authors:
Vanya Cohen,
Geraud Nangue Tasse,
Nakul Gopalan,
Steven James,
Matthew Gombolay,
Ray Mooney,
Benjamin Rosman
Abstract:
Combining reinforcement learning with language grounding is challenging as the agent needs to explore the environment while simultaneously learning multiple language-conditioned tasks. To address this, we introduce a novel method: the compositionally-enabled reinforcement learning language agent (CERLLA). Our method reduces the sample complexity of tasks specified with language by leveraging compositional policy representations and a semantic parser trained using reinforcement learning and in-context learning. We evaluate our approach in an environment requiring function approximation and demonstrate compositional generalization to novel tasks. Our method significantly outperforms the previous best non-compositional baseline in terms of sample complexity on 162 tasks designed to test compositional generalization. Our model attains a higher success rate and learns in fewer steps than the non-compositional baseline. It reaches a success rate equal to an oracle policy's upper-bound performance of 92%. With the same number of environment steps, the baseline only reaches a success rate of 80%.
Submitted 21 January, 2025;
originally announced January 2025.
-
First constraint for atmospheric millicharged particles with the LUX-ZEPLIN experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
E. E. Barillier,
D. Bauer,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger
, et al. (193 additional authors not shown)
Abstract:
We report on a search for millicharged particles (mCPs) produced in cosmic ray proton atmospheric interactions using data collected during the first science run of the LUX-ZEPLIN experiment. The mCPs produced by two processes -- meson decay and proton bremsstrahlung -- are considered in this study. This search utilized a novel signature unique to liquid xenon (LXe) time projection chambers (TPCs), allowing sensitivity to mCPs with masses ranging from 10 to 1000 MeV/c$^2$ and fractional charges between 0.001 and 0.02 of the electron charge e. With an exposure of 60 live days and a 5.5 tonne fiducial mass, we observed no significant excess over background. This represents the first experimental search for atmospheric mCPs and the first search for mCPs using an underground LXe experiment.
Submitted 9 June, 2025; v1 submitted 6 December, 2024;
originally announced December 2024.
-
The SABRE South Technical Design Report Executive Summary
Authors:
E. Barberio,
T. Baroncelli,
V. U. Bashu,
L. J. Bignell,
I. Bolognino,
G. Brooks,
S. S. Chhun,
F. Dastgiri,
A. Di Giacinto,
G. D'Imperio,
A. R. Duffy,
M. B. Froehlich,
T. Fruth,
G. Fu,
G. C. Hill,
R. S. James,
K. Janssens,
S. Kapoor,
G. J. Lane,
K. T. Leaver,
A. Mariani,
P. McGee,
L. J. McKie,
P. C. McNamara,
J. McKenzie
, et al. (20 additional authors not shown)
Abstract:
In this technical design report (TDR) executive summary we describe the SABRE South detector to be built at the Stawell Underground Physics Laboratory (SUPL). The SABRE South detector is designed to test the long-standing DAMA/LIBRA signal of an annually modulating rate consistent with dark matter by using the same target material. Located in the Southern Hemisphere, the detector is uniquely positioned to disentangle modulating seasonal effects. SABRE South uses seven ultra-high purity NaI(Tl) crystals (with a total target mass of either 35 kg or 50 kg), hermetically sealed in copper enclosures that are suspended within a liquid scintillator active veto. High quantum efficiency and low background Hamamatsu R11065 photomultiplier tubes are directly coupled to both ends of the crystal, and enclosed with the crystal in an oxygen free copper enclosure. The active veto system consists of 11.6 kL of linear alkylbenzene (LAB) doped with a mixture of fluorophores and contained in a steel vessel, which is instrumented with at least 18 Hamamatsu R5912 photomultipliers. The active veto tags key radiogenic backgrounds intrinsic to the crystals, such as ${^{40}}$K, and is expected to suppress the total background by 27% in the 1-6 keV region of interest. In addition to the liquid scintillator veto, a muon veto is positioned above the detector shielding. This muon veto consists of eight EJ-200 scintillator modules, with Hamamatsu R13089 photomultipliers coupled to both ends. With an expected total background of 0.72 cpd/kg/keV, SABRE South can test the DAMA/LIBRA signal with 5$σ$ discovery or 3$σ$ exclusion after two years of data taking.
Submitted 9 April, 2025; v1 submitted 21 November, 2024;
originally announced November 2024.
-
Maps from Motion (MfM): Generating 2D Semantic Maps from Sparse Multi-view Images
Authors:
Matteo Toso,
Stefano Fiorini,
Stuart James,
Alessio Del Bue
Abstract:
World-wide detailed 2D maps require enormous collective efforts. OpenStreetMap is the result of 11 million registered users manually annotating the GPS location of over 1.75 billion entries, including distinctive landmarks and common urban objects. At the same time, manual annotations can include errors and are slow to update, limiting the map's accuracy. Maps from Motion (MfM) is a step towards automating this time-consuming map-making procedure by computing 2D maps of semantic objects directly from a collection of uncalibrated multi-view images. From each image, we extract a set of object detections, and estimate their spatial arrangement in a top-down local map centered in the reference frame of the camera that captured the image. Aligning these local maps is not a trivial problem, since they provide incomplete, noisy fragments of the scene, and matching detections across them is unreliable because of the presence of repeated patterns and the limited appearance variability of urban objects. We address this with a novel graph-based framework that encodes the spatial and semantic distribution of the objects detected in each image, and learns how to combine them to predict the objects' poses in a global reference system, while taking into account all possible detection matches and preserving the topology observed in each image. Despite the complexity of the problem, our best model achieves global 2D registration with an average accuracy within 4 meters (i.e., below GPS accuracy) even on sparse sequences with strong viewpoint change, on which COLMAP has an 80% failure rate. We provide extensive evaluation on synthetic and real-world data, showing how the method obtains a solution even in scenarios where standard optimization techniques fail.
Submitted 22 November, 2024; v1 submitted 19 November, 2024;
originally announced November 2024.
-
Re-assembling the past: The RePAIR dataset and benchmark for real world 2D and 3D puzzle solving
Authors:
Theodore Tsesmelis,
Luca Palmieri,
Marina Khoroshiltseva,
Adeela Islam,
Gur Elkin,
Ofir Itzhak Shahar,
Gianluca Scarpellini,
Stefano Fiorini,
Yaniv Ohayon,
Nadav Alali,
Sinem Aslan,
Pietro Morerio,
Sebastiano Vascon,
Elena Gravina,
Maria Cristina Napolitano,
Giuseppe Scarpati,
Gabriel Zuchtriegel,
Alexandra Spühler,
Michel E. Fuchs,
Stuart James,
Ohad Ben-Shahar,
Marcello Pelillo,
Alessio Del Bue
Abstract:
This paper proposes the RePAIR dataset, a challenging benchmark for testing modern computational and data-driven methods for puzzle-solving and reassembly tasks. Our dataset has unique properties that are uncommon in current benchmarks for 2D and 3D puzzle solving. The fragments and fractures are realistic, caused by a collapse of a fresco during a World War II bombing at the Pompeii archaeological park. The fragments are also eroded and have missing pieces with irregular shapes and different dimensions, further challenging reassembly algorithms. The dataset is multi-modal, providing high-resolution images with characteristic pictorial elements, detailed 3D scans of the fragments, and metadata annotated by the archaeologists. Ground truth has been generated through several years of unceasing fieldwork, including the excavation and cleaning of each fragment, followed by manual puzzle solving by archaeologists of a subset of approx. 1000 pieces among the 16000 available. After digitizing all the fragments in 3D, a benchmark was prepared to challenge current reassembly and puzzle-solving methods that often solve more simplistic synthetic scenarios. The tested baselines show that there clearly exists a gap to fill in solving this computationally complex problem.
Submitted 5 November, 2024; v1 submitted 31 October, 2024;
originally announced October 2024.
-
Neutrinoless Double Beta Decay Sensitivity of the XLZD Rare Event Observatory
Authors:
XLZD Collaboration,
J. Aalbers,
K. Abe,
M. Adrover,
S. Ahmed Maouloud,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
L. Althueser,
D. W. P. Amaral,
C. S. Amarasinghe,
A. Ames,
B. Andrieu,
N. Angelides,
E. Angelino,
B. Antunovic,
E. Aprile,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
M. Babicz,
D. Bajpai,
A. Baker,
M. Balzer,
J. Bang
, et al. (419 additional authors not shown)
Abstract:
The XLZD collaboration is developing a two-phase xenon time projection chamber with an active mass of 60 to 80 t capable of probing the remaining WIMP-nucleon interaction parameter space down to the so-called neutrino fog. In this work we show that, based on the performance of currently operating detectors using the same technology and a realistic reduction of radioactivity in detector materials, such an experiment will also be able to competitively search for neutrinoless double beta decay in $^{136}$Xe using a natural-abundance xenon target. XLZD can reach a 3$σ$ discovery potential half-life of 5.7$\times$10$^{27}$ yr (and a 90% CL exclusion of 1.3$\times$10$^{28}$ yr) with 10 years of data taking, corresponding to a Majorana mass range of 7.3-31.3 meV (4.8-20.5 meV). XLZD will thus exclude the inverted neutrino mass ordering parameter space and will start to probe the normal ordering region for most of the nuclear matrix elements commonly considered by the community.
Submitted 30 April, 2025; v1 submitted 23 October, 2024;
originally announced October 2024.
-
The XLZD Design Book: Towards the Next-Generation Liquid Xenon Observatory for Dark Matter and Neutrino Physics
Authors:
XLZD Collaboration,
J. Aalbers,
K. Abe,
M. Adrover,
S. Ahmed Maouloud,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
L. Althueser,
D. W. P. Amaral,
C. S. Amarasinghe,
A. Ames,
B. Andrieu,
N. Angelides,
E. Angelino,
B. Antunovic,
E. Aprile,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
M. Babicz,
A. Baker,
M. Balzer,
J. Bang,
E. Barberio
, et al. (419 additional authors not shown)
Abstract:
This report describes the experimental strategy and technologies for XLZD, the next-generation xenon observatory sensitive to dark matter and neutrino physics. In the baseline design, the detector will have an active liquid xenon target of 60 tonnes, which could be increased to 80 tonnes if the market conditions for xenon are favorable. It is based on the mature liquid xenon time projection chamber technology used in current-generation experiments, LZ and XENONnT. The report discusses the baseline design and opportunities for further optimization of the individual detector components. The experiment envisaged here has the capability to explore parameter space for Weakly Interacting Massive Particle (WIMP) dark matter down to the neutrino fog, with a 3$σ$ evidence potential for WIMP-nucleon cross sections as low as $3\times10^{-49}\rm\,cm^2$ (at 40 GeV/c$^2$ WIMP mass). The observatory will also have leading sensitivity to a wide range of alternative dark matter models. It is projected to have a 3$σ$ observation potential of neutrinoless double beta decay of $^{136}$Xe at a half-life of up to $5.7\times 10^{27}$ years. Additionally, it is sensitive to astrophysical neutrinos from the sun and galactic supernovae.
Submitted 14 April, 2025; v1 submitted 22 October, 2024;
originally announced October 2024.
-
Dark Matter Search Results from 4.2 Tonne-Years of Exposure of the LUX-ZEPLIN (LZ) Experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
E. E. Barillier,
D. Bauer,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger
, et al. (193 additional authors not shown)
Abstract:
We report results of a search for nuclear recoils induced by weakly interacting massive particle (WIMP) dark matter using the LUX-ZEPLIN (LZ) two-phase xenon time projection chamber. This analysis uses a total exposure of $4.2\pm0.1$ tonne-years from 280 live days of LZ operation, of which $3.3\pm0.1$ tonne-years and 220 live days are new. A technique to actively tag background electronic recoils from $^{214}$Pb $β$ decays is featured for the first time. Enhanced electron-ion recombination is observed in two-neutrino double electron capture decays of $^{124}$Xe, representing a noteworthy new background. After removal of artificial signal-like events injected into the data set to mitigate analyzer bias, we find no evidence for an excess over expected backgrounds. World-leading constraints are placed on spin-independent (SI) and spin-dependent WIMP-nucleon cross sections for masses $\geq$9 GeV/$c^2$. The strongest SI exclusion set is $2.2\times10^{-48}$ cm$^{2}$ at the 90% confidence level and the best SI median sensitivity achieved is $5.1\times10^{-48}$ cm$^{2}$, both for a mass of 40 GeV/$c^2$.
Submitted 1 July, 2025; v1 submitted 22 October, 2024;
originally announced October 2024.
-
GameTraversalBenchmark: Evaluating Planning Abilities Of Large Language Models Through Traversing 2D Game Maps
Authors:
Muhammad Umair Nasir,
Steven James,
Julian Togelius
Abstract:
Large language models (LLMs) have recently demonstrated great success in generating and understanding natural language. While they have also shown potential beyond the domain of natural language, it remains an open question as to what extent and in which way these LLMs can plan. We investigate their planning capabilities by proposing GameTraversalBenchmark (GTB), a benchmark consisting of diverse 2D grid-based game maps. An LLM succeeds if it can traverse through given objectives, with a minimum number of steps and a minimum number of generation errors. We evaluate a number of LLMs on GTB and find that GPT-4-Turbo achieved the highest score of 44.97% on GTB_Score (GTBS), a composite score that combines the three above criteria. Furthermore, we preliminarily test large reasoning models, namely o1, which scores 67.84% on GTBS, indicating that the benchmark remains challenging for current models. Code, data, and documentation are available at https://github.com/umair-nasir14/Game-Traversal-Benchmark.
Submitted 10 October, 2024;
originally announced October 2024.
-
Model-independent searches of new physics in DARWIN with a semi-supervised deep learning pipeline
Authors:
J. Aalbers,
K. Abe,
M. Adrover,
S. Ahmed Maouloud,
L. Althueser,
D. W. P. Amaral,
B. Andrieu,
E. Angelino,
D. Antón Martin,
B. Antunovic,
E. Aprile,
M. Babicz,
D. Bajpai,
M. Balzer,
E. Barberio,
L. Baudis,
M. Bazyk,
N. F. Bell,
L. Bellagamba,
R. Biondi,
Y. Biondi,
A. Bismark,
C. Boehm,
K. Boese,
R. Braun
, et al. (209 additional authors not shown)
Abstract:
We present a novel deep learning pipeline to perform a model-independent, likelihood-free search for anomalous (i.e., non-background) events in the proposed next-generation, multi-ton-scale, liquid-xenon-based direct detection experiment, DARWIN. We train an anomaly detector comprising a variational autoencoder and a classifier on extensive, high-dimensional simulated detector response data and construct a one-dimensional anomaly score optimised to reject the background-only hypothesis in the presence of an excess of non-background-like events. We benchmark the procedure with a sensitivity study that determines its power to reject the background-only hypothesis in the presence of an injected WIMP dark matter signal, outperforming the classical, likelihood-based background rejection test. We show that our neural networks learn relevant energy features of the events from low-level, high-dimensional detector outputs, without the need to compress this data into lower-dimensional observables, thus reducing computational effort and information loss. For the future, our approach lays the foundation for an efficient end-to-end pipeline that eliminates the need for many of the corrections and cuts that are traditionally part of the analysis chain, with the potential of achieving higher accuracy and significant reduction of analysis time.
Submitted 1 October, 2024;
originally announced October 2024.
-
Two-neutrino double electron capture of $^{124}$Xe in the first LUX-ZEPLIN exposure
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
E. E. Barillier,
K. Beattie,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer,
C. A. J. Brew
, et al. (180 additional authors not shown)
Abstract:
The broad physics reach of the LUX-ZEPLIN (LZ) experiment covers rare phenomena beyond the direct detection of dark matter. We report precise measurements of the extremely rare decay of $^{124}$Xe through the process of two-neutrino double electron capture (2$ν$2EC), utilizing a $1.39\,\mathrm{kg} \times \mathrm{yr}$ isotopic exposure from the first LZ science run. A half-life of $T_{1/2}^{2\nu2\mathrm{EC}} = (1.09 \pm 0.14_{\text{stat}} \pm 0.05_{\text{sys}}) \times 10^{22}\,\mathrm{yr}$ is observed with a statistical significance of $8.3\,σ$, in agreement with literature. First empirical measurements of the KK capture fraction relative to other K-shell modes were conducted, and demonstrate consistency with respect to recent signal models at the $1.4\,σ$ level.
Submitted 7 December, 2024; v1 submitted 30 August, 2024;
originally announced August 2024.
-
The DAMA/LIBRA signal: an induced modulation effect?
Authors:
R. S. James,
K. Rule,
E. Barberio,
V. U. Bashu,
L. J. Bignell,
I. Bolognino,
G. Brooks,
S. S. Chhun,
F. Dastgiri,
A. R. Duffy,
M. Froehlich,
T. M. A. Fruth,
G. Fu,
G. C. Hill,
K. Janssens,
S. Kapoor,
G. J. Lane,
K. T. Leaver,
P. McGee,
L. J. McKie,
P. C. McNamara,
J. McKenzie,
W. J. D. Melbourne,
M. Mews,
L. J. Milligan
, et al. (14 additional authors not shown)
Abstract:
The persistence of the DAMA/LIBRA (DAMA) modulation over the past two decades has been a source of great contention within the dark matter community. The DAMA collaboration reports a persistent, modulating event rate within their setup of NaI(Tl) scintillating crystals at the INFN Laboratori Nazionali del Gran Sasso (LNGS) underground laboratory. A recent work suggested that this signal could have arisen from an analysis artefact, caused by DAMA not accounting for time variation of decaying background radioisotopes in their analysis procedure. In this work, we examine in detail this 'induced modulation' effect, arguing that a number of aspects of the DAMA signal are incompatible with an induced modulation arising from decays of background isotopes over the lifetime of the experiment. Using a toy model of the DAMA/LIBRA experiment, we explore the induced modulation effect under different variations of the activities of the relevant isotopes - namely, $^3$H and $^{210}$Pb - highlighting the various inconsistencies between the resultant toy datasets and the DAMA signal. We stress the importance of the SABRE experiment, whose goal is to unambiguously test for the presence of such a modulating signal in an experiment using the same target material and comparable levels of background.
Submitted 28 March, 2025; v1 submitted 16 August, 2024;
originally announced August 2024.
-
6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model
Authors:
Matteo Bortolon,
Theodore Tsesmelis,
Stuart James,
Fabio Poiesi,
Alessio Del Bue
Abstract:
We propose 6DGS to estimate the camera pose of a target RGB image given a 3D Gaussian Splatting (3DGS) model representing the scene. 6DGS avoids the iterative process typical of analysis-by-synthesis methods (e.g. iNeRF) that also require an initialization of the camera pose in order to converge. Instead, our method estimates a 6DoF pose by inverting the 3DGS rendering process. Starting from the object surface, we define a radiant Ellicell that uniformly generates rays departing from each ellipsoid that parameterizes the 3DGS model. Each Ellicell ray is associated with the rendering parameters of each ellipsoid, which in turn are used to obtain the best bindings between the target image pixels and the cast rays. These pixel-ray bindings are then ranked to select the best scoring bundle of rays, whose intersection provides the camera center and, in turn, the camera rotation. The proposed solution obviates the necessity of an "a priori" pose for initialization, and it solves 6DoF pose estimation in closed form, without the need for iterations. Moreover, compared to the existing Novel View Synthesis (NVS) baselines for pose estimation, 6DGS can improve the overall average rotational accuracy by 12% and translation accuracy by 22% on real scenes, despite not requiring any initialization pose. At the same time, our method operates near real-time, reaching 15fps on consumer hardware.
Submitted 22 July, 2024;
originally announced July 2024.
-
Generative Image as Action Models
Authors:
Mohit Shridhar,
Yat Long Lo,
Stephen James
Abstract:
Image-generation diffusion models have been fine-tuned to unlock new capabilities such as image-editing and novel view synthesis. Can we similarly unlock image-generation models for visuomotor control? We present GENIMA, a behavior-cloning agent that fine-tunes Stable Diffusion to 'draw joint-actions' as targets on RGB images. These images are fed into a controller that maps the visual targets into a sequence of joint-positions. We study GENIMA on 25 RLBench and 9 real-world manipulation tasks. We find that, by lifting actions into image-space, internet pre-trained diffusion models can generate policies that outperform state-of-the-art visuomotor approaches, especially in robustness to scene perturbations and generalizing to novel objects. Our method is also competitive with 3D agents, despite lacking priors such as depth, keypoints, or motion-planners.
Submitted 8 October, 2024; v1 submitted 10 July, 2024;
originally announced July 2024.
-
Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation
Authors:
Eugene Teoh,
Sumit Patidar,
Xiao Ma,
Stephen James
Abstract:
Generalising vision-based manipulation policies to novel environments remains a challenging area with limited exploration. Current practices involve collecting data in one location, training imitation learning or reinforcement learning policies with this data, and deploying the policy in the same location. However, this approach lacks scalability as it necessitates data collection in multiple locations for each task. This paper proposes a novel approach where data is collected in a location predominantly featuring green screens. We introduce Green-screen Augmentation (GreenAug), employing a chroma key algorithm to overlay background textures onto a green screen. Through extensive real-world empirical studies with over 850 training demonstrations and 8.2k evaluation episodes, we demonstrate that GreenAug surpasses no augmentation, standard computer vision augmentation, and prior generative augmentation methods in performance. While no algorithmic novelties are claimed, our paper advocates for a fundamental shift in data collection practices. We propose that real-world demonstrations in future research should utilise green screens, followed by the application of GreenAug. We believe GreenAug unlocks policy generalisation to visually distinct novel locations, addressing the current scene generalisation limitations in robot learning.
Submitted 8 September, 2024; v1 submitted 10 July, 2024;
originally announced July 2024.
-
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark
Authors:
Nikita Chernyadev,
Nicholas Backshall,
Xiao Ma,
Yunfan Lu,
Younggyo Seo,
Stephen James
Abstract:
We introduce BiGym, a new benchmark and learning environment for mobile bi-manual demo-driven robotic manipulation. BiGym features 40 diverse tasks set in home environments, ranging from simple target reaching to complex kitchen cleaning. To capture real-world performance accurately, we provide human-collected demonstrations for each task, reflecting the diverse modalities found in real-world robot trajectories. BiGym supports a variety of observations, including proprioceptive data and visual inputs such as RGB and depth from 3 camera views. To validate the usability of BiGym, we thoroughly benchmark state-of-the-art imitation learning algorithms and demo-driven reinforcement learning algorithms within the environment and discuss future opportunities.
Submitted 11 July, 2024; v1 submitted 10 July, 2024;
originally announced July 2024.
-
Continuous Control with Coarse-to-fine Reinforcement Learning
Authors:
Younggyo Seo,
Jafar Uruç,
Stephen James
Abstract:
Despite recent advances in improving the sample-efficiency of reinforcement learning (RL) algorithms, designing an RL algorithm that can be practically deployed in real-world environments remains a challenge. In this paper, we present Coarse-to-fine Reinforcement Learning (CRL), a framework that trains RL agents to zoom into a continuous action space in a coarse-to-fine manner, enabling the use of stable, sample-efficient value-based RL algorithms for fine-grained continuous control tasks. Our key idea is to train agents that output actions by iterating the procedure of (i) discretizing the continuous action space into multiple intervals and (ii) selecting the interval with the highest Q-value to further discretize at the next level. We then introduce a concrete, value-based algorithm within the CRL framework called Coarse-to-fine Q-Network (CQN). Our experiments demonstrate that CQN significantly outperforms RL and behavior cloning baselines on 20 sparsely-rewarded RLBench manipulation tasks with a modest number of environment interactions and expert demonstrations. We also show that CQN robustly learns to solve real-world manipulation tasks within a few minutes of online training.
Submitted 10 July, 2024;
originally announced July 2024.
-
The Design, Implementation, and Performance of the LZ Calibration Systems
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (179 additional authors not shown)
Abstract:
LUX-ZEPLIN (LZ) is a tonne-scale experiment searching for direct dark matter interactions and other rare events. It is located at the Sanford Underground Research Facility (SURF) in Lead, South Dakota, USA. The core of the LZ detector is a dual-phase xenon time projection chamber (TPC), designed with the primary goal of detecting Weakly Interacting Massive Particles (WIMPs) via their induced low energy nuclear recoils. Surrounding the TPC, two veto detectors immersed in an ultra-pure water tank enable reducing background events to enhance the discovery potential. Intricate calibration systems are purposely designed to precisely understand the responses of these three detector volumes to various types of particle interactions and to demonstrate LZ's ability to discriminate between signals and backgrounds. In this paper, we present a comprehensive discussion of the key features, requirements, and performance of the LZ calibration systems, which play a crucial role in enabling LZ's WIMP-search and its broad science program. The thorough description of these calibration systems, with an emphasis on their novel aspects, is valuable for future calibration efforts in direct dark matter and other rare-event search experiments.
Submitted 5 September, 2024; v1 submitted 2 May, 2024;
originally announced June 2024.
-
M-SET: Multi-Drone Swarm Intelligence Experimentation with Collision Avoidance Realism
Authors:
Chuhao Qin,
Alexander Robins,
Callum Lillywhite-Roake,
Adam Pearce,
Hritik Mehta,
Scott James,
Tsz Ho Wong,
Evangelos Pournaras
Abstract:
Distributed sensing by cooperative drone swarms is crucial for several Smart City applications, such as traffic monitoring and disaster response. Using an indoor lab with inexpensive drones, a testbed supports complex and ambitious studies on these systems while maintaining low cost, rigor, and external validity. This paper introduces the Multi-drone Sensing Experimentation Testbed (M-SET), a novel platform designed to prototype, develop, test, and evaluate distributed sensing with swarm intelligence. M-SET addresses the limitations of existing testbeds that fail to emulate collisions, thus lacking realism in outdoor environments. By integrating a collision avoidance method based on a potential field algorithm, M-SET ensures collision-free navigation and sensing, further optimized via a multi-agent collective learning algorithm. Extensive evaluation demonstrates accurate energy consumption estimation and a low risk of collisions, providing a robust proof-of-concept. New insights show that M-SET has significant potential to support ambitious research with minimal cost, simplicity, and high sensing quality.
Submitted 21 November, 2024; v1 submitted 16 June, 2024;
originally announced June 2024.
-
Redundancy-aware Action Spaces for Robot Learning
Authors:
Pietro Mazzaglia,
Nicholas Backshall,
Xiao Ma,
Stephen James
Abstract:
Joint space and task space control are the two dominant action modes for controlling robot arms within the robot learning literature. Actions in joint space provide precise control over the robot's pose, but tend to suffer from inefficient training; actions in task space boast data-efficient training but sacrifice the ability to perform tasks in confined spaces due to limited control over the full joint configuration. This work analyses the criteria for designing action spaces for robot manipulation and introduces ER (End-effector Redundancy), a novel action space formulation that, by addressing the redundancies present in the manipulator, aims to combine the advantages of both joint and task spaces, offering fine-grained comprehensive control with overactuated robot arms whilst achieving highly efficient robot learning. We present two implementations of ER, ERAngle (ERA) and ERJoint (ERJ), and we show that ERJ in particular demonstrates superior performance across multiple settings, especially when precise control over the robot configuration is required. We validate our results both in simulated and real robotic environments.
Submitted 6 June, 2024;
originally announced June 2024.
-
Probing the Scalar WIMP-Pion Coupling with the first LUX-ZEPLIN data
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. J. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (178 additional authors not shown)
Abstract:
Weakly interacting massive particles (WIMPs) may interact with a virtual pion that is exchanged between nucleons. This interaction channel is important to consider in models where the spin-independent isoscalar channel is suppressed. Using data from the first science run of the LUX-ZEPLIN dark matter experiment, containing 60 live days of data in a 5.5 tonne fiducial mass of liquid xenon, we report the results of a search for WIMP-pion interactions. We observe no significant excess and set an upper limit of $1.5\times10^{-46}$ cm$^2$ at a 90% confidence level for a WIMP mass of 33 GeV/$c^2$ for this interaction.
Submitted 4 June, 2024;
originally announced June 2024.
-
Render and Diffuse: Aligning Image and Action Spaces for Diffusion-based Behaviour Cloning
Authors:
Vitalis Vosylius,
Younggyo Seo,
Jafar Uruç,
Stephen James
Abstract:
In the field of Robot Learning, the complex mapping between high-dimensional observations such as RGB images and low-level robotic actions, two inherently very different spaces, constitutes a complex learning problem, especially with limited amounts of data. In this work, we introduce Render and Diffuse (R&D), a method that unifies low-level robot actions and RGB observations within the image space using virtual renders of the 3D model of the robot. Using this joint observation-action representation, it computes low-level robot actions using a learnt diffusion process that iteratively updates the virtual renders of the robot. This space unification simplifies the learning problem and introduces inductive biases that are crucial for sample efficiency and spatial generalisation. We thoroughly evaluate several variants of R&D in simulation and showcase their applicability on six everyday tasks in the real world. Our results show that R&D exhibits strong spatial generalisation capabilities and is more sample efficient than more common image-to-action methods.
Submitted 28 May, 2024;
originally announced May 2024.
-
The Data Acquisition System of the LZ Dark Matter Detector: FADR
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (191 additional authors not shown)
Abstract:
The Data Acquisition System (DAQ) for the LUX-ZEPLIN (LZ) dark matter detector is described. The signals from 745 PMTs, distributed across three subsystems, are sampled with 100-MHz 32-channel digitizers (DDC-32s). A basic waveform analysis is carried out on the on-board Field Programmable Gate Arrays (FPGAs) to extract information about the observed scintillation and electroluminescence signals. This information is used to determine if the digitized waveforms should be preserved for offline analysis.
The system is designed around the Kintex-7 FPGA. In addition to digitizing the PMT signals and providing basic event selection in real time, the flexibility provided by the use of FPGAs allows us to monitor the performance of the detector and the DAQ in parallel to normal data acquisition.
The hardware and software/firmware of this FPGA-based Architecture for Data acquisition and Realtime monitoring (FADR) are discussed and performance measurements are described.
Submitted 16 August, 2024; v1 submitted 23 May, 2024;
originally announced May 2024.
-
Word2World: Generating Stories and Worlds through Large Language Models
Authors:
Muhammad U. Nasir,
Steven James,
Julian Togelius
Abstract:
Large Language Models (LLMs) have proven their worth across a diverse spectrum of disciplines. LLMs have shown great potential in Procedural Content Generation (PCG) as well, but directly generating a level through a pre-trained LLM is still challenging. This work introduces Word2World, a system that enables LLMs to procedurally design playable games through stories, without any task-specific fine-tuning. Word2World leverages the abilities of LLMs to create diverse content and extract information. Combining these abilities, LLMs can create a story for the game, design narrative, and place tiles in appropriate places to create coherent worlds and playable games. We test Word2World with different LLMs and perform a thorough ablation study to validate each step. We open-source the code at https://github.com/umair-nasir14/Word2World.
Submitted 6 May, 2024;
originally announced May 2024.
-
Constraints On Covariant WIMP-Nucleon Effective Field Theory Interactions from the First Science Run of the LUX-ZEPLIN Experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. J. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (179 additional authors not shown)
Abstract:
The first science run of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time projection chamber operating in the Sanford Underground Research Facility in South Dakota, USA, has reported leading limits on spin-independent WIMP-nucleon interactions and interactions described by a non-relativistic effective field theory (NREFT). Using the same 5.5 t fiducial mass and 60 live days of exposure, we report on the results of a relativistic extension to the NREFT. We present constraints on couplings from covariant interactions arising from the coupling of vector, axial currents, and electric dipole moments of the nucleon to the magnetic and electric dipole moments of the WIMP which cannot be described by recasting previous results described by an NREFT. Using a profile-likelihood ratio analysis, in an energy region between 0 keV$_\text{nr}$ and 270 keV$_\text{nr}$, we report 90% confidence level exclusion limits on the coupling strength of five interactions in both the isoscalar and isovector bases.
Submitted 26 April, 2024;
originally announced April 2024.
-
Multi-Agent Team Access Monitoring: Environments that Benefit from Target Information Sharing
Authors:
Andrew Dudash,
Scott James,
Ryan Rubel
Abstract:
Robotic access monitoring of multiple target areas has applications including checkpoint enforcement, surveillance and containment of fire and flood hazards. Monitoring access for a single target region has been successfully modeled as a minimum-cut problem. We generalize this model to support multiple target areas using two approaches: iterating on individual targets and examining the collections of targets holistically. Through simulation we measure the performance of each approach on different scenarios.
Submitted 28 March, 2024;
originally announced March 2024.
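The minimum-cut formulation mentioned in the abstract above can be illustrated with a small sketch. This is not the authors' implementation: the corridor graph, the node names, and the `min_cut_value` helper are hypothetical, and edge capacities are assumed to count the robots needed to block a passage. By the max-flow/min-cut theorem, the maximum flow from the outside to the target equals the smallest total capacity that must be blocked to cut off access.

```python
from collections import deque


def min_cut_value(capacity, source, sink):
    """Return the min-cut capacity between source and sink (Edmonds-Karp max-flow)."""
    # Build a residual graph that also contains zero-capacity reverse edges.
    residual = {u: dict(nbrs) for u, nbrs in capacity.items()}
    for u, nbrs in capacity.items():
        for v in nbrs:
            residual.setdefault(v, {}).setdefault(u, 0)
    residual.setdefault(source, {})
    residual.setdefault(sink, {})

    flow = 0
    while True:
        # Breadth-first search for a shortest augmenting path.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, cap in residual[u].items():
                if cap > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return flow  # no augmenting path left: flow equals the min cut

        # Find the bottleneck capacity along the path.
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]

        # Push the bottleneck flow along the path.
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        flow += bottleneck


# Hypothetical floor plan: two independent routes lead from outside to the target.
corridors = {
    "outside": {"a": 1, "b": 1},
    "a": {"c": 1},
    "b": {"d": 1},
    "c": {"target": 1},
    "d": {"target": 1},
}
robots_needed = min_cut_value(corridors, "outside", "target")
```

One way to treat several targets together, under the same assumptions, would be to connect every target to a single artificial sink with large-capacity edges and run the same computation once; whether this matches either of the two approaches studied in the paper is not stated in the abstract.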
-
IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model
Authors:
Matteo Bortolon,
Theodore Tsesmelis,
Stuart James,
Fabio Poiesi,
Alessio Del Bue
Abstract:
We introduce IFFNeRF to estimate the six degrees-of-freedom (6DoF) camera pose of a given image, building on the Neural Radiance Fields (NeRF) formulation. IFFNeRF is specifically designed to operate in real time and eliminates the need for an initial pose guess that is proximate to the sought solution. IFFNeRF utilizes the Metropolis-Hastings algorithm to sample surface points from within the NeRF model. From these sampled points, we cast rays and deduce the color for each ray through pixel-level view synthesis. The camera pose can then be estimated as the solution to a Least Squares problem by selecting correspondences between the query image and the resulting bundle. We facilitate this process through a learned attention mechanism, bridging the query image embedding with the embedding of parameterized rays, thereby matching rays pertinent to the image. Through synthetic and real evaluation settings, we show that our method can reduce the angular and translation errors by 80.1% and 67.3%, respectively, compared to iNeRF, while running at 34 fps on consumer hardware and not requiring an initial pose guess.
Submitted 19 March, 2024;
originally announced March 2024.
-
Towards the Reusability and Compositionality of Causal Representations
Authors:
Davide Talon,
Phillip Lippe,
Stuart James,
Alessio Del Bue,
Sara Magliacane
Abstract:
Causal Representation Learning (CRL) aims at identifying high-level causal factors and their relationships from high-dimensional observations, e.g., images. While most CRL works focus on learning causal representations in a single environment, in this work we instead propose a first step towards learning causal representations from temporal sequences of images that can be adapted to a new environment, or composed across multiple related environments. In particular, we introduce DECAF, a framework that detects which causal factors can be reused and which need to be adapted from previously learned causal representations. Our approach is based on the availability of intervention targets, which indicate which variables are perturbed at each time step. Experiments on three benchmark datasets show that integrating our framework with four state-of-the-art CRL approaches leads to accurate representations in a new environment with only a few samples.
Submitted 14 March, 2024;
originally announced March 2024.
-
PRAGO: Differentiable Multi-View Pose Optimization From Objectness Detections
Authors:
Matteo Taiana,
Matteo Toso,
Stuart James,
Alessio Del Bue
Abstract:
Robustly estimating camera poses from a set of images is a fundamental task which remains challenging for differentiable methods, especially in the case of small and sparse camera pose graphs. To overcome this challenge, we propose Pose-refined Rotation Averaging Graph Optimization (PRAGO). From a set of objectness detections on unordered images, our method reconstructs the rotational pose, and in turn, the absolute pose, in a differentiable manner, benefiting from the optimization of a sequence of geometrical tasks. We show how our objectness pose-refinement module in PRAGO is able to resolve the inherent ambiguities in pairwise relative pose estimation without removing edges, thereby avoiding early decisions on the viability of graph edges. PRAGO then refines the absolute rotations through iterative graph construction, reweighting the graph edges to compute the final rotational pose, which can be converted into absolute poses using translation averaging. We show that PRAGO is able to outperform non-differentiable solvers on small and sparse scenes extracted from 7-Scenes, achieving a relative improvement of 21% for rotations while achieving similar translation estimates.
Submitted 15 March, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation
Authors:
Xiao Ma,
Sumit Patidar,
Iain Haughton,
Stephen James
Abstract:
This paper introduces Hierarchical Diffusion Policy (HDP), a hierarchical agent for multi-task robotic manipulation. HDP factorises a manipulation policy into a hierarchical structure: a high-level task-planning agent which predicts a distant next-best end-effector pose (NBP), and a low-level goal-conditioned diffusion policy which generates optimal motion trajectories. The factorised policy representation allows HDP to tackle long-horizon task planning while also generating fine-grained low-level actions. To generate context-aware motion trajectories while satisfying robot kinematics constraints, we present a novel kinematics-aware goal-conditioned control agent, Robot Kinematics Diffuser (RK-Diffuser). Specifically, RK-Diffuser learns to generate both the end-effector pose and joint position trajectories, and distills the accurate but kinematics-unaware end-effector pose diffuser into the kinematics-aware but less accurate joint position diffuser via differentiable kinematics. Empirically, we show that HDP achieves a significantly higher success rate than state-of-the-art methods in both simulation and the real world.
Submitted 6 March, 2024;
originally announced March 2024.
-
New constraints on ultraheavy dark matter from the LZ experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
A. Baxter,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer,
C. A. J. Brew,
et al. (174 additional authors not shown)
Abstract:
Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal freeze-out, but they have generally been less explored experimentally. In this work, we present a re-analysis of the first science run (SR1) of the LZ experiment, with an exposure of $0.9$ tonne$\times$year, to search for ultraheavy particle dark matter. The signal topology consists of multiple energy deposits in the active region of the detector forming a straight line, from which the velocity of the incoming particle can be reconstructed on an event-by-event basis. Zero events with this topology were observed after applying the data selection calibrated on a simulated sample of signal-like events. New experimental constraints are derived, which rule out previously unexplored regions of the dark matter parameter space of spin-independent interactions beyond a mass of 10$^{17}$ GeV/$c^2$.
Submitted 13 February, 2024;
originally announced February 2024.
-
MinePlanner: A Benchmark for Long-Horizon Planning in Large Minecraft Worlds
Authors:
William Hill,
Ireton Liu,
Anita De Mello Koch,
Damion Harvey,
Nishanth Kumar,
George Konidaris,
Steven James
Abstract:
We propose a new benchmark for planning tasks based on the Minecraft game. Our benchmark contains 45 tasks overall, but also provides support for creating both propositional and numeric instances of new Minecraft tasks automatically. We benchmark numeric and propositional planning systems on these tasks, with results demonstrating that state-of-the-art planners are currently incapable of dealing with many of the challenges advanced by our new benchmark, such as scaling to instances with thousands of objects. Based on these results, we identify areas of improvement for future planners. Our framework is made available at https://github.com/IretonLiu/mine-pddl/.
Submitted 28 April, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Counting Reward Automata: Sample Efficient Reinforcement Learning Through the Exploitation of Reward Function Structure
Authors:
Tristan Bester,
Benjamin Rosman,
Steven James,
Geraud Nangue Tasse
Abstract:
We present counting reward automata, a finite state machine variant capable of modelling any reward function expressible as a formal language. Unlike previous approaches, which are limited to the expression of tasks as regular languages, our framework allows for tasks described by unrestricted grammars. We prove that an agent equipped with such an abstract machine is able to solve a larger set of tasks than those utilising current approaches. We show that this increase in expressive power does not come at the cost of increased automaton complexity. A selection of learning algorithms is presented which exploit automaton structure to improve sample efficiency. We show that the state machines required in our formulation can be specified from natural language task descriptions using large language models. Empirical results demonstrate that our method outperforms competing approaches in terms of sample efficiency, automaton complexity, and task completion.
Submitted 16 February, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
First Constraints on WIMP-Nucleon Effective Field Theory Couplings in an Extended Energy Region From LUX-ZEPLIN
Authors:
LZ Collaboration,
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
A. Baxter,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
et al. (175 additional authors not shown)
Abstract:
Following the first science results of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time projection chamber operating from the Sanford Underground Research Facility in Lead, South Dakota, USA, we report the initial limits on a model-independent non-relativistic effective field theory describing the complete set of possible interactions of a weakly interacting massive particle (WIMP) with a nucleon. These results utilize the same 5.5 t fiducial mass and 60 live days of exposure collected for the LZ spin-independent and spin-dependent analyses while extending the upper limit of the energy region of interest by a factor of 7.5 to 270 keVnr. No significant excess in this high energy region is observed. Using a profile-likelihood ratio analysis, we report 90% confidence level exclusion limits on the coupling of each individual non-relativistic WIMP-nucleon operator for both elastic and inelastic interactions in the isoscalar and isovector bases.
Submitted 26 February, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
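As a rough consistency check on the figures quoted in the abstract above (a back-of-the-envelope sketch, assuming a 365.25-day year), the 5.5 t fiducial mass and 60 live days correspond to the roughly 0.9 tonne-year exposure quoted for the first science run in the ultraheavy dark matter entry earlier in this listing. Dividing 270 keVnr by the stated factor of 7.5 gives the implied earlier upper limit of the energy region of interest; that 36 keVnr value is inferred here and is not stated in the abstract.

```latex
\[
  5.5\,\mathrm{t} \times \frac{60\,\mathrm{d}}{365.25\,\mathrm{d/yr}}
  \approx 0.90\,\mathrm{t\,yr},
  \qquad
  \frac{270\,\mathrm{keV_{nr}}}{7.5} = 36\,\mathrm{keV_{nr}}.
\]
```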
-
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies
Authors:
Michael Beukman,
Devon Jarvis,
Richard Klein,
Steven James,
Benjamin Rosman
Abstract:
While reinforcement learning has achieved remarkable successes in several domains, its real-world application is limited due to many methods failing to generalise to unfamiliar conditions. In this work, we consider the problem of generalising to new transition dynamics, corresponding to cases in which the environment's response to the agent's actions differs. For example, the gravitational force exerted on a robot depends on its mass and changes the robot's mobility. Consequently, in such cases, it is necessary to condition an agent's actions on extrinsic state information and pertinent contextual information reflecting how the environment responds. While the need for context-sensitive policies has been established, the manner in which context is incorporated architecturally has received less attention. Thus, in this work, we present an investigation into how context information should be incorporated into behaviour learning to improve generalisation. To this end, we introduce a neural network architecture, the Decision Adapter, which generates the weights of an adapter module and conditions the behaviour of an agent on the context information. We show that the Decision Adapter is a useful generalisation of a previously proposed architecture and empirically demonstrate that it results in superior generalisation performance compared to previous approaches in several environments. Beyond this, the Decision Adapter is more robust to irrelevant distractor variables than several alternative methods.
Submitted 25 October, 2023;
originally announced October 2023.
-
Speed Co-Augmentation for Unsupervised Audio-Visual Pre-training
Authors:
Jiangliu Wang,
Jianbo Jiao,
Yibing Song,
Stephen James,
Zhan Tong,
Chongjian Ge,
Pieter Abbeel,
Yun-hui Liu
Abstract:
This work aims to improve unsupervised audio-visual pre-training. Inspired by the efficacy of data augmentation in visual contrastive learning, we propose a novel speed co-augmentation method that randomly changes the playback speeds of both audio and video data. Despite its simplicity, the speed co-augmentation method possesses two compelling attributes: (1) it increases the diversity of audio-visual pairs and doubles the size of negative pairs, resulting in a significant enhancement in the learned representations, and (2) it changes the strict correlation between audio-visual pairs but introduces a partial relationship between the augmented pairs, which is modeled by our proposed SoftInfoNCE loss to further boost the performance. Experimental results show that the proposed method significantly improves the learned representations when compared to vanilla audio-visual contrastive learning.
Submitted 25 September, 2023;
originally announced September 2023.
-
Language-Conditioned Path Planning
Authors:
Amber Xie,
Youngwoon Lee,
Pieter Abbeel,
Stephen James
Abstract:
Contact is at the core of robotic manipulation. At times, it is desired (e.g. manipulation and grasping), and at times, it is harmful (e.g. when avoiding obstacles). However, traditional path planning algorithms focus solely on collision-free paths, limiting their applicability in contact-rich tasks. To address this limitation, we propose the domain of Language-Conditioned Path Planning, where contact-awareness is incorporated into the path planning problem. As a first step in this domain, we propose Language-Conditioned Collision Functions (LACO), a novel approach that learns a collision function using only a single-view image, language prompt, and robot configuration. LACO predicts collisions between the robot and the environment, enabling flexible, conditional path planning without the need for manual object annotations, point cloud data, or ground-truth object meshes. In both simulation and the real world, we demonstrate that LACO can facilitate complex, nuanced path plans that allow for interaction with objects that are safe to collide with, rather than prohibiting any collision.
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
Language Reward Modulation for Pretraining Reinforcement Learning
Authors:
Ademi Adeniji,
Amber Xie,
Carmelo Sferrazza,
Younggyo Seo,
Stephen James,
Pieter Abbeel
Abstract:
Using learned reward functions (LRFs) as a means to solve sparse-reward reinforcement learning (RL) tasks has yielded some steady progress in task-complexity through the years. In this work, we question whether today's LRFs are best-suited as a direct replacement for task rewards. Instead, we propose leveraging the capabilities of LRFs as a pretraining signal for RL. Concretely, we propose $\textbf{LA}$nguage Reward $\textbf{M}$odulated $\textbf{P}$retraining (LAMP) which leverages the zero-shot capabilities of Vision-Language Models (VLMs) as a $\textit{pretraining}$ utility for RL as opposed to a downstream task reward. LAMP uses a frozen, pretrained VLM to scalably generate noisy, albeit shaped exploration rewards by computing the contrastive alignment between a highly diverse collection of language instructions and the image observations of an agent in its pretraining environment. LAMP optimizes these rewards in conjunction with standard novelty-seeking exploration rewards with reinforcement learning to acquire a language-conditioned, pretrained policy. Our VLM pretraining approach, which is a departure from previous attempts to use LRFs, can warmstart sample-efficient learning on robot manipulation tasks in RLBench.
Submitted 23 August, 2023;
originally announced August 2023.
-
A search for new physics in low-energy electron recoils from the first LZ exposure
Authors:
The LZ Collaboration,
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
A. Baxter,
K. Beattie,
P. Beltrame,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
G. M. Blockinger,
et al. (178 additional authors not shown)
Abstract:
The LUX-ZEPLIN (LZ) experiment is a dark matter detector centered on a dual-phase xenon time projection chamber. We report searches for new physics appearing through few-keV-scale electron recoils, using the experiment's first exposure of 60 live days and a fiducial mass of 5.5 t. The data are found to be consistent with a background-only hypothesis, and limits are set on models for new physics including solar axion electron coupling, solar neutrino magnetic moment and millicharge, and electron couplings to galactic axion-like particles and hidden photons. Similar limits are set on weakly interacting massive particle (WIMP) dark matter producing signals through ionized atomic states from the Migdal effect.
Submitted 9 September, 2023; v1 submitted 28 July, 2023;
originally announced July 2023.
-
LLMatic: Neural Architecture Search via Large Language Models and Quality Diversity Optimization
Authors:
Muhammad U. Nasir,
Sam Earle,
Christopher Cleghorn,
Steven James,
Julian Togelius
Abstract:
Large Language Models (LLMs) have emerged as powerful tools capable of accomplishing a broad spectrum of tasks. Their abilities span numerous areas, and one area where they have made a significant impact is in the domain of code generation. Here, we propose using the coding abilities of LLMs to introduce meaningful variations to code defining neural networks. Meanwhile, Quality-Diversity (QD) algorithms are known to discover diverse and robust solutions. By merging the code-generating abilities of LLMs with the diversity and robustness of QD solutions, we introduce \texttt{LLMatic}, a Neural Architecture Search (NAS) algorithm. While LLMs struggle to conduct NAS directly through prompts, \texttt{LLMatic} uses a procedural approach, leveraging QD for prompts and network architecture to create diverse and high-performing networks. We test \texttt{LLMatic} on the CIFAR-10 and NAS-bench-201 benchmarks, demonstrating that it can produce competitive networks while evaluating just $2,000$ candidates, even without prior knowledge of the benchmark domain or exposure to any previous top-performing models for the benchmark. The open-sourced code is available in \url{https://github.com/umair-nasir14/LLMatic}.
Submitted 12 April, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
-
ROSARL: Reward-Only Safe Reinforcement Learning
Authors:
Geraud Nangue Tasse,
Tamlin Love,
Mark Nemecek,
Steven James,
Benjamin Rosman
Abstract:
An important problem in reinforcement learning is designing agents that learn to solve tasks safely in an environment. A common solution is for a human expert to define either a penalty in the reward function or a cost to be minimised when reaching unsafe states. However, this is non-trivial, since too small a penalty may lead to agents that reach unsafe states, while too large a penalty increases the time to convergence. Additionally, the difficulty in designing reward or cost functions can increase with the complexity of the problem. Hence, for a given environment with a given set of unsafe states, we are interested in finding the upper bound of rewards at unsafe states whose optimal policies minimise the probability of reaching those unsafe states, irrespective of task rewards. We refer to this exact upper bound as the "Minmax penalty", and show that it can be obtained by taking into account both the controllability and diameter of an environment. We provide a simple practical model-free algorithm for an agent to learn this Minmax penalty while learning the task policy, and demonstrate that using it leads to agents that learn safe policies in high-dimensional continuous control environments.
Submitted 31 May, 2023;
originally announced June 2023.
-
3DoF Localization from a Single Image and an Object Map: the Flatlandia Problem and Dataset
Authors:
Matteo Toso,
Matteo Taiana,
Stuart James,
Alessio Del Bue
Abstract:
Efficient visual localization is crucial to many applications, such as large-scale deployment of autonomous agents and augmented reality. Traditional visual localization, while achieving remarkable accuracy, relies on extensive 3D models of the scene or large collections of geolocalized images, which are often inefficient to store and to scale to novel environments. In contrast, humans orient themselves using very abstract 2D maps, relying on the location of clearly identifiable landmarks. Drawing on this and on the success of recent works that explored localization on 2D abstract maps, we propose Flatlandia, a novel visual localization challenge. With Flatlandia, we investigate whether it is possible to localize a visual query by comparing the layout of the common objects detected in it against the known spatial layout of objects in the map. We formalize the challenge as two tasks at different levels of accuracy to investigate the problem and its possible limitations; for each, we propose initial baseline models and compare them against state-of-the-art 6DoF and 3DoF methods. Code and dataset are publicly available at github.com/IIT-PAVIS/Flatlandia.
Submitted 8 November, 2023; v1 submitted 13 April, 2023;
originally announced April 2023.