Showing 51–100 of 146 results for author: James, S

  1. arXiv:2303.11120  [pdf, other]

    cs.CV

    Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models

    Authors: Francesco Giuliari, Gianluca Scarpellini, Stuart James, Yiming Wang, Alessio Del Bue

    Abstract: Positional reasoning is the process of ordering unsorted parts contained in a set into a consistent structure. We present Positional Diffusion, a plug-and-play graph formulation with Diffusion Probabilistic Models to address positional reasoning. We use the forward process to map elements' positions in a set to random positions in a continuous space. Positional Diffusion learns to reverse the nois…

    Submitted 20 March, 2023; originally announced March 2023.

  2. Hierarchical clustering with OWA-based linkages, the Lance-Williams formula, and dendrogram inversions

    Authors: Marek Gagolewski, Anna Cena, Simon James, Gleb Beliakov

    Abstract: Agglomerative hierarchical clustering based on Ordered Weighted Averaging (OWA) operators not only generalises the single, complete, and average linkages, but also includes intercluster distances based on a few nearest or farthest neighbours, trimmed and winsorised means of pairwise point similarities, amongst many others. We explore the relationships between the famous Lance-Williams update formu…

    Submitted 25 October, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    Journal ref: Fuzzy Sets and Systems 473, 108740, 2023

  3. arXiv:2302.02408  [pdf, other]

    cs.RO cs.CV cs.LG

    Multi-View Masked World Models for Visual Robotic Manipulation

    Authors: Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel

    Abstract: Visual robotic manipulation research and applications often use multiple cameras, or views, to better perceive the world. How else can we utilize the richness of multi-view data? In this paper, we investigate how to learn good representations with multi-view data and utilize them for visual robotic manipulation. Specifically, we train a multi-view masked autoencoder which reconstructs pixels of ra…

    Submitted 31 May, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: Accepted to ICML 2023. First two authors contributed equally. Project webpage: https://sites.google.com/view/mv-mwm

  4. arXiv:2302.01561  [pdf, other]

    cs.AI

    Hierarchically Composing Level Generators for the Creation of Complex Structures

    Authors: Michael Beukman, Manuel Fokam, Marcel Kruger, Guy Axelrod, Muhammad Nasir, Branden Ingram, Benjamin Rosman, Steven James

    Abstract: Procedural content generation (PCG) is a growing field, with numerous applications in the video game industry and great potential to help create better games at a fraction of the cost of manual creation. However, much of the work in PCG is focused on generating relatively straightforward levels in simple games, as it is challenging to design an optimisable objective function for complex settings.…

    Submitted 19 July, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Code is available at https://github.com/Michael-Beukman/MCHAMR. This work has been accepted to IEEE Transactions on Games, with copyright transferred to the IEEE

  5. arXiv:2211.17120  [pdf, other]

    hep-ex physics.ins-det

    Background Determination for the LUX-ZEPLIN (LZ) Dark Matter Experiment

    Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, S. K. Alsum, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, J. Bang, J. W. Bargemann, A. Baxter, K. Beattie, P. Beltrame, E. P. Bernard, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, G. M. Blockinger, B. Boxer, et al. (178 additional authors not shown)

    Abstract: The LUX-ZEPLIN experiment recently reported limits on WIMP-nucleus interactions from its initial science run, down to $9.2\times10^{-48}$ cm$^2$ for the spin-independent interaction of a 36 GeV/c$^2$ WIMP at 90% confidence level. In this paper, we present a comprehensive analysis of the backgrounds important for this result and for other upcoming physics analyses, including neutrinoless double-bet…

    Submitted 17 July, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: 25 pages, 15 figures

    Journal ref: Phys. Rev. D 108, 012010 (2023)

  6. arXiv:2211.01644  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    StereoPose: Category-Level 6D Transparent Object Pose Estimation from Stereo Images via Back-View NOCS

    Authors: Kai Chen, Stephen James, Congying Sui, Yun-Hui Liu, Pieter Abbeel, Qi Dou

    Abstract: Most existing methods for category-level pose estimation rely on object point clouds. However, when considering transparent objects, depth cameras are usually not able to capture meaningful data, resulting in point clouds with severe artifacts. Without a high-quality point cloud, existing methods are not applicable to challenging transparent objects. To tackle this problem, we present StereoPose,…

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 7 pages, 6 figures, Project homepage: https://appsrv.cse.cuhk.edu.hk/~kaichen/stereopose.html

  7. arXiv:2210.14721  [pdf, other]

    cs.LG cs.AI

    Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data

    Authors: John So, Amber Xie, Sunggoo Jung, Jeffrey Edlund, Rohan Thakker, Ali Agha-mohammadi, Pieter Abbeel, Stephen James

    Abstract: Autonomous driving is complex, requiring sophisticated 3D scene understanding, localization, mapping, and control. Rather than explicitly modelling and fusing each of these components, we instead consider an end-to-end approach via reinforcement learning (RL). However, collecting exploration driving data in the real world is impractical and dangerous. While training in simulation and deploying vis…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: CoRL 2022 Paper

  8. arXiv:2210.11442  [pdf, other]

    cs.AI cs.NE

    Augmentative Topology Agents For Open-Ended Learning

    Authors: Muhammad Umair Nasir, Michael Beukman, Steven James, Christopher Wesley Cleghorn

    Abstract: In this work, we tackle the problem of open-ended learning by introducing a method that simultaneously evolves agents and increasingly challenging environments. Unlike previous open-ended approaches that optimize agents using a fixed neural network topology, we hypothesize that generalization can be improved by allowing agents' controllers to become more complex as they encounter more difficult en…

    Submitted 11 October, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted to The Proceedings of Genetic and Evolutionary Computation Conference (GECCO) 2023

  9. arXiv:2210.03109  [pdf, other]

    cs.RO cs.CV cs.LG

    Real-World Robot Learning with Masked Visual Pre-training

    Authors: Ilija Radosavovic, Tete Xiao, Stephen James, Pieter Abbeel, Jitendra Malik, Trevor Darrell

    Abstract: In this work, we explore self-supervised visual pre-training on images from diverse, in-the-wild videos for real-world robotic tasks. Like prior work, our visual representations are pre-trained via a masked autoencoder (MAE), frozen, and then passed into a learnable control module. Unlike prior work, we show that the pre-trained representations are effective across a range of real-world robotic ta…

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: CoRL 2022; Project page: https://tetexiao.com/projects/real-mvp

  10. arXiv:2210.02396  [pdf, other]

    cs.CV cs.AI cs.LG

    Temporally Consistent Transformers for Video Generation

    Authors: Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel

    Abstract: To generate accurate videos, algorithms have to understand the spatial and temporal dependencies in the world. Current algorithms enable accurate predictions over short horizons but tend to suffer from temporal inconsistencies. When generated content goes out of view and is later revisited, the model invents different content instead. Despite this severe limitation, no established benchmarks on co…

    Submitted 31 May, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Project website: https://wilson1yan.github.io/teco

  11. arXiv:2209.07143  [pdf, other]

    cs.CV

    HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator

    Authors: Younggyo Seo, Kimin Lee, Fangchen Liu, Stephen James, Pieter Abbeel

    Abstract: Video prediction is an important yet challenging problem; burdened with the tasks of generating future frames and learning environment dynamics. Recently, autoregressive latent video models have proved to be a powerful video prediction tool, by separating the video prediction into two sub-problems: pre-training an image generator model, followed by learning an autoregressive prediction model in th…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Extended draft of the paper accepted to ICIP 2022 conference

  12. arXiv:2209.03638  [pdf, other]

    cs.LG cs.CL cs.SI

    Geolocation of Cultural Heritage using Multi-View Knowledge Graph Embedding

    Authors: Hebatallah A. Mohamed, Sebastiano Vascon, Feliks Hibraj, Stuart James, Diego Pilutti, Alessio Del Bue, Marcello Pelillo

    Abstract: Knowledge Graphs (KGs) have proven to be a reliable way of structuring data. They can provide a rich source of contextual information about cultural heritage collections. However, cultural heritage KGs are far from being complete. They are often missing important attributes such as geographical location, especially for sculptures and mobile or indoor entities such as paintings. In this paper, we f…

    Submitted 8 September, 2022; originally announced September 2022.

  13. Combining Evolutionary Search with Behaviour Cloning for Procedurally Generated Content

    Authors: Nicholas Muir, Steven James

    Abstract: In this work, we consider the problem of procedural content generation for video game levels. Prior approaches have relied on evolutionary search (ES) methods capable of generating diverse levels, but this generation procedure is slow, which is problematic in real-time settings. Reinforcement learning (RL) has also been proposed to tackle the same problem, and while level generation is fast, train…

    Submitted 29 July, 2022; originally announced July 2022.

    Journal ref: Proceedings of 43rd Conference of the South African Institute of Computer Scientists and Information Technologists, July 2022

  14. arXiv:2207.09445  [pdf, other]

    cs.CV

    PoserNet: Refining Relative Camera Poses Exploiting Object Detections

    Authors: Matteo Taiana, Matteo Toso, Stuart James, Alessio Del Bue

    Abstract: The estimation of the camera poses associated with a set of images commonly relies on feature matches between the images. In contrast, we are the first to address this challenge by using objectness regions to guide the pose estimation problem rather than explicit semantic object detections. We propose Pose Refiner Network (PoserNet) a light-weight Graph Neural Network to refine the approximate pai…

    Submitted 21 July, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  15. arXiv:2207.05634  [pdf, other]

    cs.CV

    GANzzle: Reframing jigsaw puzzle solving as a retrieval task using a generative mental image

    Authors: Davide Talon, Alessio Del Bue, Stuart James

    Abstract: Puzzle solving is a combinatorial challenge due to the difficulty of matching adjacent pieces. Instead, we infer a mental image from all pieces, which a given piece can then be matched against avoiding the combinatorial explosion. Exploiting advancements in Generative Adversarial methods, we learn how to reconstruct the image given a set of unordered pieces, allowing the model to learn a joint emb…

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted at International Conference of Image Processing (ICIP22)

  16. First Dark Matter Search Results from the LUX-ZEPLIN (LZ) Experiment

    Authors: J. Aalbers, D. S. Akerib, C. W. Akerlof, A. K. Al Musalhi, F. Alder, A. Alqahtani, S. K. Alsum, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, S. Azadi, A. J. Bailey, A. Baker, J. Balajthy, S. Balashov, J. Bang, J. W. Bargemann, M. J. Barry, J. Barthel, D. Bauer, A. Baxter, et al. (322 additional authors not shown)

    Abstract: The LUX-ZEPLIN experiment is a dark matter detector centered on a dual-phase xenon time projection chamber operating at the Sanford Underground Research Facility in Lead, South Dakota, USA. This Letter reports results from LUX-ZEPLIN's first search for weakly interacting massive particles (WIMPs) with an exposure of 60 live days using a fiducial mass of 5.5 t. A profile-likelihood ratio analysis s…

    Submitted 2 August, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: 9 pages, 8 figures. See https://doi.org/10.1103/PhysRevLett.131.041002 for a data release related to this paper

    Journal ref: Phys. Rev. Lett. 131, 041002 (2023)

  17. arXiv:2206.14244  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Masked World Models for Visual Control

    Authors: Younggyo Seo, Danijar Hafner, Hao Liu, Fangchen Liu, Stephen James, Kimin Lee, Pieter Abbeel

    Abstract: Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient robot learning from visual observations. Yet the current approaches typically train a single model end-to-end for learning both visual representations and dynamics, making it difficult to accurately model the interaction between robots and small objects. In this work, we introduce a visual model-based RL fr…

    Submitted 27 May, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: Project website: https://sites.google.com/view/mwm-rl. Accepted to CoRL 2022

  18. arXiv:2206.11940  [pdf, other]

    cs.AI cs.LG

    World Value Functions: Knowledge Representation for Learning and Planning

    Authors: Geraud Nangue Tasse, Benjamin Rosman, Steven James

    Abstract: We propose world value functions (WVFs), a type of goal-oriented general value function that represents how to solve not just a given task, but any other goal-reaching task in an agent's environment. This is achieved by equipping an agent with an internal goal space defined as all the world states where it experiences a terminal transition. The agent can then modify the standard task rewards to de…

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: Accepted at the Planning and Reinforcement Learning Workshop at ICAPS 2022. arXiv admin note: text overlap with arXiv:2205.08827

  19. arXiv:2206.04003  [pdf, other]

    cs.CV cs.LG

    Patch-based Object-centric Transformers for Efficient Video Generation

    Authors: Wilson Yan, Ryo Okumura, Stephen James, Pieter Abbeel

    Abstract: In this work, we present Patch-based Object-centric Video Transformer (POVT), a novel region-based video generation architecture that leverages object-centric information to efficiently model temporal dynamics in videos. We build upon prior work in video prediction via an autoregressive transformer over the discrete latent space of compressed videos, with an added modification to model object-cent…

    Submitted 18 June, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Project Website: https://sites.google.com/view/povt-public

  20. arXiv:2206.03271  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO

    On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning

    Authors: Zhao Mandi, Pieter Abbeel, Stephen James

    Abstract: Intelligent agents should have the ability to leverage knowledge from previously learned tasks in order to learn new ones quickly and efficiently. Meta-learning approaches have emerged as a popular solution to achieve this. However, meta-reinforcement learning (meta-RL) algorithms have thus far been restricted to simple environments with narrow task distributions. Moreover, the paradigm of pretrai…

    Submitted 16 February, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  21. arXiv:2205.12532  [pdf, other]

    cs.LG cs.LO

    Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning

    Authors: Geraud Nangue Tasse, Devon Jarvis, Steven James, Benjamin Rosman

    Abstract: It is desirable for an agent to be able to solve a rich variety of problems that can be specified through language in the same environment. A popular approach towards obtaining such agents is to reuse skills learned in prior tasks to generalise compositionally to new ones. However, this is a challenging problem due to the curse of dimensionality induced by the combinatorially large number of ways…

    Submitted 16 March, 2024; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Published as a conference paper at ICLR 2024

  22. arXiv:2205.08827  [pdf, other]

    cs.LG

    World Value Functions: Knowledge Representation for Multitask Reinforcement Learning

    Authors: Geraud Nangue Tasse, Steven James, Benjamin Rosman

    Abstract: An open problem in artificial intelligence is how to learn and represent knowledge that is sufficient for a general agent that needs to solve multiple tasks in a given world. In this work we propose world value functions (WVFs), which are a type of general value function with mastery of the world - they represent not only how to solve a given task, but also how to solve any other goal-reaching tas…

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted to the 5th Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM), 2022

  23. arXiv:2205.06000  [pdf, other]

    cs.LG cs.CV

    Accounting for the Sequential Nature of States to Learn Features for Reinforcement Learning

    Authors: Nathan Michlo, Devon Jarvis, Richard Klein, Steven James

    Abstract: In this work, we investigate the properties of data that cause popular representation learning approaches to fail. In particular, we find that in environments where states do not significantly overlap, variational autoencoders (VAEs) fail to learn useful features. We demonstrate this failure in a simple gridworld domain, and then provide a solution in the form of metric learning. However, metric l…

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: text overlap with arXiv:2202.13341

    ACM Class: I.2; I.2.6

  24. arXiv:2205.02092  [pdf, other]

    cs.LG cs.AI

    Learning Abstract and Transferable Representations for Planning

    Authors: Steven James, Benjamin Rosman, George Konidaris

    Abstract: We are concerned with the question of how an agent can acquire its own representations from sensory data. We restrict our focus to learning representations for long-term planning, a class of problems that state-of-the-art learning methods are unable to solve. We propose a framework for autonomously learning state abstractions of an agent's environment, given a set of skills. Importantly, these abs…

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted to the 5th Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM), 2022

  25. FlameNEST: Explicit Profile Likelihoods with the Noble Element Simulation Technique

    Authors: R. S. James, J. Palmer, A. Kaboth, C. Ghag, J. Aalbers

    Abstract: We present FlameNEST, a framework providing explicit likelihood evaluations in noble element particle detectors using data-driven models from the Noble Element Simulation Technique. FlameNEST provides a way to perform statistical analyses on real data with no dependence on large, computationally expensive Monte Carlo simulations by evaluating the likelihood on an event-by-event basis using analyti…

    Submitted 28 April, 2022; originally announced April 2022.

  26. arXiv:2204.12471  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Coarse-to-fine Q-attention with Tree Expansion

    Authors: Stephen James, Pieter Abbeel

    Abstract: Coarse-to-fine Q-attention enables sample-efficient robot manipulation by discretizing the translation space in a coarse-to-fine manner, where the resolution gradually increases at each layer in the hierarchy. Although effective, Q-attention suffers from "coarse ambiguity" - when voxelization is significantly coarse, it is not feasible to distinguish similar-looking objects without first inspectin…

    Submitted 2 May, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: Project page and code: https://sites.google.com/view/q-attention-qte

  27. arXiv:2204.11842  [pdf, other]

    cs.LG cs.AI

    Adaptive Online Value Function Approximation with Wavelets

    Authors: Michael Beukman, Michael Mitchley, Dean Wookey, Steven James, George Konidaris

    Abstract: Using function approximation to represent a value function is necessary for continuous and high-dimensional state spaces. Linear function approximation has desirable theoretical guarantees and often requires less compute and samples than neural networks, but most approaches suffer from an exponential growth in the number of functions as the dimensionality of the state space increases. In this work…

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted to RLDM 2022. Code is located at https://github.com/Michael-Beukman/WaveletRL

  28. arXiv:2204.08327  [pdf, other]

    cs.RO

    Automatic Encoding and Repair of Reactive High-Level Tasks with Learned Abstract Representations

    Authors: Adam Pacheck, Steven James, George Konidaris, Hadas Kress-Gazit

    Abstract: We present a framework that, given a set of skills a robot can perform, abstracts sensor data into symbols that we use to automatically encode the robot's capabilities in Linear Temporal Logic. We specify reactive high-level tasks based on these capabilities, for which a strategy is automatically synthesized and executed on the robot, if the task is feasible. If a task is not feasible given the ro…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 27 pages, 15 figures, Submitted to The International Journal of Robotics Research (IJRR)

  29. arXiv:2204.07049  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking

    Authors: Kai Chen, Rui Cao, Stephen James, Yichuan Li, Yun-Hui Liu, Pieter Abbeel, Qi Dou

    Abstract: In this paper, we propose an iterative self-training framework for sim-to-real 6D object pose estimation to facilitate cost-effective robotic grasping. Given a bin-picking scenario, we establish a photo-realistic simulator to synthesize abundant virtual data, and use this to train an initial pose estimation network. This network then takes the role of a teacher model, which generates pose predicti…

    Submitted 21 July, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted to ECCV 2022

  30. Procedural Content Generation using Neuroevolution and Novelty Search for Diverse Video Game Levels

    Authors: Michael Beukman, Christopher W Cleghorn, Steven James

    Abstract: Procedurally generated video game content has the potential to drastically reduce the content creation budget of game developers and large studios. However, adoption is hindered by limitations such as slow generation, as well as low quality and diversity of content. We introduce an evolutionary search-based approach for evolving level generators using novelty search to procedurally generate divers…

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted to the Genetic and Evolutionary Computation Conference (GECCO '22), July 9--13, 2022, Boston, MA, USA. Code is located at https://github.com/Michael-Beukman/PCGNN

  31. Mitigating Mismatch Compression in Differential Local Field Potentials

    Authors: Vineet Tiruvadi, Sam James, Bryan Howell, Mosadoluwa Obatusin, Andrea Crowell, Patricio Riva-Posse, Ki Sueng Choi, Allison Waters, Robert E. Gross, Cameron C. McIntyre, Helen S. Mayberg, Robert Butera

    Abstract: Bidirectional deep brain stimulation (bdDBS) devices capable of recording differential local field potentials (dLFP) enable neural recordings alongside clinical therapy. Efforts to identify objective signals of various brain disorders, or disease readouts, are challenging in dLFP, especially during active DBS. In this report we identified, characterized, and mitigated a major source of distortion…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: 9 pages, 9 figures

  32. arXiv:2204.01571  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Coarse-to-Fine Q-attention with Learned Path Ranking

    Authors: Stephen James, Pieter Abbeel

    Abstract: We propose Learned Path Ranking (LPR), a method that accepts an end-effector goal pose, and learns to rank a set of goal-reaching paths generated from an array of path generating methods, including: path planning, Bezier curve sampling, and a learned policy. The core idea being that each of the path generation modules will be useful in different tasks, or at different stages in a task. When LPR is…

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Project page and code: https://sites.google.com/view/q-attention-lpr

  33. arXiv:2203.13880  [pdf, other]

    cs.CV cs.AI

    Reinforcement Learning with Action-Free Pre-Training from Videos

    Authors: Younggyo Seo, Kimin Lee, Stephen James, Pieter Abbeel

    Abstract: Recent unsupervised pre-training methods have shown to be effective on language and vision domains by learning useful representations for multiple downstream tasks. In this paper, we investigate if such unsupervised pre-training methods can also be effective for vision-based reinforcement learning (RL). To this end, we introduce a framework that learns representations useful for understanding the…

    Submitted 16 June, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: International Conference on Machine Learning (ICML 2022). Project page: https://sites.google.com/view/rl-apv

  34. arXiv:2203.02309  [pdf, other]

    physics.ins-det astro-ph.CO hep-ex nucl-ex

    A Next-Generation Liquid Xenon Observatory for Dark Matter and Neutrino Physics

    Authors: J. Aalbers, K. Abe, V. Aerne, F. Agostini, S. Ahmed Maouloud, D. S. Akerib, D. Yu. Akimov, J. Akshat, A. K. Al Musalhi, F. Alder, S. K. Alsum, L. Althueser, C. S. Amarasinghe, F. D. Amaro, A. Ames, T. J. Anderson, B. Andrieu, N. Angelides, E. Angelino, J. Angevaare, V. C. Antochi, D. Antón Martin, B. Antunovic, E. Aprile, H. M. Araújo, et al. (572 additional authors not shown)

    Abstract: The nature of dark matter and properties of neutrinos are among the most pressing issues in contemporary particle physics. The dual-phase xenon time-projection chamber is the leading technology to cover the available parameter space for Weakly Interacting Massive Particles (WIMPs), while featuring extensive sensitivity to many alternative dark matter candidates. These detectors can also study neut…

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 77 pages, 40 figures, 1262 references

    Report number: INT-PUB-22-003

    Journal ref: J. Phys. G: Nucl. Part. Phys. 50 (2023) 013001

  35. arXiv:2202.13341  [pdf, other]

    cs.LG cs.AI cs.CV

    Overlooked Implications of the Reconstruction Loss for VAE Disentanglement

    Authors: Nathan Michlo, Richard Klein, Steven James

    Abstract: Learning disentangled representations with variational autoencoders (VAEs) is often attributed to the regularisation component of the loss. In this work, we highlight the interaction between data and the reconstruction term of the loss as the main contributor to disentanglement in VAEs. We show that standard benchmark datasets have unintended correlations between their subjective ground-truth fact…

    Submitted 9 August, 2023; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: 13 pages, 12 figures, 4 tables

    ACM Class: I.2; I.2.6; I.4.10

    Journal ref: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI), Main Track, 2023

  36. arXiv:2202.11092  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    ReorientBot: Learning Object Reorientation for Specific-Posed Placement

    Authors: Kentaro Wada, Stephen James, Andrew J. Davison

    Abstract: Robots need the capability of placing objects in arbitrary, specific poses to rearrange the world and achieve various valuable tasks. Object reorientation plays a crucial role in this as objects may not initially be oriented such that the robot can grasp and then immediately place them in a specific goal pose. In this work, we present a vision-based manipulation system, ReorientBot, which consists…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: 7 pages, 6 figures, IEEE International Conference on Robotics and Automation (ICRA) 2022

  37. arXiv:2202.05832  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    SafePicking: Learning Safe Object Extraction via Object-Level Mapping

    Authors: Kentaro Wada, Stephen James, Andrew J. Davison

    Abstract: Robots need object-level scene understanding to manipulate objects while reasoning about contact, support, and occlusion among objects. Given a pile of objects, object recognition and reconstruction can identify the boundary of object instances, giving important cues as to how the objects form and support the pile. In this work, we present a system, SafePicking, that integrates object-level mappin…

    Submitted 1 March, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: 7 pages, 6 figures, IEEE International Conference on Robotics and Automation (ICRA) 2022

  38. arXiv:2202.03957  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning

    Authors: Stephen James, Pieter Abbeel

    Abstract: We propose a new policy parameterization for representing 3D rotations during reinforcement learning. Today in the continuous control reinforcement learning literature, many stochastic policy parameterizations are Gaussian. We argue that universally applying a Gaussian policy parameterization is not always desirable for all environments. One such case in particular where this is true are tasks tha…

    Submitted 8 February, 2022; originally announced February 2022.

    Comments: Project page and code: https://sites.google.com/view/rl-bpp

  39. arXiv:2202.03091  [pdf, other]

    cs.LG cs.AI cs.CV

    Auto-Lambda: Disentangling Dynamic Task Relationships

    Authors: Shikun Liu, Stephen James, Andrew J. Davison, Edward Johns

    Abstract: Understanding the structure of multiple related tasks allows for multi-task learning to improve the generalisation ability of one or all of them. However, it usually requires training each pairwise combination of tasks together in order to capture task relationships, at an extremely high computational cost. In this work, we learn task relationships via an automated weighting framework, named Auto-…

    Submitted 2 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Published at TMLR 2022. Project Page: https://shikun.io/projects/auto-lambda Code: https://github.com/lorenmt/auto-lambda

  40. arXiv:2202.00740  [pdf, other]

    cs.LG

    Investigating Transfer Learning in Graph Neural Networks

    Authors: Nishai Kooverjee, Steven James, Terence van Zyl

    Abstract: Graph neural networks (GNNs) build on the success of deep learning models by extending them for use in graph spaces. Transfer learning has proven extremely successful for traditional deep learning problems: resulting in faster training and improved performance. Despite the increasing interest in GNNs and their use cases, there is little research on their transferability. This research demonstrates…

    Submitted 1 February, 2022; originally announced February 2022.

  41. arXiv:2201.10334  [pdf, ps, other]

    cs.AI

    Towards Objective Metrics for Procedurally Generated Video Game Levels

    Authors: Michael Beukman, Steven James, Christopher Cleghorn

    Abstract: With increasing interest in procedural content generation by academia and game developers alike, it is vital that different approaches can be compared fairly. However, evaluating procedurally generated video game levels is often difficult, due to the lack of standardised, game-independent metrics. In this paper, we introduce two simulation-based evaluation metrics that involve analysing the behavi…

    Submitted 9 March, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: 7 pages, 10 figures. V3: This work has been submitted to the IEEE for possible publication. Code is located at https://github.com/Michael-Beukman/PCGNN

  42. arXiv:2201.02858  [pdf, other]

    hep-ex astro-ph.CO astro-ph.IM hep-ph

    Cosmogenic production of $^{37}$Ar in the context of the LUX-ZEPLIN experiment

    Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, S. K. Alsum, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, X. Bai, A. Baker, J. Balajthy, S. Balashov, J. Bang, J. W. Bargemann, D. Bauer, A. Baxter, K. Beattie, E. P. Bernard, A. Bhatti, A. Biekert, T. P. Biesiadzinski, et al. (183 additional authors not shown)

    Abstract: We estimate the amount of $^{37}$Ar produced in natural xenon via cosmic ray-induced spallation, an inevitable consequence of the transportation and storage of xenon on the Earth's surface. We then calculate the resulting $^{37}$Ar concentration in a 10-tonne payload (similar to that of the LUX-ZEPLIN experiment) assuming a representative schedule of xenon purification, storage and delivery to the…

    Submitted 22 March, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

  43. arXiv:2110.04647  [pdf, other]

    cs.LG cs.CL

    Learning to Follow Language Instructions with Compositional Policies

    Authors: Vanya Cohen, Geraud Nangue Tasse, Nakul Gopalan, Steven James, Matthew Gombolay, Benjamin Rosman

    Abstract: We propose a framework that learns to execute natural language instructions in an environment consisting of goal-reaching tasks that share components of their task descriptions. Our approach leverages the compositionality of both value functions and language, with the aim of reducing the sample complexity of learning novel tasks. First, we train a reinforcement learning agent to learn value functi…

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Presented at AI-HRI symposium as part of AAAI-FSS 2021 (arXiv:2109.10836)

    Report number: AIHRI/2021/53

  44. arXiv:2106.12534  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation

    Authors: Stephen James, Kentaro Wada, Tristan Laidlow, Andrew J. Davison

    Abstract: We present a coarse-to-fine discretisation method that enables the use of discrete reinforcement learning approaches in place of unstable and data-inefficient actor-critic methods in continuous robotics domains. This approach builds on the recently released ARM algorithm, which replaces the continuous next-best pose agent with a discrete one, with coarse-to-fine Q-attention. Given a voxelised scen…

    Submitted 14 March, 2022; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022). Videos and code: https://sites.google.com/view/c2f-q-attention

  45. arXiv:2105.14829  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation

    Authors: Stephen James, Andrew J. Davison

    Abstract: Despite the success of reinforcement learning methods, they have yet to have their breakthrough moment when applied to a broad range of robotic manipulation tasks. This is partly due to the fact that reinforcement learning algorithms are notoriously difficult and time consuming to train, which is exacerbated when training from images rather than full-state inputs. As humans perform manipulation ta…

    Submitted 3 February, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: IEEE Robotics and Automation Letters, 2022 (+ presentation at ICRA 2022). Videos and code found at: https://sites.google.com/view/q-attention

  46. arXiv:2105.02139  [pdf, other]

    cs.HC

    Mixing Modalities of 3D Sketching and Speech for Interactive Model Retrieval in Virtual Reality

    Authors: Daniele Giunchi, Alejandro Sztrajman, Stuart James, Anthony Steed

    Abstract: Sketch and speech are intuitive interaction methods that convey complementary information and have been independently used for 3D model retrieval in virtual environments. While sketch has been shown to be an effective retrieval method, not all collections are easily navigable using this modality alone. We design a new challenging database for sketch comprised of 3D chairs where each of the compone…

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: Published at IMX 2021

  47. arXiv:2105.00312  [pdf, other]

    cs.RO cs.LG

    Waypoint Planning Networks

    Authors: Alexandru-Iosif Toma, Hussein Ali Jaafar, Hao-Ya Hsueh, Stephen James, Daniel Lenton, Ronald Clark, Sajad Saeedi

    Abstract: With the recent advances in machine learning, path planning algorithms are also evolving; however, the learned path planning algorithms often have difficulty competing with success rates of classic algorithms. We propose waypoint planning networks (WPN), a hybrid algorithm based on LSTMs with a local kernel - a classic algorithm such as A*, and a global kernel using a learned algorithm. WPN produc…

    Submitted 1 May, 2021; originally announced May 2021.

    Comments: The Conference on Robots and Vision (CRV2021) Supplementary Website: https://sites.google.com/view/waypoint-planning-networks

  48. arXiv:2104.13374  [pdf, other]

    physics.ins-det nucl-ex

    Projected sensitivity of the LUX-ZEPLIN (LZ) experiment to the two-neutrino and neutrinoless double beta decays of $^{134}$Xe

    Authors: The LUX-ZEPLIN Collaboration: D. S. Akerib, A. K. Al Musalhi, S. K. Alsum, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araujo, J. E. Armstrong, M. Arthurs, X. Bai, J. Balajthy, S. Balashov, J. Bang, J. W. Bargemann, D. Bauer, A. Baxter, P. Beltrame, E. P. Bernard, A. Bernstein, A. Bhatti, A. Biekert, et al. (172 additional authors not shown)

    Abstract: The projected sensitivity of the LUX-ZEPLIN (LZ) experiment to two-neutrino and neutrinoless double beta decay of $^{134}$Xe is presented. LZ is a 10-tonne xenon time projection chamber optimized for the detection of dark matter particles, that is expected to start operating in 2021 at Sanford Underground Research Facility, USA. Its large mass of natural xenon provides an exceptional opportunity t…

    Submitted 22 November, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: Version accepted for publication in Phys. Rev. C

  49. arXiv:2104.03920  [pdf, other]

    cs.SI cs.AI

    Finding Experts in Social Media Data using a Hybrid Approach

    Authors: Simon James, Brady

    Abstract: Several approaches to the problem of expert finding have emerged in computer science research. In this work, three of these approaches - content analysis, social graph analysis and the use of Semantic Web technologies are examined. An integrated set of system requirements is then developed that uses all three approaches in one hybrid approach. To show the practicality of this hybrid approach, a…

    Submitted 1 April, 2021; originally announced April 2021.

  50. arXiv:2103.16442  [pdf, other]

    cs.CV

    SIMstack: A Generative Shape and Instance Model for Unordered Object Stacks

    Authors: Zoe Landgraf, Raluca Scona, Tristan Laidlow, Stephen James, Stefan Leutenegger, Andrew J. Davison

    Abstract: By estimating 3D shape and instances from a single view, we can capture information about an environment quickly, without the need for comprehensive scanning and multi-view fusion. Solving this task for composite scenes (such as object stacks) is challenging: occluded areas are not only ambiguous in shape but also in instance segmentation; multiple decompositions could be valid. We observe that ph…

    Submitted 26 September, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Journal ref: ICCV 2021