Skip to main content

Showing 1–50 of 128 results for author: Henriques, J

.
  1. arXiv:2505.05643  [pdf, other

    eess.IV cs.CV physics.med-ph

    UltraGauss: Ultrafast Gaussian Reconstruction of 3D Ultrasound Volumes

    Authors: Mark C. Eid, Ana I. L. Namburete, João F. Henriques

    Abstract: Ultrasound imaging is widely used due to its safety, affordability, and real-time capabilities, but its 2D interpretation is highly operator-dependent, leading to variability and increased cognitive demand. 2D-to-3D reconstruction mitigates these challenges by providing standardized volumetric views, yet existing methods are often computationally expensive, memory-intensive, or incompatible with u… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  2. arXiv:2503.04676  [pdf, other

    cond-mat.mes-hall

    Characterizing $S=3/2$ AKLT Hamiltonian with Scanning Tunneling Spectroscopy

    Authors: M. Ferri-Cortés, J. C. G. Henriques, J. Fernández-Rossier

    Abstract: The AKLT Hamiltonian is a particular instance of a general class of model Hamiltonians defined in lattices with coordination $z$ where each site hosts a spins $S=z/2$, interacting both with linear and non-linear exchange couplings. In two dimensions, the AKLT model features a gap in the spectrum, and its ground state is a valence bond solid state; that is an universal resource for measurement base… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  3. arXiv:2502.13770  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    On determining the energy dispersion of spin excitations with scanning tunneling spectroscopy

    Authors: J. C. G. Henriques, Chenxiao Zhao, G. Catarina, Pascal Ruffieux, Roman Fasel, J. Fernández-Rossier

    Abstract: Conventional methods to measure the dispersion relations of collective spin excitations involve probing bulk samples with particles such as neutrons, photons or electrons, which carry a well-defined momentum. Open-ended finite-size spin chains, on the contrary, do not have a well-defined momentum due to the lack of translation symmetry, and their spin excitations are measured with an eminently loc… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: 6 pages, 4 figures

  4. arXiv:2412.17100  [pdf, other

    eess.IV

    Classifier-guided registration of coronary CT angiography and intravascular ultrasound

    Authors: R. L. M. van Herten, José P. Henriques, R. Nils Planken, Joost Daemen, Eline M. J. Hartman, Jolanda J. Wentzel, Ivana Išgum

    Abstract: Coronary CT angiography (CCTA) and intravascular ultrasound (IVUS) provide complementary information for coronary artery disease assessment, making their registration valuable for comprehensive analysis. However, existing registration methods require manual interaction or extensive segmentations, limiting their practical application. In this work, we present a fully automatic framework for CCTA-IV… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

    Comments: Submitted to IEEE Transactions on Medical Imaging

  5. arXiv:2412.12079  [pdf, other

    cs.CV

    UniLoc: Towards Universal Place Recognition Using Any Single Modality

    Authors: Yan Xia, Zhendong Li, Yun-Jin Li, Letian Shi, Hu Cao, João F. Henriques, Daniel Cremers

    Abstract: To date, most place recognition methods focus on single-modality retrieval. While they perform well in specific environments, cross-modal methods offer greater flexibility by allowing seamless switching between map and query sources. It also promises to reduce computation requirements by having a unified model, and achieving greater sample efficiency by sharing parameters. In this work, we develop… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: 14 pages, 10 figures

  6. arXiv:2412.10308  [pdf, other

    cs.CV

    TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes

    Authors: Yan Xia, Yunxiang Lu, Rui Song, Oussema Dhaouadi, João F. Henriques, Daniel Cremers

    Abstract: We tackle the problem of localizing traffic cameras within a 3D reference map and propose a novel image-to-point cloud registration (I2P) method, TrafficLoc, in a coarse-tofine matching fashion. To overcome the lack of large-scale real-world intersection datasets, we first introduce Carla Intersection, a new simulated dataset with 75 urban and rural intersections in Carla. We find that current I2P… ▽ More

    Submitted 25 March, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

  7. arXiv:2412.03139  [pdf, ps, other

    cond-mat.mes-hall physics.optics

    Electrically Tunable Interband Collective Excitations in Biased Bilayer and Trilayer Graphene

    Authors: Tomer Eini, M. F. C. Martins Quintela, J. C. G. Henriques, R. M. Ribeiro, Yarden Mazor, N. M. R. Peres, Itai Epstein

    Abstract: Collective excitations of charged particles under the influence of an electromagnetic field give rise to a rich variety of hybrid light-matter quasiparticles with unique properties. In metals, intraband collective response manifested by negative permittivity leads to plasmon-polaritons with extreme field confinement, wavelength squeezing, and potentially low propagation losses. In contrast, photon… ▽ More

    Submitted 27 February, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

  8. arXiv:2410.23156  [pdf, other

    cs.AI cs.CV cs.LG cs.RO

    VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning

    Authors: Yichao Liang, Nishanth Kumar, Hao Tang, Adrian Weller, Joshua B. Tenenbaum, Tom Silver, João F. Henriques, Kevin Ellis

    Abstract: Broadly intelligent agents should form task-specific abstractions that selectively expose the essential elements of a task, while abstracting away the complexity of the raw sensorimotor space. In this work, we present Neuro-Symbolic Predicates, a first-order abstraction language that combines the strengths of symbolic and neural knowledge representations. We outline an online algorithm for inventi… ▽ More

    Submitted 28 February, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

    Comments: ICLR 2025 (Spotlight)

  9. arXiv:2410.18539  [pdf, other

    cs.CV cs.LG

    Interpretable Representation Learning from Videos using Nonlinear Priors

    Authors: Marian Longa, João F. Henriques

    Abstract: Learning interpretable representations of visual data is an important challenge, to make machines' decisions understandable to humans and to improve generalisation outside of the training distribution. To this end, we propose a deep learning framework where one can specify nonlinear priors for videos (e.g. of Newtonian physics) that allow the model to learn interpretable latent variables and use t… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

    Comments: Accepted to BMVC 2024 (Oral)

  10. arXiv:2409.11837  [pdf, other

    eess.IV

    World of Forms: Deformable Geometric Templates for One-Shot Surface Meshing in Coronary CT Angiography

    Authors: Rudolf L. M. van Herten, Ioannis Lagogiannis, Jelmer M. Wolterink, Steffen Bruns, Eva R. Meulendijks, Damini Dey, Joris R. de Groot, José P. Henriques, R. Nils Planken, Simone Saitta, Ivana Išgum

    Abstract: Deep learning-based medical image segmentation and surface mesh generation typically involve a sequential pipeline from image to segmentation to meshes, often requiring large training datasets while making limited use of prior geometric knowledge. This may lead to topological inconsistencies and suboptimal performance in low-data regimes. To address these challenges, we propose a data-efficient de… ▽ More

    Submitted 21 February, 2025; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: Submitted to Medical Image Analysis

  11. arXiv:2409.04196  [pdf, other

    cs.CV cs.AI

    GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers

    Authors: Lorenza Prospero, Abdullah Hamdi, Joao F. Henriques, Christian Rupprecht

    Abstract: Reconstructing posed 3D human models from monocular images has important applications in the sports industry, including performance tracking, injury prevention and virtual training. In this work, we combine 3D human pose and shape estimation with 3D Gaussian Splatting (3DGS), a representation of the scene composed of a mixture of Gaussians. This allows training or fine-tuning a human model predict… ▽ More

    Submitted 16 April, 2025; v1 submitted 6 September, 2024; originally announced September 2024.

    Comments: Camera ready for CVSports workshop at CVPR 2025

  12. arXiv:2409.00851  [pdf, other

    cs.IR cs.LG cs.SD eess.AS

    Dissecting Temporal Understanding in Text-to-Audio Retrieval

    Authors: Andreea-Maria Oncescu, João F. Henriques, A. Sophia Koepke

    Abstract: Recent advancements in machine learning have fueled research on multimodal tasks, such as for instance text-to-video and text-to-audio retrieval. These tasks require models to understand the semantic content of video and audio data, including objects, and characters. The models also need to learn spatial arrangements and temporal relationships. In this work, we analyse the temporal ordering of sou… ▽ More

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: 9 pages, 5 figures, ACM Multimedia 2024, https://www.robots.ox.ac.uk/~vgg/research/audio-retrieval/dtu/

  13. arXiv:2408.14265  [pdf, other

    astro-ph.SR astro-ph.IM

    ALMA Memo 628 -- High-cadence observations of the Sun

    Authors: Sven Wedemeyer, Mikolaj Szydlarski, M. Carmen Toribio, Tobia Carozzi, Daniel Jakobsson, Juan Camilo Guevara Gomez, Henrik Eklund, Vasco M. J. Henriques, Shahin Jafarzadeh, Jaime de la Cruz Rodriguez

    Abstract: The Atacama Large Millimeter/submillimeter Array (ALMA) offers new diagnostic capabilities for studying the Sun, providing complementary insights through high spatial and temporal resolution at millimeter wavelengths. ALMA acts as a linear thermometer for atmospheric gas, aiding in understanding the solar atmosphere's structure, dynamics, and energy balance. Given the Sun's complex emission patter… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: ALMA Memo 628 summarising the results of the ESO-funded ALMA development study "High-cadence Imaging of the Sun", concluded in 2023 (74 pages, 53 figures)

    Report number: ALMA Memo 628

  14. arXiv:2408.10045  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall quant-ph

    Gapless spin excitations in nanographene-based antiferromagnetic spin-1/2 Heisenberg chains

    Authors: Chenxiao Zhao, Lin Yang, João C. G. Henriques, Mar Ferri-Cortés, Gonçalo Catarina, Carlo A. Pignedoli, Ji Ma, Xinliang Feng, Pascal Ruffieux, Joaquín Fernández-Rossier, Roman Fasel

    Abstract: Haldane's seminal work established two fundamentally different types of excitation spectra for antiferromagnetic Heisenberg quantum spin chains: gapped excitations in integer-spin chains and gapless excitations in half-integer-spin chains. In finite-length half-integer spin chains, quantization, however, induces a gap in the excitation spectrum, with the upper bound given by the Lieb-Schulz-Mattis… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 5 figures. arXiv admin note: text overlap with arXiv:2402.13590

  15. arXiv:2408.09860  [pdf, other

    cs.CV cs.AI cs.LG

    3D-Aware Instance Segmentation and Tracking in Egocentric Videos

    Authors: Yash Bhalgat, Vadim Tschernezki, Iro Laina, João F. Henriques, Andrea Vedaldi, Andrew Zisserman

    Abstract: Egocentric videos present unique challenges for 3D scene understanding due to rapid camera motion, frequent object occlusions, and limited object visibility. This paper introduces a novel approach to instance segmentation and tracking in first-person video that leverages 3D awareness to overcome these obstacles. Our method integrates scene geometry, 3D object centroid tracking, and instance segmen… ▽ More

    Submitted 20 November, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: Camera-ready for ACCV 2024. More experiments added

  16. arXiv:2407.20511  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.chem-ph quant-ph

    Building spin-1/2 antiferromagnetic Heisenberg chains with diaza-nanographenes

    Authors: Xiaoshuai Fu, Li Huang, Kun Liu, João C. G. Henriques, Yixuan Gao, Xianghe Han, Hui Chen, Yan Wang, Carlos-Andres Palma, Zhihai Cheng, Xiao Lin, Shixuan Du, Ji Ma, Joaquín Fernández-Rossier, Xinliang Feng, Hong-Jun Gao

    Abstract: Understanding and engineering the coupling of spins in nanomaterials is of central importance for designing novel devices. Graphene nanostructures with π-magnetism offer a chemically tunable platform to explore quantum magnetic interactions. However, realizing spin chains bearing controlled odd-even effects with suitable nanographene systems is challenging. Here, we demonstrate the successful on-s… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Journal ref: Nature Synthesis (2025)

  17. arXiv:2407.18913  [pdf, other

    cs.LG cs.AI

    SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments

    Authors: Shu Ishida, João F. Henriques

    Abstract: This work compares ways of extending Reinforcement Learning algorithms to Partially Observed Markov Decision Processes (POMDPs) with options. One view of options is as temporally extended action, which can be realized as a memory that allows the agent to retain historical information beyond the policy's context window. While option assignment could be handled using heuristics and hand-crafted obje… ▽ More

    Submitted 11 October, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

  18. arXiv:2406.07284  [pdf, other

    cs.CV cs.AI

    Unsupervised Object Detection with Theoretical Guarantees

    Authors: Marian Longa, João F. Henriques

    Abstract: Unsupervised object detection using deep neural networks is typically a difficult problem with few to no guarantees about the learned representation. In this work we present the first unsupervised object detection method that is theoretically guaranteed to recover the true object positions up to quantifiable small shifts. We develop an unsupervised object detection architecture and prove that the… ▽ More

    Submitted 24 October, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to NeurIPS 2024

  19. arXiv:2406.04343  [pdf, ps, other

    cs.CV

    Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image

    Authors: Stanislaw Szymanowicz, Eldar Insafutdinov, Chuanxia Zheng, Dylan Campbell, João F. Henriques, Christian Rupprecht, Andrea Vedaldi

    Abstract: We propose Flash3D, a method for scene reconstruction and novel view synthesis from a single image which is both very generalisable and efficient. For generalisability, we start from a "foundation" model for monocular depth estimation and extend it to a full 3D shape and appearance reconstructor. For efficiency, we base this extension on feed-forward Gaussian Splatting. Specifically, we predict a… ▽ More

    Submitted 1 June, 2025; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Project page: https://www.robots.ox.ac.uk/~vgg/research/flash3d/

  20. arXiv:2406.03428  [pdf, other

    cs.LG

    HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits

    Authors: Tim Franzmeyer, Aleksandar Shtedritski, Samuel Albanie, Philip Torr, João F. Henriques, Jakob N. Foerster

    Abstract: Benchmarks have been essential for driving progress in machine learning. A better understanding of LLM capabilities on real world tasks is vital for safe development. Designing adequate LLM benchmarks is challenging: Data from real-world tasks is hard to collect, public availability of static evaluation data results in test data contamination and benchmark overfitting, and periodically generating… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Findings

  21. arXiv:2405.12896  [pdf, other

    cond-mat.mes-hall

    Giant spatial anisotropy of magnon lifetime in altermagnets

    Authors: A. T. Costa, J. C. G. Henriques, J. Fernández-Rossier

    Abstract: Altermagnets are a new class of magnetic materials with zero net magnetization (like antiferromagnets) but spin-split electronic bands (like ferromagnets) over a fraction of reciprocal space. As in antiferromagnets, magnons in altermagnets come in two flavours, that either add one or remove one unit of spin to the $S=0$ ground state. However, in altermagnets these two magnon modes are non-degenera… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 9 pages, 11 figures (including the appendices)

  22. arXiv:2405.03735  [pdf, other

    cs.LG cs.AI cs.MA

    Select to Perfect: Imitating desired behavior from large multi-agent data

    Authors: Tim Franzmeyer, Edith Elkind, Philip Torr, Jakob Foerster, Joao Henriques

    Abstract: AI agents are commonly trained with large datasets of demonstrations of human behavior. However, not all behaviors are equally safe or desirable. Desired characteristics for an AI agent can be expressed by assigning desirability scores, which we assume are not assigned to individual behaviors but to collective trajectories. For example, in a dataset of vehicle interactions, these scores might rela… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: ICLR 2024

  23. arXiv:2404.10766  [pdf, other

    eess.IV cs.CV

    RapidVol: Rapid Reconstruction of 3D Ultrasound Volumes from Sensorless 2D Scans

    Authors: Mark C. Eid, Pak-Hei Yeung, Madeleine K. Wyburd, João F. Henriques, Ana I. L. Namburete

    Abstract: Two-dimensional (2D) freehand ultrasonography is one of the most commonly used medical imaging modalities, particularly in obstetrics and gynaecology. However, it only captures 2D cross-sectional views of inherently 3D anatomies, losing valuable contextual information. As an alternative to requiring costly and complex 3D ultrasound scanners, 3D volumes can be constructed from 2D scans using machin… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  24. arXiv:2404.01079  [pdf, other

    cs.CV

    Stale Diffusion: Hyper-realistic 5D Movie Generation Using Old-school Methods

    Authors: Joao F. Henriques, Dylan Campbell, Tengda Han

    Abstract: Two years ago, Stable Diffusion achieved super-human performance at generating images with super-human numbers of fingers. Following the steady decline of its technical novelty, we propose Stale Diffusion, a method that solidifies and ossifies Stable Diffusion in a maximum-entropy state. Stable Diffusion works analogously to a barn (the Stable) from which an infinite set of horses have escaped (th… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: SIGBOVIK 2024

  25. Small-scale magnetic flux emergence preceding a chain of energetic solar atmospheric events

    Authors: D. Nóbrega-Siverio, I. Cabello, S. Bose, L. H. M. Rouppe van der Voort, R. Joshi, C. Froment, V. M. J. Henriques

    Abstract: Advancements in instrumentation have revealed a multitude of small-scale EUV events in the solar atmosphere. Our aim is to employ high-resolution magnetograms to gain a detailed understanding of the magnetic origin of such phenomena. We have used coordinated observations from SST, IRIS, and SDO to analyze an ephemeral magnetic flux emergence episode and the following chain of small-scale energetic… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted in A&A, 11 pages, 7 figures, 5 movies

    Journal ref: A&A 686, A218 (2024)

  26. arXiv:2403.10997  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields

    Authors: Yash Bhalgat, Iro Laina, João F. Henriques, Andrew Zisserman, Andrea Vedaldi

    Abstract: Understanding complex scenes at multiple levels of abstraction remains a formidable challenge in computer vision. To address this, we introduce Nested Neural Feature Fields (N2F2), a novel approach that employs hierarchical supervision to learn a single feature field, wherein different dimensions within the same high-dimensional feature encode scene properties at varying granularities. Our method… ▽ More

    Submitted 28 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: ECCV 2024

  27. arXiv:2403.01638  [pdf, other

    cs.CL

    Multi-level Product Category Prediction through Text Classification

    Authors: Wesley Ferreira Maia, Angelo Carmignani, Gabriel Bortoli, Lucas Maretti, David Luz, Daniel Camilo Fuentes Guzman, Marcos Jardel Henriques, Francisco Louzada Neto

    Abstract: This article investigates applying advanced machine learning models, specifically LSTM and BERT, for text classification to predict multiple categories in the retail sector. The study demonstrates how applying data augmentation techniques and the focal loss function can significantly enhance accuracy in classifying products into multiple categories using a robust Brazilian retail dataset. The LSTM… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  28. arXiv:2402.19106  [pdf, other

    eess.AS cs.IR cs.SD

    A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval

    Authors: Andreea-Maria Oncescu, João F. Henriques, Andrew Zisserman, Samuel Albanie, A. Sophia Koepke

    Abstract: Video databases from the internet are a valuable source of text-audio retrieval datasets. However, given that sound and vision streams represent different "views" of the data, treating visual descriptions as audio descriptions is far from optimal. Even if audio class labels are present, they commonly are not very detailed, making them unsuited for text-audio retrieval. To exploit relevant audio in… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 9 pages, 2 figures, 9 tables, Accepted at ICASSP 2024

  29. arXiv:2402.13590  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el physics.chem-ph quant-ph

    Tunable topological phases in nanographene-based spin-1/2 alternating-exchange Heisenberg chains

    Authors: Chenxiao Zhao, Gonçalo Catarina, Jin-Jiang Zhang, João C. G. Henriques, Lin Yang, Ji Ma, Xinliang Feng, Oliver Gröning, Pascal Ruffieux, Joaquín Fernández-Rossier, Roman Fasel

    Abstract: Unlocking the potential of topological order within many-body spin systems has long been a central pursuit in the realm of quantum materials. Despite extensive efforts, the quest for a versatile platform enabling site-selective spin manipulation, essential for tuning and probing diverse topological phases, has persisted. Here, we utilize on-surface synthesis to construct spin-1/2 alternating-excha… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  30. arXiv:2401.10886  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    SCENES: Subpixel Correspondence Estimation With Epipolar Supervision

    Authors: Dominik A. Kloepfer, João F. Henriques, Dylan Campbell

    Abstract: Extracting point correspondences from two or more views of a scene is a fundamental computer vision problem with particular importance for relative camera pose estimation and structure-from-motion. Existing local feature matching approaches, trained with correspondence supervision on large-scale datasets, obtain highly-accurate matches on the test sets. However, they do not generalise well to new… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  31. arXiv:2401.10314  [pdf, other

    cs.SE cs.AI cs.LG cs.RO

    LangProp: A code optimization framework using Large Language Models applied to driving

    Authors: Shu Ishida, Gianluca Corrado, George Fedoseev, Hudson Yeo, Lloyd Russell, Jamie Shotton, João F. Henriques, Anthony Hu

    Abstract: We propose LangProp, a framework for iteratively optimizing code generated by large language models (LLMs), in both supervised and reinforcement learning settings. While LLMs can generate sensible coding solutions zero-shot, they are often sub-optimal. Especially for code generation tasks, it is likely that the initial code will fail on certain edge cases. LangProp automatically evaluates the code… ▽ More

    Submitted 3 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  32. arXiv:2312.04938  [pdf, other

    cond-mat.mes-hall

    Beyond spin models in orbitally-degenerate open-shell nanographenes

    Authors: J. C. G. Henriques, D. Jacob, A. Molina-Sánchez, G. Catarina, A. T. Costa, J. Fernández-Rossier

    Abstract: The study of open-shell nanographenes has relied on a paradigm where spins are the only low-energy degrees of freedom. Here we show that some nanographenes can host low-energy excitations that include strongly coupled spin and orbital degrees of freedom. The key ingredient is the existence of orbital degeneracy, as a consequence of leaving the benzenoid/half-filling scenario. We analyze the case o… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 6 pages, 4 figures

  33. arXiv:2312.04670  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Rapid Motor Adaptation for Robotic Manipulator Arms

    Authors: Yichao Liang, Kevin Ellis, João Henriques

    Abstract: Developing generalizable manipulation skills is a core challenge in embodied AI. This includes generalization across diverse task configurations, encompassing variations in object shape, density, friction coefficient, and external disturbances such as forces applied to the robot. Rapid Motor Adaptation (RMA) offers a promising solution to this challenge. It posits that essential hidden variables i… ▽ More

    Submitted 29 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted at CVPR 2024. 12 pages

  34. arXiv:2312.01783  [pdf, other

    cond-mat.mes-hall cond-mat.str-el

    Designer spin models in tunable two-dimensional nanographene lattices

    Authors: J. C. G. Henriques, Mar Ferri-Cortés, J. Fernández-Rossier

    Abstract: Motivated by recent experimental breakthroughs, we propose a strategy to design two-dimensional spin lattices with competing interactions that lead to non-trivial emergent quantum states. We consider $S=1/2$ nanographenes with $C_3$ symmetry as building blocks, and we leverage the potential to control both the sign and the strength of exchange with first neighbours to build a family of spin models… ▽ More

    Submitted 6 February, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: 6 pages, 4 figures

  35. arXiv:2311.15977  [pdf, other

    cs.CV

    Text2Loc: 3D Point Cloud Localization from Natural Language

    Authors: Yan Xia, Letian Shi, Zifeng Ding, João F. Henriques, Daniel Cremers

    Abstract: We tackle the problem of 3D point cloud localization based on a few natural linguistic descriptions and introduce a novel neural network, Text2Loc, that fully interprets the semantic relationship between points and text. Text2Loc follows a coarse-to-fine localization pipeline: text-submap global place recognition, followed by fine localization. In global place recognition, relational dynamics amon… ▽ More

    Submitted 28 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted by CVPR 2024

  36. arXiv:2310.11297  [pdf, other

    eess.IV

    Automatic Coronary Artery Plaque Quantification and CAD-RADS Prediction using Mesh Priors

    Authors: Rudolf L. M. van Herten, Nils Hampe, Richard A. P. Takx, Klaas Jan Franssen, Yining Wang, Dominika Suchá, José P. Henriques, Tim Leiner, R. Nils Planken, Ivana Išgum

    Abstract: Coronary artery disease (CAD) remains the leading cause of death worldwide. Patients with suspected CAD undergo coronary CT angiography (CCTA) to evaluate the risk of cardiovascular events and determine the treatment. Clinical analysis of coronary arteries in CCTA comprises the identification of atherosclerotic plaque, as well as the grading of any coronary artery stenosis typically obtained throu… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 12 pages, 6 figures, accepted in IEEE Transactions on Medical Imaging

  37. arXiv:2310.01095  [pdf, other

    cs.CV cs.AI

    LoCUS: Learning Multiscale 3D-consistent Features from Posed Images

    Authors: Dominik A. Kloepfer, Dylan Campbell, João F. Henriques

    Abstract: An important challenge for autonomous agents such as robots is to maintain a spatially and temporally consistent model of the world. It must be maintained through occlusions, previously-unseen views, and long time horizons (e.g., loop closure and re-identification). It is still an open question how to train such a versatile neural representation without supervision. We start from the idea that the… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2023, pages 16634-16644

  38. arXiv:2309.11052  [pdf, other

    cs.CL cs.LG stat.ML

    fakenewsbr: A Fake News Detection Platform for Brazilian Portuguese

    Authors: Luiz Giordani, Gilsiley Darú, Rhenan Queiroz, Vitor Buzinaro, Davi Keglevich Neiva, Daniel Camilo Fuentes Guzmán, Marcos Jardel Henriques, Oilson Alberto Gonzatto Junior, Francisco Louzada

    Abstract: The proliferation of fake news has become a significant concern in recent times due to its potential to spread misinformation and manipulate public opinion. This paper presents a comprehensive study on detecting fake news in Brazilian Portuguese, focusing on journalistic-type news. We propose a machine learning-based approach that leverages natural language processing techniques, including TF-IDF… ▽ More

    Submitted 20 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

  39. arXiv:2307.00991  [pdf, other

    cond-mat.mes-hall

    Anatomy of linear and non-linear intermolecular exchange in S = 1 nanographenes

    Authors: J. C. G. Henriques, J. Fernández-Rossier

    Abstract: Nanographene triangulenes with a S = 1 ground state have been used as building blocks of antiferromagnetic Haldane spin chains realizing a symmetry protected topological phase. By means of inelastic electron spectroscopy, it was found that the intermolecular exchange contains both linear and non-linear interactions, realizing the bilinear-biquadratic Hamiltonian. Starting from a Hubbard model, and… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 21 pages, 8 figures

  40. Broken-symmetry magnetic phases in two-dimensional triangulene crystals

    Authors: G. Catarina, J. C. G. Henriques, A. Molina-Sánchez, A. T. Costa, J. Fernández-Rossier

    Abstract: We provide a comprehensive theory of magnetic phases in two-dimensional triangulene crystals, using both Hubbard model and density functional theory (DFT) calculations. We consider centrosymmetric and non-centrosymmetric triangulene crystals. In all cases, DFT and mean-field Hubbard model predict the emergence of broken-symmetry antiferromagnetic (ferrimagnetic) phases for the centrosymmetric (non… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Journal ref: Phys. Rev. Research 5, 043226 (2023)

  41. arXiv:2306.04633  [pdf, other

    cs.CV cs.AI cs.LG

    Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion

    Authors: Yash Bhalgat, Iro Laina, João F. Henriques, Andrew Zisserman, Andrea Vedaldi

    Abstract: Instance segmentation in 3D is a challenging task due to the lack of large-scale annotated datasets. In this paper, we show that this task can be addressed effectively by leveraging instead 2D pre-trained models for instance segmentation. We propose a novel approach to lift 2D segments to 3D and fuse them by means of a neural field representation, which encourages multi-view consistency across fra… ▽ More

    Submitted 1 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 (Spotlight). Code: https://github.com/yashbhalgat/Contrastive-Lift

  42. arXiv:2306.01804  [pdf, other

    cs.LG cs.AI

    Extracting Reward Functions from Diffusion Models

    Authors: Felipe Nuti, Tim Franzmeyer, João F. Henriques

    Abstract: Diffusion models have achieved remarkable results in image generation, and have similarly been used to learn high-performing policies in sequential decision-making tasks. Decision-making diffusion models can be trained on lower-quality data, and then be steered with a reward function to generate near-optimal trajectories. We consider the problem of extracting a reward function by comparing a decis… ▽ More

    Submitted 9 December, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  43. arXiv:2304.00521  [pdf, other

    cs.DL cs.LG

    Large Language Models are Few-shot Publication Scoopers

    Authors: Samuel Albanie, Liliane Momeni, João F. Henriques

    Abstract: Driven by recent advances AI, we passengers are entering a golden age of scientific discovery. But golden for whom? Confronting our insecurity that others may beat us to the most acclaimed breakthroughs of the era, we propose a novel solution to the long-standing personal credit assignment problem to ensure that it is golden for us. At the heart of our approach is a pip-to-the-post algorithm that… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: SIGBOVIK 2023

  44. arXiv:2303.13512  [pdf, other

    cs.AI

    Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

    Authors: Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Sharada Mohanty, Byron Galbraith, Ke Chen, Yan Song, Tianze Zhou, Bingquan Yu, He Liu, Kai Guan, Yujing Hu, Tangjie Lv, Federico Malato, Florian Leopold, Amogh Raut, Ville Hautamäki, Andrew Melnik, Shu Ishida, João F. Henriques, Robert Klassert, Walter Laurito, Ellen Novoseller , et al. (5 additional authors not shown)

    Abstract: To facilitate research in the direction of fine-tuning foundation models from human feedback, we held the MineRL BASALT Competition on Fine-Tuning from Human Feedback at NeurIPS 2022. The BASALT challenge asks teams to compete to develop algorithms to solve tasks with hard-to-specify reward functions in Minecraft. Through this competition, we aimed to promote the development of algorithms that use… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  45. arXiv:2211.15107  [pdf, other

    cs.CV cs.AI cs.LG

    A Light Touch Approach to Teaching Transformers Multi-view Geometry

    Authors: Yash Bhalgat, Joao F. Henriques, Andrew Zisserman

    Abstract: Transformers are powerful visual learners, in large part due to their conspicuous lack of manually-specified priors. This flexibility can be problematic in tasks that involve multiple-view geometry, due to the near-infinite possible variations in 3D shapes and viewpoints (requiring flexibility), and the precise nature of projective geometry (obeying rigid laws). To resolve this conundrum, we propo… ▽ More

    Submitted 2 April, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Camera-ready version. Accepted to CVPR 2023

  46. arXiv:2211.14293  [pdf, other

    cs.CV

    RbA: Segmenting Unknown Regions Rejected by All

    Authors: Nazir Nayal, Mısra Yavuz, João F. Henriques, Fatma Güney

    Abstract: Standard semantic segmentation models owe their success to curated datasets with a fixed set of semantic categories, without contemplating the possibility of identifying unknown objects from novel categories. Existing methods in outlier detection suffer from a lack of smoothness and objectness in their predictions, due to limitations of the per-pixel classification paradigm. Furthermore, additiona… ▽ More

    Submitted 29 March, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

  47. arXiv:2211.12542  [pdf, other

    cs.CV

    CASSPR: Cross Attention Single Scan Place Recognition

    Authors: Yan Xia, Mariia Gladkova, Rui Wang, Qianyun Li, Uwe Stilla, João F. Henriques, Daniel Cremers

    Abstract: Place recognition based on point clouds (LiDAR) is an important component for autonomous robots or self-driving vehicles. Current SOTA performance is achieved on accumulated LiDAR submaps using either point-based or voxel-based structures. While voxel-based approaches nicely integrate spatial context across multiple scales, they do not exhibit the local precision of point-based methods. As a resul… ▽ More

    Submitted 29 August, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted by ICCV2023

  48. arXiv:2210.03144  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Wafer-scale detachable monocrystalline Germanium nanomembranes for the growth of III-V materials and substrate reuse

    Authors: Nicolas Paupy, Zakaria Oulad Elhmaidi, Alexandre Chapotot, Tadeáš Hanuš, Javier Arias-Zapata, Bouraoui Ilahi, Alexandre Heintz, Alex Brice Poungoué Mbeunmi, Roxana Arvinte, Mohammad Reza Aziziyan, Valentin Daniel, Gwenaëlle Hamon, Jérémie Chrétien, Firas Zouaghi, Ahmed Ayari, Laurie Mouchel, Jonathan Henriques, Loïc Demoulin, Thierno Mamoudou Diallo, Philippe-Olivier Provost, Hubert Pelletier, Maïté Volatier, Rufi Kurstjens, Jinyoun Cho, Guillaume Courtois , et al. (10 additional authors not shown)

    Abstract: Germanium (Ge) is increasingly used as a substrate for high-performance optoelectronic, photovoltaic, and electronic devices. These devices are usually grown on thick and rigid Ge substrates manufactured by classical wafering techniques. Nanomembranes (NMs) provide an alternative to this approach while offering wafer-scale lateral dimensions, weight reduction, limitation of waste, and cost effecti… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: 17 pages and 6 figures along with 3 figures in supporting information

  49. arXiv:2209.12093  [pdf, other

    cs.AI

    Learn what matters: cross-domain imitation learning with task-relevant embeddings

    Authors: Tim Franzmeyer, Philip H. S. Torr, João F. Henriques

    Abstract: We study how an autonomous agent learns to perform a task from demonstrations in a different domain, such as a different environment or different agent. Such cross-domain imitation learning is required to, for example, train an artificial agent from demonstrations of a human expert. We propose a scalable framework that enables cross-domain imitation learning without access to additional demonstrat… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022

  50. arXiv:2208.04405  [pdf, other

    cs.LG cs.MA stat.ME

    Recovering the Graph Underlying Networked Dynamical Systems under Partial Observability: A Deep Learning Approach

    Authors: Sérgio Machado, Anirudh Sridhar, Paulo Gil, Jorge Henriques, José M. F. Moura, Augusto Santos

    Abstract: We study the problem of graph structure identification, i.e., of recovering the graph of dependencies among time series. We model these time series data as components of the state of linear stochastic networked dynamical systems. We assume partial observability, where the state evolution of only a subset of nodes comprising the network is observed. We devise a new feature vector computed from the… ▽ More

    Submitted 12 April, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted at The 37th AAAI Conference on Artificial Intelligence (main track)

    MSC Class: 62D20; 93B30 ACM Class: I.2.m; G.3