-
Keystep Recognition using Graph Neural Networks
Authors:
Julia Lee Romero,
Kyle Min,
Subarna Tripathi,
Morteza Karimzadeh
Abstract:
We pose keystep recognition as a node classification task, and propose a flexible graph-learning framework for fine-grained keystep recognition that is able to effectively leverage long-term dependencies in egocentric videos. Our approach, termed GLEVR, consists of constructing a graph where each video clip of the egocentric video corresponds to a node. The constructed graphs are sparse and comput…
▽ More
We pose keystep recognition as a node classification task, and propose a flexible graph-learning framework for fine-grained keystep recognition that is able to effectively leverage long-term dependencies in egocentric videos. Our approach, termed GLEVR, consists of constructing a graph where each video clip of the egocentric video corresponds to a node. The constructed graphs are sparse and computationally efficient, outperforming existing larger models substantially. We further leverage alignment between egocentric and exocentric videos during training for improved inference on egocentric videos, as well as adding automatic captioning as an additional modality. We consider each clip of each exocentric video (if available) or video captions as additional nodes during training. We examine several strategies to define connections across these nodes. We perform extensive experiments on the Ego-Exo4D dataset and show that our proposed flexible graph-based framework notably outperforms existing methods.
△ Less
Submitted 1 June, 2025;
originally announced June 2025.
-
Area and volume from entangled qubits
Authors:
Juan M. Romero,
Emiliano Montoya-González
Abstract:
In this paper a relation between entangled states and geometry is studied. In particular, the area of 2D parallelogram is obtained from an entangled 4-qubit state. In addition, the vector area of a 3D parallelogram is derived from entangled 6-qubit states. Moreover, the volume of a 3D parallelepiped is deduced from an entangled 9-qubit state. Furthermore, it has been provided the quantum circuit i…
▽ More
In this paper a relation between entangled states and geometry is studied. In particular, the area of 2D parallelogram is obtained from an entangled 4-qubit state. In addition, the vector area of a 3D parallelogram is derived from entangled 6-qubit states. Moreover, the volume of a 3D parallelepiped is deduced from an entangled 9-qubit state. Furthermore, it has been provided the quantum circuit in qiskit code for these entangled states. It is worth mentioning that parallelograms and parallelepipeds serve as fundamental building blocks for more sophisticated geometric structures.
△ Less
Submitted 16 May, 2025;
originally announced May 2025.
-
Enhanced Photonic Chip Design via Interpretable Machine Learning Techniques
Authors:
Lirandë Pira,
Airin Antony,
Nayanthara Prathap,
Daniel Peace,
Jacquiline Romero
Abstract:
Photonic chip design has seen significant advancements with the adoption of inverse design methodologies, offering flexibility and efficiency in optimizing device performance. However, the black-box nature of the optimization approaches, such as those used in inverse design in order to minimize a loss function or maximize coupling efficiency, poses challenges in understanding the outputs. This cha…
▽ More
Photonic chip design has seen significant advancements with the adoption of inverse design methodologies, offering flexibility and efficiency in optimizing device performance. However, the black-box nature of the optimization approaches, such as those used in inverse design in order to minimize a loss function or maximize coupling efficiency, poses challenges in understanding the outputs. This challenge is prevalent in machine learning-based optimization methods, which can suffer from the same lack of transparency. To this end, interpretability techniques address the opacity of optimization models. In this work, we apply interpretability techniques from machine learning, with the aim of gaining understanding of inverse design optimization used in designing photonic components, specifically two-mode multiplexers. We base our methodology on the widespread interpretability technique known as local interpretable model-agnostic explanations, or LIME. As a result, LIME-informed insights point us to more effective initial conditions, directly improving device performance. This demonstrates that interpretability methods can do more than explain models -- they can actively guide and enhance the inverse-designed photonic components. Our results demonstrate the ability of interpretable techniques to reveal underlying patterns in the inverse design process, leading to the development of better-performing components.
△ Less
Submitted 14 May, 2025;
originally announced May 2025.
-
Near-unity quantum interference of transverse spatial modes in an ultra-compact inverse-designed photonic device
Authors:
Jamika Ann Roque,
Daniel Peace,
Simon White,
Emanuele Polino,
Sayantan Das,
Farzard Ghafari,
Sergei Slussarenko,
Nora Tischler,
Jacquiline Romero
Abstract:
The transverse spatial mode of photons is an untapped resource for scaling up integrated photonic quantum computing. To be practically useful for improving scalability, reliable and high-visibility quantum interference between transverse spatial modes on-chip needs to be demonstrated. We show repeatable quantum interference using inverse-designed transverse mode beamsplitters that have an ultra-co…
▽ More
The transverse spatial mode of photons is an untapped resource for scaling up integrated photonic quantum computing. To be practically useful for improving scalability, reliable and high-visibility quantum interference between transverse spatial modes on-chip needs to be demonstrated. We show repeatable quantum interference using inverse-designed transverse mode beamsplitters that have an ultra-compact footprint of 3 $μm$ $\times$ 3 $μm$ -- the smallest transverse mode beamsplitters for 1550 nm photons to date. We measure a Hong-Ou-Mandel visibility of up to 99.56$\pm$0.64 % from a single device, with an average visibility across three identical devices of 99.38$\pm$0.41 %, indicating a high degree of reproducibility. Our work demonstrates that inverse-designed components are suitable for engineering quantum interference on-chip of multimode devices, paving the way for future compact integrated quantum photonic devices that exploit the transverse spatial mode of photons for high-dimensional quantum information.
△ Less
Submitted 13 May, 2025;
originally announced May 2025.
-
Study of beta spectrum shapes relevant to the prediction of reactor antineutrino spectra
Authors:
G. A. Alcalá,
A. Algora,
M. Estienne,
M. Fallot,
V. Guadilla,
A. Beloeuvre,
W. Gelletly,
R. Kean,
A. Porta,
S. Bouvier,
J. -S. Stutzmann,
E. Bonnet,
T. Eronen,
D. Etasse,
J. Agramunt,
J. L. Tain,
H. Garcia Cabrera,
L. Giot,
A. Laureau,
J. A. Victoria,
Y. Molla,
A. Jaries,
L. Al Ayoubi,
O. Beliuskina,
W. Gins
, et al. (13 additional authors not shown)
Abstract:
The shapes of the beta spectra of 92Rb and 142Cs, two of the beta decays most relevant for the prediction of the antineutrino spectrum in reactors, have been measured. A new setup composed of two dE-E telescopes has been used. High purity radioactive beams of the isotopes of interest were provided by the IGISOL facility using the JYFLTRAP double Penning trap. The resulting beta spectra have been c…
▽ More
The shapes of the beta spectra of 92Rb and 142Cs, two of the beta decays most relevant for the prediction of the antineutrino spectrum in reactors, have been measured. A new setup composed of two dE-E telescopes has been used. High purity radioactive beams of the isotopes of interest were provided by the IGISOL facility using the JYFLTRAP double Penning trap. The resulting beta spectra have been compared with model predictions using beta decay feedings from total absorption gamma spectroscopy measurements and shape corrections employed in the calculation of the antineutrino spectrum, validating both further. The procedure can be extended to other relevant nuclei in the future, providing solid ground for the prediction of the antineutrino spectrum in reactors.
△ Less
Submitted 9 May, 2025;
originally announced May 2025.
-
Prescribed-Time Boresight Control of Spacecraft Under Pointing Constraints
Authors:
Xiaodong Shao,
Haoyang Yang,
Haoran Li,
Zongyu Zuo,
Jose Guadalupe Romero,
Qinglei Hu
Abstract:
This article proposes an integrated boresight guidance and control (IBGC) scheme to address the boresight reorientation problem of spacecraft under temporal and pointing constraints. A $C^1$ continuous, saturated prescribed-time adjustment (PPTA) function is presented, along with the establishment of a practical prescribed-time stability criterion. Utilizing the time scale transformation technique…
▽ More
This article proposes an integrated boresight guidance and control (IBGC) scheme to address the boresight reorientation problem of spacecraft under temporal and pointing constraints. A $C^1$ continuous, saturated prescribed-time adjustment (PPTA) function is presented, along with the establishment of a practical prescribed-time stability criterion. Utilizing the time scale transformation technique and the PPTA function, we propose a prescribed-time guidance law that guides the boresight vector from almost any initial orientation in free space to a small neighborhood of the goal orientation within a preassigned time, while avoiding all forbidden zones augmented with safety margins. Subsequently, a prescribed-time disturbance observer (PTDO) is derived to reconstruct the external disturbances. By leveraging barrier and PPTA functions, a PTDO-based reduced-attitude tracking controller is developed, which ensures prescribed-time boresight tracking within a ``safe tube''. By judiciously setting the safety margins, settling times, and safe tube for the guidance and control laws, the proposed IBGC scheme achieves pointing-constrained boresight reorientation within a required task completion time. Simulation and experimental results demonstrate the efficacy of the proposed IBGC scheme.
△ Less
Submitted 16 April, 2025; v1 submitted 5 April, 2025;
originally announced April 2025.
-
FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images
Authors:
Rong Wang,
Fabian Prada,
Ziyan Wang,
Zhongshi Jiang,
Chengxiang Yin,
Junxuan Li,
Shunsuke Saito,
Igor Santesteban,
Javier Romero,
Rohan Joshi,
Hongdong Li,
Jason Saragih,
Yaser Sheikh
Abstract:
We present a novel method for reconstructing personalized 3D human avatars with realistic animation from only a few images. Due to the large variations in body shapes, poses, and cloth types, existing methods mostly require hours of per-subject optimization during inference, which limits their practical applications. In contrast, we learn a universal prior from over a thousand clothed humans to ac…
▽ More
We present a novel method for reconstructing personalized 3D human avatars with realistic animation from only a few images. Due to the large variations in body shapes, poses, and cloth types, existing methods mostly require hours of per-subject optimization during inference, which limits their practical applications. In contrast, we learn a universal prior from over a thousand clothed humans to achieve instant feedforward generation and zero-shot generalization. Specifically, instead of rigging the avatar with shared skinning weights, we jointly infer personalized avatar shape, skinning weights, and pose-dependent deformations, which effectively improves overall geometric fidelity and reduces deformation artifacts. Moreover, to normalize pose variations and resolve coupled ambiguity between canonical shapes and skinning weights, we design a 3D canonicalization process to produce pixel-aligned initial conditions, which helps to reconstruct fine-grained geometric details. We then propose a multi-frame feature aggregation to robustly reduce artifacts introduced in canonicalization and fuse a plausible avatar preserving person-specific identities. Finally, we train the model in an end-to-end framework on a large-scale capture dataset, which contains diverse human subjects paired with high-quality 3D scans. Extensive experiments show that our method generates more authentic reconstruction and animation than state-of-the-arts, and can be directly generalized to inputs from casually taken phone photos. Project page and code is available at https://github.com/rongakowang/FRESA.
△ Less
Submitted 4 April, 2025; v1 submitted 24 March, 2025;
originally announced March 2025.
-
Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars
Authors:
Tobias Kirschstein,
Javier Romero,
Artem Sevastopolsky,
Matthias Nießner,
Shunsuke Saito
Abstract:
Traditionally, creating photo-realistic 3D head avatars requires a studio-level multi-view capture setup and expensive optimization during test-time, limiting the use of digital human doubles to the VFX industry or offline renderings.
To address this shortcoming, we present Avat3r, which regresses a high-quality and animatable 3D head avatar from just a few input images, vastly reducing compute…
▽ More
Traditionally, creating photo-realistic 3D head avatars requires a studio-level multi-view capture setup and expensive optimization during test-time, limiting the use of digital human doubles to the VFX industry or offline renderings.
To address this shortcoming, we present Avat3r, which regresses a high-quality and animatable 3D head avatar from just a few input images, vastly reducing compute requirements during inference. More specifically, we make Large Reconstruction Models animatable and learn a powerful prior over 3D human heads from a large multi-view video dataset. For better 3D head reconstructions, we employ position maps from DUSt3R and generalized feature maps from the human foundation model Sapiens. To animate the 3D head, our key discovery is that simple cross-attention to an expression code is already sufficient. Finally, we increase robustness by feeding input images with different expressions to our model during training, enabling the reconstruction of 3D head avatars from inconsistent inputs, e.g., an imperfect phone capture with accidental movement, or frames from a monocular video.
We compare Avat3r with current state-of-the-art methods for few-input and single-input scenarios, and find that our method has a competitive advantage in both tasks. Finally, we demonstrate the wide applicability of our proposed model, creating 3D head avatars from images of different sources, smartphone captures, single images, and even out-of-domain inputs like antique busts.
Project website: https://tobias-kirschstein.github.io/avat3r/
△ Less
Submitted 27 February, 2025;
originally announced February 2025.
-
Phase dependence of the Thermal Memory Effect in Polycrystalline Ribbon and Bulk Ni55Fe19Ga26 Heusler Alloys
Authors:
A. Vidal-Crespo,
A. F. Manchón-Gordón,
J. M. Martín-Olalla,
F. J. Romero,
J. J. Ipus,
M. C. Gallardo,
J. S. Blázquez,
C. F. Conde
Abstract:
The thermal memory effect, TME, has been studied in Ni55Fe19Ga26 shape memory alloys, fabricated as ribbons via melt-spinning and as pellets via arc-melting, to evaluate its dependence on the martensitic structure and the macrostructure of the samples. When the reverse martensitic transformation is interrupted, a kinetic delay in the subsequent complete transformation is only evident in the ribbon…
▽ More
The thermal memory effect, TME, has been studied in Ni55Fe19Ga26 shape memory alloys, fabricated as ribbons via melt-spinning and as pellets via arc-melting, to evaluate its dependence on the martensitic structure and the macrostructure of the samples. When the reverse martensitic transformation is interrupted, a kinetic delay in the subsequent complete transformation is only evident in the ribbon samples, where the 14M modulated structure is the dominant phase. In contrast, degradation of the modulated structure or the presence of the gamma-phase significantly reduces the observed TME. In such cases, the magnitude of the TME approaches the detection limits of commercial calorimeters, and only high-resolution calorimeter at very low heating rate (40 mK h-1) can show the effect. Following the kinetic arrest and subsequent cooling, the reverse martensitic transformation was completed at several heating rates to confirm the athermal nature of the phenomenon.
△ Less
Submitted 4 February, 2025; v1 submitted 3 February, 2025;
originally announced February 2025.
-
Relightable Full-Body Gaussian Codec Avatars
Authors:
Shaofei Wang,
Tomas Simon,
Igor Santesteban,
Timur Bagautdinov,
Junxuan Li,
Vasu Agrawal,
Fabian Prada,
Shoou-I Yu,
Pace Nalbone,
Matt Gramlich,
Roman Lubachersky,
Chenglei Wu,
Javier Romero,
Jason Saragih,
Michael Zollhoefer,
Andreas Geiger,
Siyu Tang,
Shunsuke Saito
Abstract:
We propose Relightable Full-Body Gaussian Codec Avatars, a new approach for modeling relightable full-body avatars with fine-grained details including face and hands. The unique challenge for relighting full-body avatars lies in the large deformations caused by body articulation and the resulting impact on appearance caused by light transport. Changes in body pose can dramatically change the orien…
▽ More
We propose Relightable Full-Body Gaussian Codec Avatars, a new approach for modeling relightable full-body avatars with fine-grained details including face and hands. The unique challenge for relighting full-body avatars lies in the large deformations caused by body articulation and the resulting impact on appearance caused by light transport. Changes in body pose can dramatically change the orientation of body surfaces with respect to lights, resulting in both local appearance changes due to changes in local light transport functions, as well as non-local changes due to occlusion between body parts. To address this, we decompose the light transport into local and non-local effects. Local appearance changes are modeled using learnable zonal harmonics for diffuse radiance transfer. Unlike spherical harmonics, zonal harmonics are highly efficient to rotate under articulation. This allows us to learn diffuse radiance transfer in a local coordinate frame, which disentangles the local radiance transfer from the articulation of the body. To account for non-local appearance changes, we introduce a shadow network that predicts shadows given precomputed incoming irradiance on a base mesh. This facilitates the learning of non-local shadowing between the body parts. Finally, we use a deferred shading approach to model specular radiance transfer and better capture reflections and highlights such as eye glints. We demonstrate that our approach successfully models both the local and non-local light transport required for relightable full-body avatars, with a superior generalization ability under novel illumination conditions and unseen poses.
△ Less
Submitted 24 January, 2025;
originally announced January 2025.
-
Graph-Based Multimodal and Multi-view Alignment for Keystep Recognition
Authors:
Julia Lee Romero,
Kyle Min,
Subarna Tripathi,
Morteza Karimzadeh
Abstract:
Egocentric videos capture scenes from a wearer's viewpoint, resulting in dynamic backgrounds, frequent motion, and occlusions, posing challenges to accurate keystep recognition. We propose a flexible graph-learning framework for fine-grained keystep recognition that is able to effectively leverage long-term dependencies in egocentric videos, and leverage alignment between egocentric and exocentric…
▽ More
Egocentric videos capture scenes from a wearer's viewpoint, resulting in dynamic backgrounds, frequent motion, and occlusions, posing challenges to accurate keystep recognition. We propose a flexible graph-learning framework for fine-grained keystep recognition that is able to effectively leverage long-term dependencies in egocentric videos, and leverage alignment between egocentric and exocentric videos during training for improved inference on egocentric videos. Our approach consists of constructing a graph where each video clip of the egocentric video corresponds to a node. During training, we consider each clip of each exocentric video (if available) as additional nodes. We examine several strategies to define connections across these nodes and pose keystep recognition as a node classification task on the constructed graphs. We perform extensive experiments on the Ego-Exo4D dataset and show that our proposed flexible graph-based framework notably outperforms existing methods by more than 12 points in accuracy. Furthermore, the constructed graphs are sparse and compute efficient. We also present a study examining on harnessing several multimodal features, including narrations, depth, and object class labels, on a heterogeneous graph and discuss their corresponding contribution to the keystep recognition performance.
△ Less
Submitted 7 January, 2025;
originally announced January 2025.
-
Model agnostic signal encoding by leaky integrate and fire, performance and uncertainty
Authors:
Diana Carbajal,
José Luis Romero
Abstract:
Integrate and fire is a resource efficient time-encoding mechanism that summarizes into a signed spike train those time intervals where a signal's charge exceeds a certain threshold. We analyze the IF encoder in terms of a very general notion of approximate bandwidth, which is shared by most commonly-used signal models. This complements results on exact encoding that may be overly adapted to a par…
▽ More
Integrate and fire is a resource efficient time-encoding mechanism that summarizes into a signed spike train those time intervals where a signal's charge exceeds a certain threshold. We analyze the IF encoder in terms of a very general notion of approximate bandwidth, which is shared by most commonly-used signal models. This complements results on exact encoding that may be overly adapted to a particular signal model. We take into account, possibly for the first time, the effect of uncertainty in the exact location of the spikes (as may arise by decimation), uncertainty of integration leakage (as may arise in realistic manufacturing), and boundary effects inherent to finite periods of exposure to the measurement device. The analysis is done by means of a concrete bandwidth-based Ansatz that can also be useful to initialize more sophisticated model specific reconstruction algorithms, and uses the earth mover's (Wassertein) distance to measure spike discrepancy.
△ Less
Submitted 26 March, 2025; v1 submitted 17 December, 2024;
originally announced December 2024.
-
Self-guided tomography of time-frequency qudits
Authors:
Laura Serino,
Markus Rambach,
Benjamin Brecht,
Jacquiline Romero,
Christine Silberhorn
Abstract:
High-dimensional time-frequency encodings have the potential to significantly advance quantum information science; however, practical applications require precise knowledge of the encoded quantum states, which becomes increasingly challenging for larger Hilbert spaces. Self-guided tomography (SGT) has emerged as a practical and scalable technique for this purpose in the spatial domain. Here, we ap…
▽ More
High-dimensional time-frequency encodings have the potential to significantly advance quantum information science; however, practical applications require precise knowledge of the encoded quantum states, which becomes increasingly challenging for larger Hilbert spaces. Self-guided tomography (SGT) has emerged as a practical and scalable technique for this purpose in the spatial domain. Here, we apply SGT to estimate time-frequency states using a multi-output quantum pulse gate. We achieve fidelities of more than 99\% for 3- and 5-dimensional states without the need for calibration or post-processing. We demonstrate the robustness of SGT against statistical and environmental noise, highlighting its efficacy in the photon-starved regime typical of quantum information applications.
△ Less
Submitted 28 November, 2024;
originally announced November 2024.
-
TIMBRE: Efficient Job Recommendation On Heterogeneous Graphs For Professional Recruiters
Authors:
Eric Behar,
Julien Romero,
Amel Bouzeghoub,
Katarzyna Wegrzyn-Wolska
Abstract:
Job recommendation gathers many challenges well-known in recommender systems. First, it suffers from the cold start problem, with the user (the candidate) and the item (the job) having a very limited lifespan. It makes the learning of good user and item representations hard. Second, the temporal aspect is crucial: We cannot recommend an item in the future or too much in the past. Therefore, using…
▽ More
Job recommendation gathers many challenges well-known in recommender systems. First, it suffers from the cold start problem, with the user (the candidate) and the item (the job) having a very limited lifespan. It makes the learning of good user and item representations hard. Second, the temporal aspect is crucial: We cannot recommend an item in the future or too much in the past. Therefore, using solely collaborative filtering barely works. Finally, it is essential to integrate information about the users and the items, as we cannot rely only on previous interactions. This paper proposes a temporal graph-based method for job recommendation: TIMBRE (Temporal Integrated Model for Better REcommendations). TIMBRE integrates user and item information into a heterogeneous graph. This graph is adapted to allow efficient temporal recommendation and evaluation, which is later done using a graph neural network. Finally, we evaluate our approach with recommender system metrics, rarely computed on graph-based recommender systems.
△ Less
Submitted 6 November, 2024;
originally announced November 2024.
-
Immersion of General Nonlinear Systems Into State-Affine Ones for the Design of Generalized Parameter Estimation-Based Observers: A Simple Algebraic Procedure
Authors:
Romeo Ortega,
Alexey Bobtsov,
Jose Guadalupe Romero,
Leyan Fang
Abstract:
Generalized parameter estimation-based observers have proven very successful to deal with systems described in state-affine form. In this paper, we enlarge the domain of applicability of this method proposing an algebraic procedure to immerse} an $n$-dimensional general nonlinear system into and $n_z$-dimensional system in state affine form, with $n_z>n$. First, we recall the necessary and suffici…
▽ More
Generalized parameter estimation-based observers have proven very successful to deal with systems described in state-affine form. In this paper, we enlarge the domain of applicability of this method proposing an algebraic procedure to immerse} an $n$-dimensional general nonlinear system into and $n_z$-dimensional system in state affine form, with $n_z>n$. First, we recall the necessary and sufficient condition for the solution of the general problem, which requires the solution of a partial differential equation that, moreover, has to satisfy a restrictive injectivity condition. Given the complexity of this task we propose an alternative simple algebraic method to identify the required dynamic extension and coordinate transformation, a procedure that, as shown in the paper, is rather natural for physical systems. We illustrate the method with some academic benchmark examples from observer theory literature -- that, in spite of their apparent simplicity, are difficult to solve with the existing methods -- as well as several practically relevant physical examples.
△ Less
Submitted 17 November, 2024;
originally announced November 2024.
-
Enhancing sharp augmented Lagrangian methods with smoothing techniques for nonlinear programming
Authors:
José Luis Romero,
Damián Fernandez,
Germán Ariel Torres
Abstract:
This paper proposes a novel approach to solving nonlinear programming problems using a sharp augmented Lagrangian method with a smoothing technique. Traditional sharp augmented Lagrangian methods are known for their effectiveness but are often hindered by the need for global minimization of nonconvex, nondifferentiable functions at each iteration. To address this challenge, we introduce a smoothin…
▽ More
This paper proposes a novel approach to solving nonlinear programming problems using a sharp augmented Lagrangian method with a smoothing technique. Traditional sharp augmented Lagrangian methods are known for their effectiveness but are often hindered by the need for global minimization of nonconvex, nondifferentiable functions at each iteration. To address this challenge, we introduce a smoothing function that approximates the sharp augmented Lagrangian, enabling the use of primal minimization strategies similar to those in Powell--Hestenes--Rockafellar (PHR) methods. Our approach retains the theoretical rigor of classical duality schemes while allowing for the use of stationary points in the primal optimization process. We present two algorithms based on this method--one utilizing standard descent and the other employing coordinate descent. Numerical experiments demonstrate that our smoothing--based method compares favorably with the PHR augmented Lagrangian approach, offering both robustness and practical efficiency. The proposed method is particularly advantageous in scenarios where exact minimization is computationally infeasible, providing a balance between theoretical precision and computational tractability.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
Experimental demonstration of Robust Amplitude Estimation on near-term quantum devices for chemistry applications
Authors:
Alexander Kunitsa,
Nicole Bellonzi,
Shangjie Guo,
Jérôme F. Gonthier,
Corneliu Buda,
Clena M. Abuan,
Jhonathan Romero
Abstract:
This study explores hardware implementation of Robust Amplitude Estimation (RAE) on IBM quantum devices, demonstrating its application in quantum chemistry for one- and two-qubit Hamiltonian systems. Known for potentially offering quadratic speedups over traditional methods in estimating expectation values, RAE is evaluated under realistic noisy conditions. Our experiments provide detailed insight…
▽ More
This study explores hardware implementation of Robust Amplitude Estimation (RAE) on IBM quantum devices, demonstrating its application in quantum chemistry for one- and two-qubit Hamiltonian systems. Known for potentially offering quadratic speedups over traditional methods in estimating expectation values, RAE is evaluated under realistic noisy conditions. Our experiments provide detailed insights into the practical challenges associated with RAE. We achieved a significant reduction in sampling requirements compared to direct measurement techniques. In estimating the ground state energy of the hydrogen molecule, the RAE implementation demonstrated two orders of magnitude better accuracy for the two-qubit experiments and achieved chemical accuracy. These findings reveal its potential to enhance computational efficiencies in quantum chemistry applications despite the inherent limitations posed by hardware noise. We also found that its performance can be adversely impacted by coherent error and device stability and does not always correlate with the average gate error. These results underscore the importance of adapting quantum computational methods to hardware specifics to realize their full potential in practical scenarios.
△ Less
Submitted 1 October, 2024;
originally announced October 2024.
-
Digital Advertising in a Post-Cookie World: Charting the Impact of Google's Topics API
Authors:
Jesús Romero,
Ángel Cuevas,
Rubén Cuevas
Abstract:
Integrating Google's Topics API into the digital advertising ecosystem represents a significant shift toward privacy-conscious advertising practices. This article analyses the implications of implementing Topics API on ad networks, focusing on competition dynamics and ad space accessibility. Through simulations based on extensive datasets capturing user behavior and market share data for ad networ…
▽ More
Integrating Google's Topics API into the digital advertising ecosystem represents a significant shift toward privacy-conscious advertising practices. This article analyses the implications of implementing Topics API on ad networks, focusing on competition dynamics and ad space accessibility. Through simulations based on extensive datasets capturing user behavior and market share data for ad networks, we evaluate metrics such as Ad Placement Eligibility, Low Competition Rate, and solo competitor. The findings reveal a noticeable impact on ad networks, with larger players strengthening their dominance and smaller networks facing challenges securing ad spaces and competing effectively. Moreover, the study explores the potential environmental implications of Google's actions, highlighting the need to carefully consider policy and regulatory measures to ensure fair competition and privacy protection. Overall, this research contributes valuable insights into the evolving dynamics of digital advertising and highlights the importance of balancing privacy with competition and innovation in the online advertising landscape.
△ Less
Submitted 21 September, 2024;
originally announced September 2024.
-
Alternative Bell's states and teleportation
Authors:
Juan M. Romero,
Emiliano Montoya-Gonzalez,
Oscar Velazquez-Alvarado
Abstract:
Bell's states are among the most useful in quantum computing. These state are an orthonormal base of entagled states with two qubits. We propose alternative bases of entangled states. Some of these states depend on a continuous parameter. We present the quantum circuit and code of these alternative bases. In addition, we study quantum teleportation with these entangled states and present their qua…
▽ More
Bell's states are among the most useful in quantum computing. These state are an orthonormal base of entagled states with two qubits. We propose alternative bases of entangled states. Some of these states depend on a continuous parameter. We present the quantum circuit and code of these alternative bases. In addition, we study quantum teleportation with these entangled states and present their quantum circuits and codes associated.
△ Less
Submitted 23 October, 2024; v1 submitted 10 September, 2024;
originally announced September 2024.
-
On chip high-dimensional entangled photon sources
Authors:
Tavshabad Kaur,
Daniel Peace,
Jacquiline Romero
Abstract:
High-dimensional quantum entanglement is an important resource for emerging quantum technologies such as quantum communication and quantum computation. The scalability of metres-long experimental setups limits high-dimensional entanglement in bulk optics. Advancements in quantum technology hinge on reproducible, and reconfigurable quantum devices -- including photon sources, which are challenging…
▽ More
High-dimensional quantum entanglement is an important resource for emerging quantum technologies such as quantum communication and quantum computation. The scalability of metres-long experimental setups limits high-dimensional entanglement in bulk optics. Advancements in quantum technology hinge on reproducible, and reconfigurable quantum devices -- including photon sources, which are challenging to achieve in a scalable manner using bulk optics. Advances in nanotechnology and CMOS-compatible integration techniques have enabled the generation of entangled photons on millimeter-scale chips, significantly enhancing scalability, stability, replicability, and miniaturization for real-world quantum applications. In recent years we have seen several chip-scale demonstrations with different degrees of freedom including path, frequency-bin, time-bin, and transverse modes, on many material platforms. A complete quantum photonic integrated circuit requires the generation, manipulation, and detection of qudits, involving various active and passive quantum photonic components which further increase the degree of complexity. Here, we review and introduce the nonlinear optical processes that facilitate on-chip high-dimensional entangled photon sources and the currently used material platforms. We discuss a range of current implementations of on-chip high-dimensional entangled photon sources and demonstrated applications. We comment on the current challenges due to the limitations of individual material platforms and present future opportunities in hybrid and heterogeneous integration strategies for the next generation of integrated quantum photonic chips.
△ Less
Submitted 4 September, 2024;
originally announced September 2024.
-
Reasoning about Study Regulations in Answer Set Programming
Authors:
Susana Hahn,
Cedric Martens,
Amade Nemes,
Henry Otunuya,
Javier Romero,
Torsten Schaub,
Sebastian Schellhorn
Abstract:
We are interested in automating reasoning with and about study regulations, catering to various stakeholders, ranging from administrators, over faculty, to students at different stages. Our work builds on an extensive analysis of various study programs at the University of Potsdam. The conceptualization of the underlying principles provides us with a formal account of study regulations. In particu…
▽ More
We are interested in automating reasoning with and about study regulations, catering to various stakeholders, ranging from administrators, over faculty, to students at different stages. Our work builds on an extensive analysis of various study programs at the University of Potsdam. The conceptualization of the underlying principles provides us with a formal account of study regulations. In particular, the formalization reveals the properties of admissible study plans. With these at end, we propose an encoding of study regulations in Answer Set Programming that produces corresponding study plans. Finally, we show how this approach can be extended to a generic user interface for exploring study plans.
△ Less
Submitted 8 August, 2024;
originally announced August 2024.
-
Three-dimensional ultrasound-based online system for automated ovarian follicle measurement
Authors:
Pedro Royo,
Elkin Muñoz,
José-Enrique Romero,
José-Vicente Manjón,
Catalina Roig,
Carmen Fernández-Delgado,
Nuria Muñiz,
Antonio Requena,
Nicolás Garrido,
Juan Antonio García- Velasco,
Antonio Pellicer
Abstract:
Ultrasound follicle tracking is an important part of cycle monitoring. OSIS Ovary (Online System for Image Segmentation for the Ovary) has been conceived aiming to aid the management of the workflow in follicle tracking, one of the most iterative procedures in cycle monitoring during ovarian stimulation. In the present study, we compared OSIS Ovary (as three-dimensional ultrasound-based automated…
▽ More
Ultrasound follicle tracking is an important part of cycle monitoring. OSIS Ovary (Online System for Image Segmentation for the Ovary) has been conceived aiming to aid the management of the workflow in follicle tracking, one of the most iterative procedures in cycle monitoring during ovarian stimulation. In the present study, we compared OSIS Ovary (as three-dimensional ultrasound-based automated system) with the two-dimensional manual standard measurement method, in order to assess the reliability of the main measurements obtained to track follicle growth during ovarian stimulation cycles, the follicle size and count. Based on the mean follicle diameter and follicle count values obtained, the Pearson/intraclass correlation coefficients were 0.976/0.987 and 0.804/0.889 in >=10mm follicles, 0.989/0.994 and 0.809/0.867 in >=13mm follicles and 0.995/0.997 and 0.791/0.840 in >=16mm follicles. The mean difference (MnD) for the mean diameter and follicle count was, respectively, 0.759/0.161 in >=10mm follicles, 0.486/1.033 in >=13mm follicles and 0.784/0.486 in >=16mm follicles. The upper and lower limits of agreement (ULA and LLA) were 3.641/2.123 and 5.392/3.070 in >=10mm follicles, 3.496/2.522 and 4.285/2.218 in >=13mm follicles, and 3.723/2.153 and 2.432/1.459 in >=16mm follicles. The limits of agreement range (LoAR) were 5.764/8.462 in >=10mm follicles, 6.048/6.503 in >=13mm follicles and 5.876/3.891 in >=16mm follicles. P<0.05 was considered for all calculations. As three-dimensional ultrasound-based automated system in comparison with two-dimensional manual method standard, we found OSIS Ovary as a reliable tool to track follicle growth during ovarian stimulation cycles
△ Less
Submitted 26 July, 2024;
originally announced July 2024.
-
Hölder-Continuity of Extreme Spectral Values of Pseudodifferential Operators, Gabor Frame Bounds, and Saturation
Authors:
Karlheinz Gröchenig,
José Luis Romero,
Michael Speckbacher
Abstract:
We build on our recent results on the Lipschitz dependence of the extreme spectral values of one-parameter families of pseudodifferential operators with symbols in a weighted Sjöstrand class. We prove that larger symbol classes lead to Hölder continuity with respect to the parameter. This result is then used to investigate the behavior of frame bounds of families of Gabor systems…
▽ More
We build on our recent results on the Lipschitz dependence of the extreme spectral values of one-parameter families of pseudodifferential operators with symbols in a weighted Sjöstrand class. We prove that larger symbol classes lead to Hölder continuity with respect to the parameter. This result is then used to investigate the behavior of frame bounds of families of Gabor systems $\mathcal{G}(g,αΛ)$ with respect to the parameter $α>0$, where $Λ$ is a set of non-uniform, relatively separated time-frequency shifts, and $g\in M^1_s(\mathbb{R}^d)$, $0\leq s\leq 2$. In particular, we show that the frame bounds depend continuously on $α$ if $g\in M^1(\mathbb{R}^d)$, and are Hölder continuous if $g\in M^1_s(\mathbb{R}^d)$, $0<s\leq 2$, with the Hölder exponent explicitly given.
△ Less
Submitted 25 July, 2024;
originally announced July 2024.
-
Quantum Entanglement, Quantum Teleportation, Multilinear Polynomials and Geometry
Authors:
Juan M. Romero,
Emiliano Montoya-Gonzalez,
Oscar Velazquez-Alvarado
Abstract:
We show that quantum entanglement states are associated with multilinear polynomials that cannot be factored. By using these multilinear polynomials, we propose a geometric representation for entanglement states. In particular, we show that the Bell's states are associated with non-factorable real multilinear polynomial, which can be represented geometrically by three-dimensional surfaces. Further…
▽ More
We show that quantum entanglement states are associated with multilinear polynomials that cannot be factored. By using these multilinear polynomials, we propose a geometric representation for entanglement states. In particular, we show that the Bell's states are associated with non-factorable real multilinear polynomial, which can be represented geometrically by three-dimensional surfaces. Furthermore, in this framework, we show that a quantum circuit can be seen as a geometric transformations of plane geometry. This phenomenon is analogous to gravity, where matter curves space-time. In addition, we show an analogy between quantum teleportation and operations involving multilinear polynomials.
△ Less
Submitted 18 October, 2024; v1 submitted 24 July, 2024;
originally announced July 2024.
-
Stable Machine-Learning Parameterization of Subgrid Processes in a Comprehensive Atmospheric Model Learned From Embedded Convection-Permitting Simulations
Authors:
Zeyuan Hu,
Akshay Subramaniam,
Zhiming Kuang,
Jerry Lin,
Sungduk Yu,
Walter M. Hannah,
Noah D. Brenowitz,
Josh Romero,
Michael S. Pritchard
Abstract:
Modern climate projections often suffer from inadequate spatial and temporal resolution due to computational limitations, resulting in inaccurate representations of sub-grid processes. A promising technique to address this is the Multiscale Modeling Framework (MMF), which embeds a kilometer-resolution cloud-resolving model within each atmospheric column of a host climate model to replace tradition…
▽ More
Modern climate projections often suffer from inadequate spatial and temporal resolution due to computational limitations, resulting in inaccurate representations of sub-grid processes. A promising technique to address this is the Multiscale Modeling Framework (MMF), which embeds a kilometer-resolution cloud-resolving model within each atmospheric column of a host climate model to replace traditional convection and cloud parameterizations. Machine learning (ML) offers a unique opportunity to make MMF more accessible by emulating the embedded cloud-resolving model and reducing its substantial computational cost. Although many studies have demonstrated proof-of-concept success of achieving stable hybrid simulations, it remains a challenge to achieve near operational-level success with real geography and comprehensive variable emulation that includes, for example, explicit cloud condensate coupling. In this study, we present a stable hybrid model capable of integrating for at least 5 years with near operational-level complexity, including coarse-grid geography, seasonality, explicit cloud condensate and wind predictions, and land coupling. Our model demonstrates skillful online performance, achieving a 5-year zonal mean tropospheric temperature bias within 2K, water vapor bias within 1 g/kg, and a precipitation RMSE of 0.96 mm/day. Key factors contributing to our online performance include an expressive U-Net architecture and physical thermodynamic constraints for microphysics. With microphysical constraints mitigating unrealistic cloud formation, our work is the first to demonstrate realistic multi-year cloud condensate climatology under the MMF framework. Despite these advances, online diagnostics reveal persistent biases in certain regions, highlighting the need for innovative strategies to further optimize online performance.
△ Less
Submitted 31 January, 2025; v1 submitted 27 June, 2024;
originally announced July 2024.
-
Hyperuniformity and non-hyperuniformity of zeros of Gaussian Weyl-Heisenberg Functions
Authors:
Naomi Feldheim,
Antti Haimi,
Günther Koliander,
José Luis Romero
Abstract:
We study zero sets of twisted stationary Gaussian random functions on the complex plane, i.e., Gaussian random functions that are stochastically invariant under the action of the Weyl-Heisenberg group. This model includes translation invariant Gaussian entire functions (GEFs), and also many other non-analytic examples, in which case winding numbers around zeros can be either positive or negative.…
▽ More
We study zero sets of twisted stationary Gaussian random functions on the complex plane, i.e., Gaussian random functions that are stochastically invariant under the action of the Weyl-Heisenberg group. This model includes translation invariant Gaussian entire functions (GEFs), and also many other non-analytic examples, in which case winding numbers around zeros can be either positive or negative. We investigate zero statistics both when zeros are weighted with their winding numbers (charged zero set) and when they are not (uncharged zero set). We show that the variance of the charged zero statistic always grows linearly with the radius of the observation disk (hyperuniformity). Importantly, this holds for functions with possibly non-zero means and without assuming additional symmetries such as radiality. With respect to uncharged zero statistics, we provide an example for which the variance grows with the area of the observation disk (non-hyperuniformity). This is used to show that, while the zeros of GEFs are hyperuniform, the set of their critical points fails to be so. Our work contributes to recent developments in statistical signal processing, where the time-frequency profile of a non-stationary signal embedded into noise is revealed by performing a statistical test on the zeros of its spectrogram (``silent points''). We show that empirical spectrogram zero counts enjoy moderate deviation from their ensemble averages over large observation windows (something that was previously known only for pure noise). In contrast, we also show that spectogram maxima (``loud points") fail to enjoy a similar property. This gives the first formal evidence for the statistical superiority of silent points over the competing feature of loud points, a fact that has been noted by practitioners.
△ Less
Submitted 17 September, 2024; v1 submitted 28 June, 2024;
originally announced June 2024.
-
$N$-bein formalism for the parameter space of quantum geometry
Authors:
Jorge Romero,
Carlos A. Velasquez,
J David Vergara
Abstract:
This work introduces a geometrical object that generalizes the quantum geometric tensor; we call it $N$-bein. Analogous to the vielbein (orthonormal frame) used in the Cartan formalism, the $N$-bein behaves like a ``square root'' of the quantum geometric tensor. Using it, we present a quantum geometric tensor of two states that measures the possibility of moving from one state to another after two…
▽ More
This work introduces a geometrical object that generalizes the quantum geometric tensor; we call it $N$-bein. Analogous to the vielbein (orthonormal frame) used in the Cartan formalism, the $N$-bein behaves like a ``square root'' of the quantum geometric tensor. Using it, we present a quantum geometric tensor of two states that measures the possibility of moving from one state to another after two consecutive parameter variations. This new tensor determines the commutativity of such variations through its anti-symmetric part. In addition, we define a connection different from the Berry connection, and combining it with the $N$-bein allows us to introduce a notion of torsion and curvature à la Cartan that satisfies the Bianchi identities. Moreover, the torsion coincides with the anti-symmetric part of the two-state quantum geometric tensor previously mentioned, and thus, it is related to the commutativity of the parameter variations. We also describe our formalism using differential forms and discuss the possible physical interpretations of the new geometrical objects. Furthermore, we define different gauge invariants constructed from the geometrical quantities introduced in this work, resulting in new physical observables. Finally, we present two examples to illustrate these concepts: a harmonic oscillator and a generalized oscillator, both immersed in an electric field. We found that the new tensors quantify correlations between quantum states that were unavailable by other methods.
△ Less
Submitted 14 August, 2024; v1 submitted 27 June, 2024;
originally announced June 2024.
-
Optimizing measurement tradeoffs in multiparameter spatial superresolution
Authors:
J. Řeháček,
J. L. Romero,
A. Z. Goldberg,
Z. Hradil,
L. L. Sánchez-Soto
Abstract:
The quantum Cramér-Rao bound for the joint estimation of the centroid and the separation between two incoherent point sources cannot be saturated. As such, the optimal measurements for extracting maximal information about both at the same time are not known. In this work, we ascertain these optimal measurements for an arbitrary point spread function, in the most relevant regime of a small separati…
▽ More
The quantum Cramér-Rao bound for the joint estimation of the centroid and the separation between two incoherent point sources cannot be saturated. As such, the optimal measurements for extracting maximal information about both at the same time are not known. In this work, we ascertain these optimal measurements for an arbitrary point spread function, in the most relevant regime of a small separation between the sources. Our measurement can be adjusted within a set of tradeoffs, allowing more information to be extracted from the separation or the centroid while ensuring that the total information is the maximum possible.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Towards Explainable Test Case Prioritisation with Learning-to-Rank Models
Authors:
Aurora Ramírez,
Mario Berrios,
José Raúl Romero,
Robert Feldt
Abstract:
Test case prioritisation (TCP) is a critical task in regression testing to ensure quality as software evolves. Machine learning has become a common way to achieve it. In particular, learning-to-rank (LTR) algorithms provide an effective method of ordering and prioritising test cases. However, their use poses a challenge in terms of explainability, both globally at the model level and locally for p…
▽ More
Test case prioritisation (TCP) is a critical task in regression testing to ensure quality as software evolves. Machine learning has become a common way to achieve it. In particular, learning-to-rank (LTR) algorithms provide an effective method of ordering and prioritising test cases. However, their use poses a challenge in terms of explainability, both globally at the model level and locally for particular results. Here, we present and discuss scenarios that require different explanations and how the particularities of TCP (multiple builds over time, test case and test suite variations, etc.) could influence them. We include a preliminary experiment to analyse the similarity of explanations, showing that they do not only vary depending on test case-specific predictions, but also on the relative ranks.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Ultraslow calorimetric studies of the martensitic transformation of NiFeGa alloys: detection and analysis of avalanche phenomena
Authors:
José-María Martín-Olalla,
Antonio Vidal-Crespo,
Alejandro F. Manchón-Gordón,
Francisco Javier Romero,
Javier S. Blázquez,
María Carmen Gallardo,
Clara F. Conde
Abstract:
We study the thermal properties of a bulk Ni55Fe19Ga26 Heusler alloy in a conduction calorimeter. At slow heating and cooling rates (1K/h), we compare as-cast and annealed samples. We report a smaller thermal hysteresis after the thermal treatment due to the stabilization of the 14M modulated structure in the martensite phase. In ultraslow experiments (40mK/h), we detect and analyze the calorimetr…
▽ More
We study the thermal properties of a bulk Ni55Fe19Ga26 Heusler alloy in a conduction calorimeter. At slow heating and cooling rates (1K/h), we compare as-cast and annealed samples. We report a smaller thermal hysteresis after the thermal treatment due to the stabilization of the 14M modulated structure in the martensite phase. In ultraslow experiments (40mK/h), we detect and analyze the calorimetric avalanches associated with the direct and reverse martensitic transformation from cubic to 14M phase. This reveals a distribution of events characterized by a power law with exponential cutoff $p(u) \propto u^{-\varepsilon}\exp(-u/ξ)$ where $\varepsilon\sim 2$ and damping energies $ξ=370$uJ (direct) and $ξ=27$uJ (reverse) that characterize the asymmetry of the transformation.
△ Less
Submitted 19 May, 2024;
originally announced May 2024.
-
Enhancement of swimmer diffusion through regular kicks: analytic mapping of a scale independent parameter space
Authors:
Arnau Jurado Romero,
Carles Calero,
Rossend Rey
Abstract:
Depending on their mechanism of self-propulsion, active particles can exhibit a time-dependent, often periodic, propulsion velocity. The precise propulsion velocity profile determines their mean square displacement and their effective diffusion coefficient at long times. Here we demonstrate that any periodic propulsion profile results in a larger diffusion coefficient than the corresponding case w…
▽ More
Depending on their mechanism of self-propulsion, active particles can exhibit a time-dependent, often periodic, propulsion velocity. The precise propulsion velocity profile determines their mean square displacement and their effective diffusion coefficient at long times. Here we demonstrate that any periodic propulsion profile results in a larger diffusion coefficient than the corresponding case with constant propulsion velocity. We investigate in detail the case of periodic exponentially decaying velocity pulses, expected in propulsion mechanisms based on sudden absorption of finite amounts of energy. We show both analytically and with numerical simulations that in these cases the effective diffusion coefficient can be arbitrarily enhanced with respect to the case with constant velocity equal to the average speed. Our results may help interpret in a new light observations on the diffusion enhancement of active particles.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Phase transition and polar cluster behavior above Curie temperature in ferroelectric BaTi$_{0.8}$Zr$_{0.2}$O$_3$
Authors:
Oktay Aktas,
Francisco Javier Romero,
Zhengwang He,
Gan Linyu,
Xiangdong Ding,
José-María Martín-Olalla,
Maria-Carmen Gallardo,
Turab Lookman
Abstract:
We study the phase transition behavior of the ferroelectric BaTi$_{0.8}$Zr$_{0.2}$O$_3$ in the paraelectric region. The temperature dependencies of thermal, polar, elastic and dielectric properties indicate the presence of local structures above the paraelectric-ferroelectric transition temperature Tc = 292 K. The non-zero remnant polarization is measured up to a characteristic temperature T* ~350…
▽ More
We study the phase transition behavior of the ferroelectric BaTi$_{0.8}$Zr$_{0.2}$O$_3$ in the paraelectric region. The temperature dependencies of thermal, polar, elastic and dielectric properties indicate the presence of local structures above the paraelectric-ferroelectric transition temperature Tc = 292 K. The non-zero remnant polarization is measured up to a characteristic temperature T* ~350 K, which coincides with the temperature where the dielectric constant deviates from Curie-Weiss law. Resonant Piezoelectric Spectroscopy shows that DC field-cooling above Tc using fields smaller than the coercive field leads to an elastic response and remnant piezoelectricity below T*, which likely corresponds to the coherence temperature associated with polar nanostructures in ferroelectrics. The observed remnant effect is attributed to the reorientation of polar nanostructures above Tc.
△ Less
Submitted 6 May, 2024; v1 submitted 30 April, 2024;
originally announced April 2024.
-
Photonic Quantum Computing
Authors:
Jacquiline Romero,
Gerard Milburn
Abstract:
Photonic quantum computation refers to quantum computation that uses photons as the physical system for doing the quantum computation. Photons are ideal quantum systems because they operate at room temperature, and photonic technologies are relatively mature. The field is largely divided between discrete- and continuous-variable photonic quantum computation. In discrete-variable (DV) photonic quan…
▽ More
Photonic quantum computation refers to quantum computation that uses photons as the physical system for doing the quantum computation. Photons are ideal quantum systems because they operate at room temperature, and photonic technologies are relatively mature. The field is largely divided between discrete- and continuous-variable photonic quantum computation. In discrete-variable (DV) photonic quantum computation, quantum information is represented by one or more modal properties (e.g. polarization) that take on distinct values from a finite set. Quantum information is processed via operations on these modal properties and eventually measured using single photon detectors. In continuous-variable (CV) photonic quantum computation, quantum information is represented by properties of the electromagnetic field that take on any value in an interval (e.g. position). The electromagnetic field is transformed via Gaussian and non-Gaussian operations, and then detected via homodyne detection. Both CV and DV photonic quantum computation have been realized experimentally and they each have a unique set of challenges that need to be overcome to achieve scalable photonic universal quantum computation. This article is an introduction to photonic quantum computing, charting its development from the early days of linear optical quantum computing to recent developments in quantum machine learning.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
JCLEC-MO: a Java suite for solving many-objective optimization engineering problems
Authors:
Aurora Ramírez,
José Raúl Romero,
Carlos García-Martínez,
Sebastián Ventura
Abstract:
Although metaheuristics have been widely recognized as efficient techniques to solve real-world optimization problems, implementing them from scratch remains difficult for domain-specific experts without programming skills. In this scenario, metaheuristic optimization frameworks are a practical alternative as they provide a variety of algorithms composed of customized elements, as well as experime…
▽ More
Although metaheuristics have been widely recognized as efficient techniques to solve real-world optimization problems, implementing them from scratch remains difficult for domain-specific experts without programming skills. In this scenario, metaheuristic optimization frameworks are a practical alternative as they provide a variety of algorithms composed of customized elements, as well as experimental support. Recently, many engineering problems require to optimize multiple or even many objectives, increasing the interest in appropriate metaheuristic algorithms and frameworks that might integrate new specific requirements while maintaining the generality and reusability principles they were conceived for. Based on this idea, this paper introduces JCLEC-MO, a Java framework for both multi- and many-objective optimization that enables engineers to apply, or adapt, a great number of multi-objective algorithms with little coding effort. A case study is developed and explained to show how JCLEC-MO can be used to address many-objective engineering problems, often requiring the inclusion of domain-specific elements, and to analyze experimental outcomes by means of conveniently connected R utilities.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Evolving machine learning workflows through interactive AutoML
Authors:
Rafael Barbudo,
Aurora Ramírez,
José Raúl Romero
Abstract:
Automatic workflow composition (AWC) is a relevant problem in automated machine learning (AutoML) that allows finding suitable sequences of preprocessing and prediction models together with their optimal hyperparameters. This problem can be solved using evolutionary algorithms and, in particular, grammar-guided genetic programming (G3P). Current G3P approaches to AWC define a fixed grammar that fo…
▽ More
Automatic workflow composition (AWC) is a relevant problem in automated machine learning (AutoML) that allows finding suitable sequences of preprocessing and prediction models together with their optimal hyperparameters. This problem can be solved using evolutionary algorithms and, in particular, grammar-guided genetic programming (G3P). Current G3P approaches to AWC define a fixed grammar that formally specifies how workflow elements can be combined and which algorithms can be included. In this paper we present \ourmethod, an interactive G3P algorithm that allows users to dynamically modify the grammar to prune the search space and focus on their regions of interest. Our proposal is the first to combine the advantages of a G3P method with ideas from interactive optimisation and human-guided machine learning, an area little explored in the context of AutoML. To evaluate our approach, we present an experimental study in which 20 participants interact with \ourmethod to evolve workflows according to their preferences. Our results confirm that the collaboration between \ourmethod and humans allows us to find high-performance workflows in terms of accuracy that require less tuning time than those found without human intervention.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Precision mass measurements in the zirconium region pin down the mass surface across the neutron midshell at $N=66$
Authors:
M. Hukkanen,
W. Ryssens,
P. Ascher,
M. Bender,
T. Eronen,
S. Grévy,
A. Kankainen,
M. Stryjczyk,
O. Beliuskina,
Z. Ge,
S. Geldhof,
M. Gerbaux,
W. Gins,
A. Husson,
D. A. Nesterenko,
A. Raggio,
M. Reponen,
S. Rinta-Antila,
J. Romero,
A. de Roubin,
V. Virtanen,
A. Zadvornaya
Abstract:
Precision mass measurements of $^{104}$Y, $^{106}$Zr, $^{104,104m,109}$Nb, and $^{111,112}$Mo have been performed with the JYFLTRAP double Penning trap mass spectrometer at the Ion Guide Isotope Separator On-Line facility. The order of the long-lived states in $^{104}$Nb was unambiguously established. The trend in two-neutron separation energies around the $N=66$ neutron midshell appeared to be st…
▽ More
Precision mass measurements of $^{104}$Y, $^{106}$Zr, $^{104,104m,109}$Nb, and $^{111,112}$Mo have been performed with the JYFLTRAP double Penning trap mass spectrometer at the Ion Guide Isotope Separator On-Line facility. The order of the long-lived states in $^{104}$Nb was unambiguously established. The trend in two-neutron separation energies around the $N=66$ neutron midshell appeared to be steeper with respect to the Atomic Mass Evaluation 2020 extrapolations for the $_{39}$Y and $_{40}$Zr isotopic chains and less steep for the $_{41}$Nb chain, indicating a possible gap opening around $Z=40$. The experimental results were compared to the BSkG2 model calculations performed with and without vibrational and rotational corrections. All of them predict two low-lying minima for $^{106}$Zr. While the unaltered BSkG2 model fails to predict the trend in two-neutron separation energies, selecting the more deformed minima in calculations and removing the vibrational correction, the calculations are more in line with experimental data. The same is also true for the $2^+_1$ excitation energies and differences in charge radii in the Zr isotopes. The results stress the importance of improved treatment of collective corrections in large-scale models and further development of beyond-mean-field techniques.
△ Less
Submitted 10 July, 2024; v1 submitted 19 February, 2024;
originally announced February 2024.
-
Artificial Intelligence-Enabled Optimization of Battery-Grade Lithium Carbonate Production
Authors:
S. Shayan Mousavi Masouleh,
Corey A. Sanz,
Ryan P. Jansonius,
Samuel Shi,
Maria J. Gendron Romero,
Jason E. Hein,
Jason Hattrick-Simpers
Abstract:
By 2035, the need for battery-grade lithium is expected to quadruple. About half of this lithium is currently sourced from brines and must be converted from a chloride into lithium carbonate (Li2CO3) through a process called softening. Conventional softening methods using sodium or potassium salts contribute to carbon emissions during reagent mining and battery manufacturing, exacerbating global w…
▽ More
By 2035, the need for battery-grade lithium is expected to quadruple. About half of this lithium is currently sourced from brines and must be converted from a chloride into lithium carbonate (Li2CO3) through a process called softening. Conventional softening methods using sodium or potassium salts contribute to carbon emissions during reagent mining and battery manufacturing, exacerbating global warming. This study introduces an alternative approach using carbon dioxide (CO2(g)) as the carbonating reagent in the lithium softening process, offering a carbon capture solution. We employed an active learning-driven high-throughput method to rapidly capture CO2(g) and convert it to lithium carbonate. The model was simplified by focusing on the elemental concentrations of C, Li, and N for practical measurement and tracking, avoiding the complexities of ion speciation equilibria. This approach led to an optimized lithium carbonate process that capitalizes on CO2(g) capture and improves the battery metal supply chain's carbon efficiency.
△ Less
Submitted 20 February, 2024; v1 submitted 10 February, 2024;
originally announced February 2024.
-
Grammar-based evolutionary approach for automated workflow composition with domain-specific operators and ensemble diversity
Authors:
Rafael Barbudo,
Aurora Ramírez,
José Raúl Romero
Abstract:
The process of extracting valuable and novel insights from raw data involves a series of complex steps. In the realm of Automated Machine Learning (AutoML), a significant research focus is on automating aspects of this process, specifically tasks like selecting algorithms and optimising their hyper-parameters. A particularly challenging task in AutoML is automatic workflow composition (AWC). AWC a…
▽ More
The process of extracting valuable and novel insights from raw data involves a series of complex steps. In the realm of Automated Machine Learning (AutoML), a significant research focus is on automating aspects of this process, specifically tasks like selecting algorithms and optimising their hyper-parameters. A particularly challenging task in AutoML is automatic workflow composition (AWC). AWC aims to identify the most effective sequence of data preprocessing and ML algorithms, coupled with their best hyper-parameters, for a specific dataset. However, existing AWC methods are limited in how many and in what ways they can combine algorithms within a workflow.
Addressing this gap, this paper introduces EvoFlow, a grammar-based evolutionary approach for AWC. EvoFlow enhances the flexibility in designing workflow structures, empowering practitioners to select algorithms that best fit their specific requirements. EvoFlow stands out by integrating two innovative features. First, it employs a suite of genetic operators, designed specifically for AWC, to optimise both the structure of workflows and their hyper-parameters. Second, it implements a novel updating mechanism that enriches the variety of predictions made by different workflows. Promoting this diversity helps prevent the algorithm from overfitting. With this aim, EvoFlow builds an ensemble whose workflows differ in their misclassified instances.
To evaluate EvoFlow's effectiveness, we carried out empirical validation using a set of classification benchmarks. We begin with an ablation study to demonstrate the enhanced performance attributable to EvoFlow's unique components. Then, we compare EvoFlow with other AWC approaches, encompassing both evolutionary and non-evolutionary techniques. Our findings show that EvoFlow's specialised genetic operators and updating mechanism substantially outperform current leading methods[..]
△ Less
Submitted 3 February, 2024;
originally announced February 2024.
-
On the generalization of learned constraints for ASP solving in temporal domains
Authors:
Javier Romero,
Torsten Schaub,
Klaus Strauch
Abstract:
The representation of a dynamic problem in ASP usually boils down to using copies of variables and constraints, one for each time stamp, no matter whether it is directly encoded or via an action or temporal language. The multiplication of variables and constraints is commonly done during grounding and the solver is completely ignorant about the temporal relationship among the different instances.…
▽ More
The representation of a dynamic problem in ASP usually boils down to using copies of variables and constraints, one for each time stamp, no matter whether it is directly encoded or via an action or temporal language. The multiplication of variables and constraints is commonly done during grounding and the solver is completely ignorant about the temporal relationship among the different instances. On the other hand, a key factor in the performance of today's ASP solvers is conflict-driven constraint learning. Our question is now whether a constraint learned for particular time steps can be generalized and reused at other time stamps, and ultimately whether this enhances the overall solver performance on temporal problems. Knowing full well the domain of time, we study conditions under which learned dynamic constraints can be generalized. We propose a simple translation of the original logic program such that, for the translated programs, the learned constraints can be generalized to other time points. Additionally, we identify a property of temporal problems that allows us to generalize all learned constraints to all time steps. It turns out that this property is satisfied by many planning problems. Finally, we empirically evaluate the impact of adding the generalized constraints to an ASP solver. Under consideration in Theory and Practice of Logic Programming (TPLP).
△ Less
Submitted 15 October, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
Artificial intelligence to automate the systematic review of scientific literature
Authors:
José de la Torre-López,
Aurora Ramírez,
José Raúl Romero
Abstract:
Artificial intelligence (AI) has acquired notorious relevance in modern computing as it effectively solves complex tasks traditionally done by humans. AI provides methods to represent and infer knowledge, efficiently manipulate texts and learn from vast amount of data. These characteristics are applicable in many activities that human find laborious or repetitive, as is the case of the analysis of…
▽ More
Artificial intelligence (AI) has acquired notorious relevance in modern computing as it effectively solves complex tasks traditionally done by humans. AI provides methods to represent and infer knowledge, efficiently manipulate texts and learn from vast amount of data. These characteristics are applicable in many activities that human find laborious or repetitive, as is the case of the analysis of scientific literature. Manually preparing and writing a systematic literature review (SLR) takes considerable time and effort, since it requires planning a strategy, conducting the literature search and analysis, and reporting the findings. Depending on the area under study, the number of papers retrieved can be of hundreds or thousands, meaning that filtering those relevant ones and extracting the key information becomes a costly and error-prone process. However, some of the involved tasks are repetitive and, therefore, subject to automation by means of AI. In this paper, we present a survey of AI techniques proposed in the last 15 years to help researchers conduct systematic analyses of scientific literature. We describe the tasks currently supported, the types of algorithms applied, and available tools proposed in 34 primary studies. This survey also provides a historical perspective of the evolution of the field and the role that humans can play in an increasingly automated SLR process.
△ Less
Submitted 13 January, 2024;
originally announced January 2024.
-
High-precision mass measurements of neutron deficient silver isotopes probe the robustness of the $N$ = 50 shell closure
Authors:
Zhuang Ge,
Mikael Reponen,
Tommi Eronen,
Baishan Hu,
Markus Kortelainen,
Anu Kankainen,
Iain Moore,
Dmitrii Nesterenko,
Cenxi Yuan,
Olga Beliuskina,
Laetitia Cañete,
Ruben de Groote,
Celement Delafosse,
Pierre Delahaye,
Timo Dickel,
Antoine de Roubin,
Sarina Geldhof,
Wouter Gins,
Jason Holt,
Marjut Hukkanen,
Arthur Jaries,
Ari Jokinen,
Ágota Koszorús,
Gabriella Kripkó-Koncz,
Sonja Kujanpää
, et al. (14 additional authors not shown)
Abstract:
High-precision mass measurements of exotic $^{95-97}$Ag isotopes close to the $N = Z$ line have been conducted with the JYFLTRAP double Penning trap mass spectrometer, with the silver ions produced using the recently commissioned inductively-heated hot cavity catcher laser ion source at the Ion Guide Isotope Separator On-Line facility. The atomic mass of $^{95}$Ag was directly determined for the f…
▽ More
High-precision mass measurements of exotic $^{95-97}$Ag isotopes close to the $N = Z$ line have been conducted with the JYFLTRAP double Penning trap mass spectrometer, with the silver ions produced using the recently commissioned inductively-heated hot cavity catcher laser ion source at the Ion Guide Isotope Separator On-Line facility. The atomic mass of $^{95}$Ag was directly determined for the first time. In addition, the atomic masses of $β$-decaying 2$^+$ and 8$^+$ states in $^{96}$Ag have been identified and measured for the first time, and the precision of the $^{97}$Ag mass has been improved. The newly measured masses, with a precision of $\approx$ 1 keV/c$^2$, have been used to investigate the $N =$ 50 neutron shell closure confirming it to be robust. Empirical shell-gap and pairing energies determined with the new ground-state mass data are compared with the state-of-the-art \textit{ab initio} calculations with various chiral effective field theory Hamiltonians. The precise determination of the excitation energy of the $^{96m}$Ag isomer in particular serves as a benchmark for \textit{ab initio} predictions of nuclear properties beyond the ground state, specifically for odd-odd nuclei situated in proximity to the proton dripline below $^{100}$Sn. In addition, density functional theory (DFT) calculations and configuration-interaction shell-model (CISM) calculations are compared with the experimental results. All theoretical approaches face challenges to reproduce the trend of nuclear ground-state properties in the silver isotopic chain across the $N =$50 neutron shell and toward the proton drip-line.
△ Less
Submitted 14 June, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
Multipoles from Majorana constellations
Authors:
J. L. Romero,
A. B. Klimov,
A. Z. Goldberg,
G. Leuchs,
L. L. Sanchez-Soto
Abstract:
Majorana stars, the $2S$ spin coherent states that are orthogonal to a spin-$S$ state, offer an elegant method to visualize quantum states, disclosing their intrinsic symmetries. These states are naturally described by the corresponding multipoles. These quantities can be experimentally determined and allow for an SU(2)-invariant analysis. We investigate the relationship between Majorana constella…
▽ More
Majorana stars, the $2S$ spin coherent states that are orthogonal to a spin-$S$ state, offer an elegant method to visualize quantum states, disclosing their intrinsic symmetries. These states are naturally described by the corresponding multipoles. These quantities can be experimentally determined and allow for an SU(2)-invariant analysis. We investigate the relationship between Majorana constellations and state multipoles, thus providing insights into the underlying symmetries of the system. We illustrate our approach with some relevant and informative examples.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
InterEvo-TR: Interactive Evolutionary Test Generation With Readability Assessment
Authors:
Pedro Delgado-Pérez,
Aurora Ramírez,
Kevin J. Valle-Gómez,
Inmaculada Medina-Bulo,
José Raúl Romero
Abstract:
Automated test case generation has proven to be useful to reduce the usually high expenses of software testing. However, several studies have also noted the skepticism of testers regarding the comprehension of generated test suites when compared to manually designed ones. This fact suggests that involving testers in the test generation process could be helpful to increase their acceptance of autom…
▽ More
Automated test case generation has proven to be useful to reduce the usually high expenses of software testing. However, several studies have also noted the skepticism of testers regarding the comprehension of generated test suites when compared to manually designed ones. This fact suggests that involving testers in the test generation process could be helpful to increase their acceptance of automatically-produced test suites. In this paper, we propose incorporating interactive readability assessments made by a tester into EvoSuite, a widely-known evolutionary test generation tool. Our approach, InterEvo-TR, interacts with the tester at different moments during the search and shows different test cases covering the same coverage target for their subjective evaluation. The design of such an interactive approach involves a schedule of interaction, a method to diversify the selected targets, a plan to save and handle the readability values, and some mechanisms to customize the level of engagement in the revision, among other aspects. To analyze the potential and practicability of our proposal, we conduct a controlled experiment in which 39 participants, including academics, professional developers, and student collaborators, interact with InterEvo-TR. Our results show that the strategy to select and present intermediate results is effective for the purpose of readability assessment. Furthermore, the participants' actions and responses to a questionnaire allowed us to analyze the aspects influencing test code readability and the benefits and limitations of an interactive approach in the context of test case generation, paving the way for future developments based on interactivity.
△ Less
Submitted 13 January, 2024;
originally announced January 2024.
-
GEML: A Grammar-based Evolutionary Machine Learning Approach for Design-Pattern Detection
Authors:
Rafael Barbudo,
Aurora Ramírez,
Francisco Servant,
José Raúl Romero
Abstract:
Design patterns (DPs) are recognised as a good practice in software development. However, the lack of appropriate documentation often hampers traceability, and their benefits are blurred among thousands of lines of code. Automatic methods for DP detection have become relevant but are usually based on the rigid analysis of either software metrics or specific properties of the source code. We propos…
▽ More
Design patterns (DPs) are recognised as a good practice in software development. However, the lack of appropriate documentation often hampers traceability, and their benefits are blurred among thousands of lines of code. Automatic methods for DP detection have become relevant but are usually based on the rigid analysis of either software metrics or specific properties of the source code. We propose GEML, a novel detection approach based on evolutionary machine learning using software properties of diverse nature. Firstly, GEML makes use of an evolutionary algorithm to extract those characteristics that better describe the DP, formulated in terms of human-readable rules, whose syntax is conformant with a context-free grammar. Secondly, a rule-based classifier is built to predict whether new code contains a hidden DP implementation. GEML has been validated over five DPs taken from a public repository recurrently adopted by machine learning studies. Then, we increase this number up to 15 diverse DPs, showing its effectiveness and robustness in terms of detection capability. An initial parameter study served to tune a parameter setup whose performance guarantees the general applicability of this approach without the need to adjust complex parameters to a specific pattern. Finally, a demonstration tool is also provided.
△ Less
Submitted 13 January, 2024;
originally announced January 2024.
-
First investigation on the isomeric ratio in multinucleon transfer reactions: Entrance channel effects on the spin distribution
Authors:
D. Kumar,
T. Dickel,
A. Zadvornaya,
O. Beliuskin,
A. Kankainen,
P. Constantin,
S. Purushothaman,
A. Spataru,
M. Stryjczyk,
L. Al Ayoubi,
M. Brunet,
L. Canete,
C. Delafosse,
R. P. de Groote,
A. de Roubin,
T. Eronen,
Z. Ge,
W. Gins,
C. Hornung,
M. Hukkanenc,
A. Illana Sison,
A. Jokinen,
D. Kahl,
B. Kindler,
B. Lommel
, et al. (17 additional authors not shown)
Abstract:
The multinucleon transfer (MNT) reaction approach was successfully employed for the first time to measure the isomeric ratios (IRs) of $^{211}$Po (25/2$^+$) isomer and its (9/2$^+$) ground state at the IGISOL facility using a 945 MeV $^{136}$Xe beam impinged on $^{209}$Bi and $^{\rm nat}$Pb targets. The dominant production of isomers compared to the corresponding ground states was consistently rev…
▽ More
The multinucleon transfer (MNT) reaction approach was successfully employed for the first time to measure the isomeric ratios (IRs) of $^{211}$Po (25/2$^+$) isomer and its (9/2$^+$) ground state at the IGISOL facility using a 945 MeV $^{136}$Xe beam impinged on $^{209}$Bi and $^{\rm nat}$Pb targets. The dominant production of isomers compared to the corresponding ground states was consistently revealed in the $α$-decay spectra. Deduced IR of $^{211}$Po populated through the $^{136}$Xe+$^{\rm nat}$Pb reaction was found to enhance $\approx$1.8-times than observed for $^{136}$Xe+$^{209}$Bi. State-of-the-art Langevin-type model calculations have been utilized to estimate the spin distribution of an MNT residue. The computations qualitatively corroborate with the considerable increase in IRs of $^{211}$Po produced from $^{136}$Xe+$^{\rm nat}$Pb compared to $^{136}$Xe+$^{209}$Bi. Theoretical investigations indicate a weak influence of target spin on IRs. The enhancement of the $^{211}$Po isomer in the $^{136}$Xe+$^{\rm nat}$Pb over $^{136}$Xe+$^{209}$Bi can be attributed to the different proton ($p$)-transfer production routes. Estimations demonstrate an increment in the angular momentum transfer, favorable for isomer production, with increasing projectile energy. Comparative analysis indicates the two entrance channel parameters, projectile mass and $p$-transfer channels, strongly influencing the population of the high-spin isomer of $^{211}$Po (25/2$^+$). This is the first experimental and theoretical investigation on the IRs of nuclei produced via different channels of MNT reactions, with the latter quantitatively underestimating the former by a factor of two.
△ Less
Submitted 15 January, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
URHand: Universal Relightable Hands
Authors:
Zhaoxi Chen,
Gyeongsik Moon,
Kaiwen Guo,
Chen Cao,
Stanislav Pidhorskyi,
Tomas Simon,
Rohan Joshi,
Yuan Dong,
Yichen Xu,
Bernardo Pires,
He Wen,
Lucas Evans,
Bo Peng,
Julia Buffalini,
Autumn Trimble,
Kevyn McPhail,
Melissa Schoeller,
Shoou-I Yu,
Javier Romero,
Michael Zollhöfer,
Yaser Sheikh,
Ziwei Liu,
Shunsuke Saito
Abstract:
Existing photorealistic relightable hand models require extensive identity-specific observations in different views, poses, and illuminations, and face challenges in generalizing to natural illuminations and novel identities. To bridge this gap, we present URHand, the first universal relightable hand model that generalizes across viewpoints, poses, illuminations, and identities. Our model allows f…
▽ More
Existing photorealistic relightable hand models require extensive identity-specific observations in different views, poses, and illuminations, and face challenges in generalizing to natural illuminations and novel identities. To bridge this gap, we present URHand, the first universal relightable hand model that generalizes across viewpoints, poses, illuminations, and identities. Our model allows few-shot personalization using images captured with a mobile phone, and is ready to be photorealistically rendered under novel illuminations. To simplify the personalization process while retaining photorealism, we build a powerful universal relightable prior based on neural relighting from multi-view images of hands captured in a light stage with hundreds of identities. The key challenge is scaling the cross-identity training while maintaining personalized fidelity and sharp details without compromising generalization under natural illuminations. To this end, we propose a spatially varying linear lighting model as the neural renderer that takes physics-inspired shading as input feature. By removing non-linear activations and bias, our specifically designed lighting model explicitly keeps the linearity of light transport. This enables single-stage training from light-stage data while generalizing to real-time rendering under arbitrary continuous illuminations across diverse identities. In addition, we introduce the joint learning of a physically based model and our neural relighting model, which further improves fidelity and generalization. Extensive experiments show that our approach achieves superior performance over existing methods in terms of both quality and generalizability. We also demonstrate quick personalization of URHand from a short phone scan of an unseen identity.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Interactive Multi-Objective Evolutionary Optimization of Software Architectures
Authors:
Aurora Ramírez,
José Raúl Romero,
Sebastián Ventura
Abstract:
While working on a software specification, designers usually need to evaluate different architectural alternatives to be sure that quality criteria are met. Even when these quality aspects could be expressed in terms of multiple software metrics, other qualitative factors cannot be numerically measured, but they are extracted from the engineer's know-how and prior experiences. In fact, detecting n…
▽ More
While working on a software specification, designers usually need to evaluate different architectural alternatives to be sure that quality criteria are met. Even when these quality aspects could be expressed in terms of multiple software metrics, other qualitative factors cannot be numerically measured, but they are extracted from the engineer's know-how and prior experiences. In fact, detecting not only strong but also weak points in the different solutions seems to fit better with the way humans make their decisions. Putting the human in the loop brings new challenges to the search-based software engineering field, especially for those human-centered activities within the early analysis phase. This paper explores how the interactive evolutionary computation can serve as a basis for integrating the human's judgment into the search process. An interactive approach is proposed to discover software architectures, in which both quantitative and qualitative criteria are applied to guide a multi-objective evolutionary algorithm. The obtained feedback is incorporated into the fitness function using architectural preferences allowing the algorithm to discern between promising and poor solutions. Experimentation with real users has revealed that the proposed interaction mechanism can effectively guide the search towards those regions of the search space that are of real interest to the expert.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Authors:
Evonne Ng,
Javier Romero,
Timur Bagautdinov,
Shaojie Bai,
Trevor Darrell,
Angjoo Kanazawa,
Alexander Richard
Abstract:
We present a framework for generating full-bodied photorealistic avatars that gesture according to the conversational dynamics of a dyadic interaction. Given speech audio, we output multiple possibilities of gestural motion for an individual, including face, body, and hands. The key behind our method is in combining the benefits of sample diversity from vector quantization with the high-frequency…
▽ More
We present a framework for generating full-bodied photorealistic avatars that gesture according to the conversational dynamics of a dyadic interaction. Given speech audio, we output multiple possibilities of gestural motion for an individual, including face, body, and hands. The key behind our method is in combining the benefits of sample diversity from vector quantization with the high-frequency details obtained through diffusion to generate more dynamic, expressive motion. We visualize the generated motion using highly photorealistic avatars that can express crucial nuances in gestures (e.g. sneers and smirks). To facilitate this line of research, we introduce a first-of-its-kind multi-view conversational dataset that allows for photorealistic reconstruction. Experiments show our model generates appropriate and diverse gestures, outperforming both diffusion- and VQ-only methods. Furthermore, our perceptual evaluation highlights the importance of photorealism (vs. meshes) in accurately assessing subtle motion details in conversational gestures. Code and dataset available online.
△ Less
Submitted 3 January, 2024;
originally announced January 2024.
-
Plug-and-Play regularized 3D seismic inversion with 2D pre-trained denoisers
Authors:
Nick Luiken,
Juan Romero,
Miguel Corrales,
Matteo Ravasi
Abstract:
Post-stack seismic inversion is a widely used technique to retrieve high-resolution acoustic impedance models from migrated seismic data. Its modelling operator assumes that a migrated seismic data can be generated from the convolution of a source wavelet and the time derivative of the acoustic impedance model. Given the band-limited nature of the seismic wavelet, the convolutional model acts as a…
▽ More
Post-stack seismic inversion is a widely used technique to retrieve high-resolution acoustic impedance models from migrated seismic data. Its modelling operator assumes that a migrated seismic data can be generated from the convolution of a source wavelet and the time derivative of the acoustic impedance model. Given the band-limited nature of the seismic wavelet, the convolutional model acts as a filtering operator on the acoustic impedance model, thereby making the problem of retrieving acoustic impedances from seismic data ambiguous. In order to compensate for missing frequencies, post-stack seismic inversion is often regularized, meaning that prior information about the structure of the subsurface is included in the inversion process. Recently, the Plug-and-Play methodology has gained wide interest in the inverse problem community as a new form of implicit regularization, often outperforming state-of-the-art regularization. Plug-and-Play can be applied to any proximal algorithm by simply replacing the proximal operator of the regularizer with any denoiser of choice. We propose to use Plug-and-Play regularization with a 2D pre-trained, deep denoiser for 2D post-stack seismic inversion. Additionally, we show that a generalization of Plug-and-Play, called Multi-Agent Consensus Equilibrium, can be adopted to solve 3D post-stack inversion whilst leveraging the same 2D pre-trained denoiser used in the 2D case. More precisely, Multi-Agent Consensus Equilibrium combines the results of applying such 2D denoiser in the inline, crossline, and time directions in an optimal manner. We verify the proposed methods on a portion of the SEAM Phase 1 velocity model and the Sleipner field dataset. 1
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
InPTC: Integrated Planning and Tube-Following Control for Prescribed-Time Collision-Free Navigation of Wheeled Mobile Robots
Authors:
Xiaodong Shao,
Bin Zhang,
Hui Zhi,
Jose Guadalupe Romero,
Bowen Fan,
Qinglei Hu,
David Navarro-Alarcon
Abstract:
In this article, we propose a novel approach, called InPTC (Integrated Planning and Tube-Following Control), for prescribed-time collision-free navigation of wheeled mobile robots in a compact convex workspace cluttered with static, sufficiently separated, and convex obstacles. A path planner with prescribed-time convergence is presented based upon Bouligand's tangent cones and time scale transfor…
▽ More
In this article, we propose a novel approach, called InPTC (Integrated Planning and Tube-Following Control), for prescribed-time collision-free navigation of wheeled mobile robots in a compact convex workspace cluttered with static, sufficiently separated, and convex obstacles. A path planner with prescribed-time convergence is presented based upon Bouligand's tangent cones and time scale transformation (TST) techniques, yielding a continuous vector field that can guide the robot from almost all initial positions in the free space to the designated goal at a prescribed time, while avoiding entering the obstacle regions augmented with safety margin. By leveraging barrier functions and TST, we further derive a tube-following controller to achieve robot trajectory tracking within a prescribed time less than the planner's settling time. This controller ensures the robot moves inside a predefined ``safe tube'' around the reference trajectory, where the tube radius is set to be less than the safety margin. Consequently, the robot will reach the goal location within a prescribed time while avoiding collision with any obstacles along the way. The proposed InPTC is implemented on a Mona robot operating in an arena cluttered with obstacles of various shapes. Experimental results demonstrate that InPTC not only generates smooth collision-free reference trajectories that converge to the goal location at the preassigned time of $250\,\rm s$ (i.e., the required task completion time), but also achieves tube-following trajectory tracking with tracking accuracy higher than $0.01\rm m$ after the preassigned time of $150\,\rm s$. This enables the robot to accomplish the navigation task within the required time of $250\,\rm s$.
△ Less
Submitted 27 August, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.