-
Robust space-time multiscale upscaling via multicontinuum homogenization for evolving perforated media
Authors:
Wei Xie,
Viet Ha Hoang,
Yin Yang,
Yunqing Huang
Abstract:
Time-evolving perforated domains arise in many engineering and geoscientific applications, including reactive transport, particle deposition, and structural degradation in porous media. Accurately capturing the macroscopic behavior of such systems poses significant computational challenges due to the dynamic fine-scale geometries. In this paper, we develop a robust and generalizable multiscale mod…
▽ More
Time-evolving perforated domains arise in many engineering and geoscientific applications, including reactive transport, particle deposition, and structural degradation in porous media. Accurately capturing the macroscopic behavior of such systems poses significant computational challenges due to the dynamic fine-scale geometries. In this paper, we develop a robust and generalizable multiscale modeling framework based on multicontinuum homogenization to derive effective macroscopic equations in shrinking domains. The method distinguishes multiple continua according to the physical characteristics (e.g., channel widths), and couples them via space-time local cell problems formulated on representative volume elements. These local problems incorporate temporal derivatives and domain evolution, ensuring consistency with underlying fine-scale dynamics. The resulting upscaled system yields computable macroscopic coefficients and is suitable for large-scale simulations. Several numerical experiments are presented to validate the accuracy, efficiency, and potential applicability of the method to complex time-dependent engineering problems.
△ Less
Submitted 26 June, 2025;
originally announced June 2025.
-
Precision $e^+e^-$ Hemisphere Masses in the Dijet Region with Power Corrections
Authors:
Andre H. Hoang,
Vicent Mateu,
Matthew D. Schwartz,
Iain W. Stewart
Abstract:
We derive high-precision results for the $e^+e^-$ heavy jet mass (HJM) $d σ/d ρ$ and dihemisphere mass (DHM) $d^2σ/(d s_1 d s_2)$ distributions, for $s_1\sim s_2$, in the dijet region. New results include: i) the N$^3$LL resummation for HJM of large logarithms $\ln^n(ρ)$ at small $ρ$ including the exact two-loop non-global hemisphere soft function, the 4-loop cusp anomalous dimension and the 3-loo…
▽ More
We derive high-precision results for the $e^+e^-$ heavy jet mass (HJM) $d σ/d ρ$ and dihemisphere mass (DHM) $d^2σ/(d s_1 d s_2)$ distributions, for $s_1\sim s_2$, in the dijet region. New results include: i) the N$^3$LL resummation for HJM of large logarithms $\ln^n(ρ)$ at small $ρ$ including the exact two-loop non-global hemisphere soft function, the 4-loop cusp anomalous dimension and the 3-loop hard and jet functions, ii) N$^3$LL results for DHM with resummation of logarithms $\ln(s_{1,2}/Q^2)$ when there is no large separation between $s_1$ and $s_2$, iii) profile functions for HJM to give results simultaneously valid in the peak and tail regions, iv) a complete two-dimensional basis of non-perturbative functions which can be used for double differential observables, that are needed for both HJM and DHM in the peak region, and v) an implementation of renormalon subtractions for large-angle soft radiation to ${\cal O}(α_s^3)$ together with a resummation of the additional large $\ln(Qρ/Λ_{QCD})$ logarithms. Here $Q$ is the $e^+e^-$ center-of-mass energy. Our resummation results are combined with known fixed-order ${\cal O}(α_s^3)$ results and we discuss the convergence and remaining perturbative uncertainty in the cross section. We also prove that, at order $1/Q$, the first moment of the HJM distribution involves an additional non-perturbative parameter compared to the power correction that shifts the tail of the spectrum (where $1\gg ρ\gg Λ_{QCD}/Q$). This differs from thrust where a single non-perturbative parameter at order $1/Q$ describes both the first moment and the tail, and it disfavors models of power corrections employing a single non-perturbative parameter, such as the low-scale effective coupling model. In this paper we focus only on the dijet region, not the far-tail distribution for $ρ\gtrsim 0.2$.
△ Less
Submitted 10 June, 2025;
originally announced June 2025.
-
Goodness-of-fit testing for the stationary density of a size-structured PDE
Authors:
Van Ha Hoang,
Phu Thanh Nguyen,
Thanh Mai Pham Ngoc,
Vincent Rivoirard,
Viet Chi Tran
Abstract:
We consider two division models for structured cell populations, where cells can grow, age and divide. These models have been introduced in the literature under the denomination of `mitosis' and `adder' models. In the recent years, there has been an increasing interest in Biology to understand whether the cells divide equally or not, as this can be related to important mechanisms in cellular aging…
▽ More
We consider two division models for structured cell populations, where cells can grow, age and divide. These models have been introduced in the literature under the denomination of `mitosis' and `adder' models. In the recent years, there has been an increasing interest in Biology to understand whether the cells divide equally or not, as this can be related to important mechanisms in cellular aging or recovery. We are therefore interested in testing the null hypothesis $H_0$ where the division of a mother cell results into two daughters of equal size or age, against the alternative hypothesis $H_1$ where the division is asymmetric and ruled by a kernel that is absolutely continuous with respect to the Lebesgue measure. The sample consists of i.i.d. observations of cell sizes and ages drawn from the population, and the division is not directly observed. The hypotheses of the test are reformulated as hypotheses on the stationary size and age distributions of the models, which we assume are also the distributions of the observations. We propose a goodness-of-fit test that we study numerically on simulated data before applying it on real data.
△ Less
Submitted 5 June, 2025;
originally announced June 2025.
-
MOOSE: Pay Attention to Temporal Dynamics for Video Understanding via Optical Flows
Authors:
Hong Nguyen,
Dung Tran,
Hieu Hoang,
Phong Nguyen,
Shrikanth Narayanan
Abstract:
Many motion-centric video analysis tasks, such as atomic actions, detecting atypical motor behavior in individuals with autism, or analyzing articulatory motion in real-time MRI of human speech, require efficient and interpretable temporal modeling. Capturing temporal dynamics is a central challenge in video analysis, often requiring significant computational resources and fine-grained annotations…
▽ More
Many motion-centric video analysis tasks, such as atomic actions, detecting atypical motor behavior in individuals with autism, or analyzing articulatory motion in real-time MRI of human speech, require efficient and interpretable temporal modeling. Capturing temporal dynamics is a central challenge in video analysis, often requiring significant computational resources and fine-grained annotations that are not widely available. This paper presents MOOSE (Motion Flow Over Spatial Space), a novel temporally-centric video encoder explicitly integrating optical flow with spatial embeddings to model temporal information efficiently, inspired by human perception of motion. Unlike prior models, MOOSE takes advantage of rich, widely available pre-trained visual and optical flow encoders instead of training video models from scratch. This significantly reduces computational complexity while enhancing temporal interpretability. Our primary contributions includes (1) proposing a computationally efficient temporally-centric architecture for video understanding (2) demonstrating enhanced interpretability in modeling temporal dynamics; and (3) achieving state-of-the-art performance on diverse benchmarks, including clinical, medical, and standard action recognition datasets, confirming the broad applicability and effectiveness of our approach.
△ Less
Submitted 1 June, 2025;
originally announced June 2025.
-
Learning What to Do and What Not To Do: Offline Imitation from Expert and Undesirable Demonstrations
Authors:
Huy Hoang,
Tien Mai,
Pradeep Varakantham,
Tanvi Verma
Abstract:
Offline imitation learning typically learns from expert and unlabeled demonstrations, yet often overlooks the valuable signal in explicitly undesirable behaviors. In this work, we study offline imitation learning from contrasting behaviors, where the dataset contains both expert and undesirable demonstrations. We propose a novel formulation that optimizes a difference of KL divergences over the st…
▽ More
Offline imitation learning typically learns from expert and unlabeled demonstrations, yet often overlooks the valuable signal in explicitly undesirable behaviors. In this work, we study offline imitation learning from contrasting behaviors, where the dataset contains both expert and undesirable demonstrations. We propose a novel formulation that optimizes a difference of KL divergences over the state-action visitation distributions of expert and undesirable (or bad) data. Although the resulting objective is a DC (Difference-of-Convex) program, we prove that it becomes convex when expert demonstrations outweigh undesirable demonstrations, enabling a practical and stable non-adversarial training objective. Our method avoids adversarial training and handles both positive and negative demonstrations in a unified framework. Extensive experiments on standard offline imitation learning benchmarks demonstrate that our approach consistently outperforms state-of-the-art baselines.
△ Less
Submitted 27 May, 2025;
originally announced May 2025.
-
A Unified Framework for Variable Selection in Model-Based Clustering with Missing Not at Random
Authors:
Binh H. Ho,
Long Nguyen Chi,
TrungTin Nguyen,
Binh T. Nguyen,
Van Ha Hoang,
Christopher Drovandi
Abstract:
Model-based clustering integrated with variable selection is a powerful tool for uncovering latent structures within complex data. However, its effectiveness is often hindered by challenges such as identifying relevant variables that define heterogeneous subgroups and handling data that are missing not at random, a prevalent issue in fields like transcriptomics. While several notable methods have…
▽ More
Model-based clustering integrated with variable selection is a powerful tool for uncovering latent structures within complex data. However, its effectiveness is often hindered by challenges such as identifying relevant variables that define heterogeneous subgroups and handling data that are missing not at random, a prevalent issue in fields like transcriptomics. While several notable methods have been proposed to address these problems, they typically tackle each issue in isolation, thereby limiting their flexibility and adaptability. This paper introduces a unified framework designed to address these challenges simultaneously. Our approach incorporates a data-driven penalty matrix into penalized clustering to enable more flexible variable selection, along with a mechanism that explicitly models the relationship between missingness and latent class membership. We demonstrate that, under certain regularity conditions, the proposed framework achieves both asymptotic consistency and selection consistency, even in the presence of missing data. This unified strategy significantly enhances the capability and efficiency of model-based clustering, advancing methodologies for identifying informative variables that define homogeneous subgroups in the presence of complex missing data patterns. The performance of the framework, including its computational efficiency, is evaluated through simulations and demonstrated using both synthetic and real-world transcriptomic datasets.
△ Less
Submitted 25 May, 2025;
originally announced May 2025.
-
The Linear Collider Facility (LCF) at CERN
Authors:
H. Abramowicz,
E. Adli,
F. Alharthi,
M. Almanza-Soto,
M. M. Altakach,
S. Ampudia Castelazo,
D. Angal-Kalinin,
J. A. Anguiano,
R. B. Appleby,
O. Apsimon,
A. Arbey,
O. Arquero,
D. Attié,
J. L. Avila-Jimenez,
H. Baer,
Y. Bai,
C. Balazs,
P. Bambade,
T. Barklow,
J. Baudot,
P. Bechtle,
T. Behnke,
A. B. Bellerive,
S. Belomestnykh,
Y. Benhammou
, et al. (386 additional authors not shown)
Abstract:
In this paper we outline a proposal for a Linear Collider Facility as the next flagship project for CERN. It offers the opportunity for a timely, cost-effective and staged construction of a new collider that will be able to comprehensively map the Higgs boson's properties, including the Higgs field potential, thanks to a large span in centre-of-mass energies and polarised beams. A comprehensive pr…
▽ More
In this paper we outline a proposal for a Linear Collider Facility as the next flagship project for CERN. It offers the opportunity for a timely, cost-effective and staged construction of a new collider that will be able to comprehensively map the Higgs boson's properties, including the Higgs field potential, thanks to a large span in centre-of-mass energies and polarised beams. A comprehensive programme to study the Higgs boson and its closest relatives with high precision requires data at centre-of-mass energies from the Z pole to at least 1 TeV. It should include measurements of the Higgs boson in both major production mechanisms, ee -> ZH and ee -> vvH, precision measurements of gauge boson interactions as well as of the W boson, Higgs boson and top-quark masses, measurement of the top-quark Yukawa coupling through ee ->ttH, measurement of the Higgs boson self-coupling through HH production, and precision measurements of the electroweak couplings of the top quark. In addition, ee collisions offer discovery potential for new particles complementary to HL-LHC.
△ Less
Submitted 19 June, 2025; v1 submitted 31 March, 2025;
originally announced March 2025.
-
A Linear Collider Vision for the Future of Particle Physics
Authors:
H. Abramowicz,
E. Adli,
F. Alharthi,
M. Almanza-Soto,
M. M. Altakach,
S Ampudia Castelazo,
D. Angal-Kalinin,
R. B. Appleby,
O. Apsimon,
A. Arbey,
O. Arquero,
A. Aryshev,
S. Asai,
D. Attié,
J. L. Avila-Jimenez,
H. Baer,
J. A. Bagger,
Y. Bai,
I. R. Bailey,
C. Balazs,
T Barklow,
J. Baudot,
P. Bechtle,
T. Behnke,
A. B. Bellerive
, et al. (391 additional authors not shown)
Abstract:
In this paper we review the physics opportunities at linear $e^+e^-$ colliders with a special focus on high centre-of-mass energies and beam polarisation, take a fresh look at the various accelerator technologies available or under development and, for the first time, discuss how a facility first equipped with a technology mature today could be upgraded with technologies of tomorrow to reach much…
▽ More
In this paper we review the physics opportunities at linear $e^+e^-$ colliders with a special focus on high centre-of-mass energies and beam polarisation, take a fresh look at the various accelerator technologies available or under development and, for the first time, discuss how a facility first equipped with a technology mature today could be upgraded with technologies of tomorrow to reach much higher energies and/or luminosities. In addition, we will discuss detectors and alternative collider modes, as well as opportunities for beyond-collider experiments and R\&D facilities as part of a linear collider facility (LCF). The material of this paper will support all plans for $e^+e^-$ linear colliders and additional opportunities they offer, independently of technology choice or proposed site, as well as R\&D for advanced accelerator technologies. This joint perspective on the physics goals, early technologies and upgrade strategies has been developed by the LCVision team based on an initial discussion at LCWS2024 in Tokyo and a follow-up at the LCVision Community Event at CERN in January 2025. It heavily builds on decades of achievements of the global linear collider community, in particular in the context of CLIC and ILC.
△ Less
Submitted 31 March, 2025; v1 submitted 25 March, 2025;
originally announced March 2025.
-
Power-fractional distributions and branching processes
Authors:
Gerold Alsmeyer,
Viet Hung Hoang
Abstract:
In branching process theory, linear-fractional distributions are commonly used to model individual reproduction, especially when the goal is to obtain more explicit formulas than those derived under general model assumptions. In this article, we explore a generalization of these distributions, first introduced by Sagitov and Lindo, which offers similar advantages. We refer to these as power-fracti…
▽ More
In branching process theory, linear-fractional distributions are commonly used to model individual reproduction, especially when the goal is to obtain more explicit formulas than those derived under general model assumptions. In this article, we explore a generalization of these distributions, first introduced by Sagitov and Lindo, which offers similar advantages. We refer to these as power-fractional distributions, primarily because, as we demonstrate, they exhibit power-law behavior. Along with a discussion of their additional properties, we present several results related to the Galton-Watson branching process in both constant and randomly varying environments, illustrating these advantages. The use of power-fractional distributions in continuous time, particularly within the framework of Markov branching processes, is also briefly addressed.
△ Less
Submitted 3 April, 2025; v1 submitted 24 March, 2025;
originally announced March 2025.
-
Fine-Grained Complexity of Computing Degree-Constrained Spanning Trees
Authors:
Narek Bojikian,
Alexander Firbas,
Robert Ganian,
Hung P. Hoang,
Krisztina Szilágyi
Abstract:
We investigate the computation of minimum-cost spanning trees satisfying prescribed vertex degree constraints: Given a graph $G$ and a constraint function $D$, we ask for a (minimum-cost) spanning tree $T$ such that for each vertex $v$, $T$ achieves a degree specified by $D(v)$. Specifically, we consider three kinds of constraint functions ordered by their generality -- $D$ may either assign each…
▽ More
We investigate the computation of minimum-cost spanning trees satisfying prescribed vertex degree constraints: Given a graph $G$ and a constraint function $D$, we ask for a (minimum-cost) spanning tree $T$ such that for each vertex $v$, $T$ achieves a degree specified by $D(v)$. Specifically, we consider three kinds of constraint functions ordered by their generality -- $D$ may either assign each vertex to a list of admissible degrees, an upper bound on the degrees, or a specific degree. Using a combination of novel techniques and state-of-the-art machinery, we obtain an almost-complete overview of the fine-grained complexity of these problems taking into account the most classical graph parameters of the input graph $G$. In particular, we present SETH-tight upper and lower bounds for these problems when parameterized by the pathwidth and cutwidth, an ETH-tight algorithm parameterized by the cliquewidth, and a nearly SETH-tight algorithm parameterized by treewidth.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
A hierarchical approach for multicontinuum homogenization in high contrast media
Authors:
Wei Xie,
Viet Ha Hoang,
Yin Yang,
Yunqing Huang
Abstract:
A recently developed upscaling technique, the multicontinuum homogenization method, has gained significant attention for its effectiveness in modeling complex multiscale systems. This method defines multiple continua based on distinct physical properties and solves a series of constrained cell problems to capture localized information for each continuum. However, solving all these cell problems on…
▽ More
A recently developed upscaling technique, the multicontinuum homogenization method, has gained significant attention for its effectiveness in modeling complex multiscale systems. This method defines multiple continua based on distinct physical properties and solves a series of constrained cell problems to capture localized information for each continuum. However, solving all these cell problems on very fine grids at every macroscopic point is computationally expensive, which is a common limitation of most homogenization approaches for non-periodic problems. To address this challenge, we propose a hierarchical multicontinuum homogenization framework. The core idea is to define hierarchical macroscopic points and solve the constrained problems on grids of varying resolutions. We assume that the local solutions can be represented as a combination of a linear interpolation of local solutions from preceding levels and an additional correction term. This combination is substituted into the original constrained problems, and the correction term is resolved using finite element (FE) grids of varying sizes, depending on the level of the macropoint. By normalizing the computational cost of fully resolving the local problem to $\mathcal{O}(1)$, we establish that our approach incurs a cost of $\mathcal{O}(L η^{(1-L)d})$, highlighting substantial computational savings across hierarchical layers $L$, coarsening factor $η$, and spatial dimension $d$. Numerical experiments validate the effectiveness of the proposed method in media with slowly varying properties, underscoring its potential for efficient multiscale modeling.
△ Less
Submitted 9 June, 2025; v1 submitted 3 March, 2025;
originally announced March 2025.
-
A Precise Determination of $α_s$ from the Heavy Jet Mass Distribution
Authors:
Miguel A. Benitez,
Arindam Bhattacharya,
Andre H. Hoang,
Vicent Mateu,
Matthew D. Schwartz,
Iain W. Stewart,
Xiaoyuan Zhang
Abstract:
A global fit for $α_s(m_Z)$ is performed on available $e^+e^-$ data for the heavy jet mass distribution. The state-of-the-art theory prediction includes $\mathcal{O}(α_s^3)$ fixed-order results, N$^3$LL$^\prime$ dijet resummation, N$^2$LL Sudakov shoulder resummation, and a first-principles treatment of power corrections in the dijet region. Theoretical correlations are incorporated through a flat…
▽ More
A global fit for $α_s(m_Z)$ is performed on available $e^+e^-$ data for the heavy jet mass distribution. The state-of-the-art theory prediction includes $\mathcal{O}(α_s^3)$ fixed-order results, N$^3$LL$^\prime$ dijet resummation, N$^2$LL Sudakov shoulder resummation, and a first-principles treatment of power corrections in the dijet region. Theoretical correlations are incorporated through a flat random-scan covariance matrix. The global fit results in $0.1145^{+0.0021}_{-0.0019}$, compatible with similar determinations from thrust and $C$-parameter. Dijet resummation is essential for a robust fit, as it engenders insensitivity to the fit-range lower cutoff; without resummation the fit-range sensitivity is overwhelming. In addition, we find evidence for a negative power correction in the trijet region if and only if Sudakov shoulder resummation is included.
△ Less
Submitted 17 February, 2025;
originally announced February 2025.
-
Minimum maximal matchings in permutahedra
Authors:
Sofia Brenner,
Jiří Fink,
Hung. P. Hoang,
Arturo Merino,
Vincent Pilaud
Abstract:
We prove that the minimal size $M(π_n)$ of a maximal matching in the permutahedron $π_n$ is asymptotically $n!/3$. On the one hand, we obtain a lower bound $M(π_n) \ge n! (n-1) / (3n-2)$ by considering $4$-cycles in the permutahedron. On the other hand, we obtain an asymptotical upper bound $M(π_n) \le n!(1/3+o(1))$ by multiple applications of Hall's theorem (similar to the approach of Forcade (19…
▽ More
We prove that the minimal size $M(π_n)$ of a maximal matching in the permutahedron $π_n$ is asymptotically $n!/3$. On the one hand, we obtain a lower bound $M(π_n) \ge n! (n-1) / (3n-2)$ by considering $4$-cycles in the permutahedron. On the other hand, we obtain an asymptotical upper bound $M(π_n) \le n!(1/3+o(1))$ by multiple applications of Hall's theorem (similar to the approach of Forcade (1973) for the hypercube) and an exact upper bound $M(π_n) \le n!/3$ by an explicit construction. We also derive bounds on minimum maximal matchings in products of permutahedra.
△ Less
Submitted 14 February, 2025;
originally announced February 2025.
-
Humanity's Last Exam
Authors:
Long Phan,
Alice Gatti,
Ziwen Han,
Nathaniel Li,
Josephina Hu,
Hugh Zhang,
Chen Bo Calvin Zhang,
Mohamed Shaaban,
John Ling,
Sean Shi,
Michael Choi,
Anish Agrawal,
Arnav Chopra,
Adam Khoja,
Ryan Kim,
Richard Ren,
Jason Hausenloy,
Oliver Zhang,
Mantas Mazeika,
Dmitry Dodonov,
Tung Nguyen,
Jaeho Lee,
Daron Anderson,
Mikhail Doroshenko,
Alun Cennyth Stokes
, et al. (1084 additional authors not shown)
Abstract:
Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of…
▽ More
Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage. HLE consists of 2,500 questions across dozens of subjects, including mathematics, humanities, and the natural sciences. HLE is developed globally by subject-matter experts and consists of multiple-choice and short-answer questions suitable for automated grading. Each question has a known solution that is unambiguous and easily verifiable, but cannot be quickly answered via internet retrieval. State-of-the-art LLMs demonstrate low accuracy and calibration on HLE, highlighting a significant gap between current LLM capabilities and the expert human frontier on closed-ended academic questions. To inform research and policymaking upon a clear understanding of model capabilities, we publicly release HLE at https://lastexam.ai.
△ Less
Submitted 19 April, 2025; v1 submitted 24 January, 2025;
originally announced January 2025.
-
Interior point methods for an algebraic system involving complementarity equations for geomechanical fractures
Authors:
Trung Hau Hoang
Abstract:
Many applications like subseismic fault modeling, fractured reservoir modeling and interpretation/validation of fault connectivity involve the solution to an elliptic boundary value problem in a background medium perturbed by the presence of cracks that take the form of one or many pieces of surface (with boundary). When the background medium can be considered as homogeneous, boundary integral equ…
▽ More
Many applications like subseismic fault modeling, fractured reservoir modeling and interpretation/validation of fault connectivity involve the solution to an elliptic boundary value problem in a background medium perturbed by the presence of cracks that take the form of one or many pieces of surface (with boundary). When the background medium can be considered as homogeneous, boundary integral equations appear as a method of choice for the numerical solution to fractures problems. With such an approach, the problem is reformulated as a fully non-local equation posed at the surface of cracks. Discretization of boundary integral resulting in the so-called Boundary Element Method (BEM) leads to densely populated matrices due to the full non-locality of the operators under consideration. After the discretization process, geologists are faced with a system of equations that turns out difficult to solve numerically. Many empirical algorithms have been proposed by geologists to solve this system of equations. Unfortunately, none of them is guaranteed to converge in theory (in particular when faults (fractures) intersect each other forming a geometrically highly irregular structure). In practice, none of them appears to be either robust or efficient. We investigate another approach, referred to as interior point methods, for which convergence can be ensured (even if faults are too close). Interior point methods have proved their efficiency in a wide variety of domains, most notably for linear programming. Here, even though we do not have any optimization problem, we can adapt ideas from interior point methods for the numerical resolution of the system considered. The numerical results obtained demonstrate computational efficiency and accuracy, highlighting the robustness and effectiveness of the implemented methods.
△ Less
Submitted 27 December, 2024;
originally announced January 2025.
-
KeyNode-Driven Geometry Coding for Real-World Scanned Human Dynamic Mesh Compression
Authors:
Huong Hoang,
Truong Nguyen,
Pamela Cosman
Abstract:
The compression of real-world scanned 3D human dynamic meshes is an emerging research area, driven by applications such as telepresence, virtual reality, and 3D digital streaming. Unlike synthesized dynamic meshes with fixed topology, scanned dynamic meshes often not only have varying topology across frames but also scan defects such as holes and outliers, increasing the complexity of prediction a…
▽ More
The compression of real-world scanned 3D human dynamic meshes is an emerging research area, driven by applications such as telepresence, virtual reality, and 3D digital streaming. Unlike synthesized dynamic meshes with fixed topology, scanned dynamic meshes often not only have varying topology across frames but also scan defects such as holes and outliers, increasing the complexity of prediction and compression. Additionally, human meshes often combine rigid and non-rigid motions, making accurate prediction and encoding significantly more difficult compared to objects that exhibit purely rigid motion. To address these challenges, we propose a compression method designed for real-world scanned human dynamic meshes, leveraging embedded key nodes. The temporal motion of each vertex is formulated as a distance-weighted combination of transformations from neighboring key nodes, requiring the transmission of solely the key nodes' transformations. To enhance the quality of the KeyNode-driven prediction, we introduce an octree-based residual coding scheme and a Dual-direction prediction mode, which uses I-frames from both directions. Extensive experiments demonstrate that our method achieves significant improvements over the state-of-the-art, with an average bitrate savings of 58.43% across the evaluated sequences, particularly excelling at low bitrates.
△ Less
Submitted 2 July, 2025; v1 submitted 3 January, 2025;
originally announced January 2025.
-
On Determining $α_s(m_Z)$ from Dijets in $e^+e^-$ Thrust
Authors:
Miguel A. Benitez,
Andre H. Hoang,
Vicent Mateu,
Iain W. Stewart,
Gherardo Vita
Abstract:
We update a previous N$^3$LL$^\prime$+${\cal O}(α_s^3)$ determination of the strong coupling from a global fit to thrust data by including newly available perturbative ingredients, upgrading the renormalization scales to include a fully canonical scaling region, and implementing the log resummation in a way which ensures the integrated cross section is unaffected by the the leading $1/Q$ hadroniza…
▽ More
We update a previous N$^3$LL$^\prime$+${\cal O}(α_s^3)$ determination of the strong coupling from a global fit to thrust data by including newly available perturbative ingredients, upgrading the renormalization scales to include a fully canonical scaling region, and implementing the log resummation in a way which ensures the integrated cross section is unaffected by the the leading $1/Q$ hadronization power corrections. Detailed discussions are provided concerning the stability of the results under variations of the fit range and the importance of summing up higher-order logarithmic terms for convergence and stability. We show that high-precision results can be achieved even when carrying out a more conservative fit by restricting the dataset to a region which is more clearly dominated by dijet events. This leads to $α_s(m_Z) = 0.1136 \pm 0.0012$ with $χ^2/{\rm dof}=0.86$, fully compatible with earlier results using a larger fit range. We also demonstrate that a number of additional effects associated to power corrections have a small impact on this fit result, including modifications to the renormalon substraction scheme for dijet power corrections and the inclusion of three-jet power correction models. The fit is also shown to provide very good agreement with data outside the fit range.
△ Less
Submitted 19 December, 2024;
originally announced December 2024.
-
How to avoid order reduction in third-order exponential Runge--Kutta methods for problems with non-commutative operators?
Authors:
Thi Tam Dang,
Trung Hau Hoang
Abstract:
This paper investigates the performance of a subclass of exponential integrators, specifically explicit exponential Runge--Kutta methods. It is well known that third-order methods can suffer from order reduction when applied to linearized problems involving unbounded and non-commuting operators. In this work, we consider a fourth-stage third-order Runge--Kutta method, which successfully achieves t…
▽ More
This paper investigates the performance of a subclass of exponential integrators, specifically explicit exponential Runge--Kutta methods. It is well known that third-order methods can suffer from order reduction when applied to linearized problems involving unbounded and non-commuting operators. In this work, we consider a fourth-stage third-order Runge--Kutta method, which successfully achieves the expected order of accuracy and avoids order reduction, as long as all required order conditions are satisfied. The convergence analysis is carried out under the assumption of higher regularity for the initial data. Numerical experiments are provided to validate the theoretical results.
△ Less
Submitted 24 December, 2024; v1 submitted 16 December, 2024;
originally announced December 2024.
-
The $p$-adic zeta function of a plane curve singularity
Authors:
Huyen Trang Hoang,
Quy Thuong Lê,
Hoang Long Nguyen
Abstract:
Using toric modifications and some compatibility we compute the local $p$-adic zeta function of a plane curve singularity. Thanks to the compatibility, we can work over the analytic change of variables formula for $p$-adic integrals, hence avoid adapting to the algebraic setting and Denef's formula.
Using toric modifications and some compatibility we compute the local $p$-adic zeta function of a plane curve singularity. Thanks to the compatibility, we can work over the analytic change of variables formula for $p$-adic integrals, hence avoid adapting to the algebraic setting and Denef's formula.
△ Less
Submitted 6 December, 2024;
originally announced December 2024.
-
Signotopes with few plus signs
Authors:
Helena Bergold,
Lukas Egeling,
Hung. P. Hoang
Abstract:
Arrangements of pseudohyperplanes are widely studied in computational geometry. A rich subclass of pseudohyerplane arrangements, which has gained more attention in recent years, is the so-called signotopes. Introduced by Manin and Schechtman (1989), the higher Bruhat order is a natural order of $r$-signotopes on $n$ elements, with the signotope corresponding to the cyclic arrangement as the minima…
▽ More
Arrangements of pseudohyperplanes are widely studied in computational geometry. A rich subclass of pseudohyerplane arrangements, which has gained more attention in recent years, is the so-called signotopes. Introduced by Manin and Schechtman (1989), the higher Bruhat order is a natural order of $r$-signotopes on $n$ elements, with the signotope corresponding to the cyclic arrangement as the minimal element. In this paper, we show that the lower (and by symmetry upper) levels of this higher Bruhat order contain the same number of elements for a fixed difference $n-r$. This result implies that given the difference $d=n-r$ and $p$, the number of one-element extensions of the cyclic arrangement of $n$ hyperplanes in $\mathbb{R}^d$ with at most $p$ points on one side of the extending pseudohyperplane does not depend on $n$, as long as $n \geq d + p$.
△ Less
Submitted 22 February, 2025; v1 submitted 28 November, 2024;
originally announced November 2024.
-
Optimization and Characterization of Thermoelectric Properties in Selenium-Doped Bismuth Telluride Ultra Thin Films
Authors:
Kien Trung Nguyen,
Lan Anh Dong,
Hien Thi Dinh,
Thi Huyen Trang Bui,
Son Truong Chu,
Thuat Nguyen-Tran,
Chi Hieu Hoang,
Hung Quoc Nguyen
Abstract:
Thermoelectricity in telluride materials is often improved by replacing telluride with selenium in its crystal. Most work, however, focuses on bulk crystal and leaves the 2D thin films intact. In this paper, we optimize the fabrication of selenium-doped bismuth telluride (Bi$_2$Te$_{3-\rm{x}}$Se$_{\rm{x}}$) thin films using a 3-source thermal co-evaporation. Thermoelectric properties, including th…
▽ More
Thermoelectricity in telluride materials is often improved by replacing telluride with selenium in its crystal. Most work, however, focuses on bulk crystal and leaves the 2D thin films intact. In this paper, we optimize the fabrication of selenium-doped bismuth telluride (Bi$_2$Te$_{3-\rm{x}}$Se$_{\rm{x}}$) thin films using a 3-source thermal co-evaporation. Thermoelectric properties, including the Seebeck coefficient and electrical resistivity, are systematically characterized to evaluate the material's performance for thermoelectric applications near room temperature. The thin films were deposited under carefully controlled conditions, with the evaporation rates of bismuth, tellurium, and selenium precisely monitored to achieve the desired stoichiometry and crystalline phase. Finally, thermoelectricity in Bi$_2$Te$_{3-\rm{x}}$Se$_{\rm{x}}$ at the ultra-thin regime is investigated. We consistently obtain films with thickness near 30 nm with a Seebeck coefficient of 400 $μ$V/K and a power factor of 1 mW/mK$^2$.
△ Less
Submitted 27 October, 2024;
originally announced October 2024.
-
Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?
Authors:
HyoJung Han,
Akiko Eriguchi,
Haoran Xu,
Hieu Hoang,
Marine Carpuat,
Huda Khayrallah
Abstract:
Vocabulary adaptation, which integrates new vocabulary into pre-trained language models, enables expansion to new languages and mitigates token over-fragmentation. However, existing approaches are limited by their reliance on heuristics or external embeddings. We propose VocADT, a novel method for vocabulary adaptation using adapter modules that are trained to learn the optimal linear combination…
▽ More
Vocabulary adaptation, which integrates new vocabulary into pre-trained language models, enables expansion to new languages and mitigates token over-fragmentation. However, existing approaches are limited by their reliance on heuristics or external embeddings. We propose VocADT, a novel method for vocabulary adaptation using adapter modules that are trained to learn the optimal linear combination of existing embeddings while keeping the model's weights fixed. VocADT offers a flexible and scalable solution without depending on external resources or language constraints. Across 11 languages-with diverse scripts, resource availability, and fragmentation-we demonstrate that VocADT outperforms the original Mistral model and other baselines across various multilingual tasks including natural language understanding and machine translation. We find that Latin-script languages and highly fragmented languages benefit the most from vocabulary adaptation. We further fine-tune the adapted model on the generative task of machine translation and find that vocabulary adaptation is still beneficial after fine-tuning and that VocADT is the most effective.
△ Less
Submitted 16 March, 2025; v1 submitted 12 October, 2024;
originally announced October 2024.
-
UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations
Authors:
Huy Hoang,
Tien Mai,
Pradeep Varakantham
Abstract:
We address the problem of offline learning a policy that avoids undesirable demonstrations. Unlike conventional offline imitation learning approaches that aim to imitate expert or near-optimal demonstrations, our setting involves avoiding undesirable behavior (specified using undesirable demonstrations). To tackle this problem, unlike standard imitation learning where the aim is to minimize the di…
▽ More
We address the problem of offline learning a policy that avoids undesirable demonstrations. Unlike conventional offline imitation learning approaches that aim to imitate expert or near-optimal demonstrations, our setting involves avoiding undesirable behavior (specified using undesirable demonstrations). To tackle this problem, unlike standard imitation learning where the aim is to minimize the distance between learning policy and expert demonstrations, we formulate the learning task as maximizing a statistical distance, in the space of state-action stationary distributions, between the learning policy and the undesirable policy. This significantly different approach results in a novel training objective that necessitates a new algorithm to address it. Our algorithm, UNIQ, tackles these challenges by building on the inverse Q-learning framework, framing the learning problem as a cooperative (non-adversarial) task. We then demonstrate how to efficiently leverage unlabeled data for practical training. Our method is evaluated on standard benchmark environments, where it consistently outperforms state-of-the-art baselines. The code implementation can be accessed at: https://github.com/hmhuy0/UNIQ.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale
Authors:
Haoran Xu,
Kenton Murray,
Philipp Koehn,
Hieu Hoang,
Akiko Eriguchi,
Huda Khayrallah
Abstract:
Large language models (LLMs) have achieved remarkable success across various NLP tasks with a focus on English due to English-centric pre-training and limited multilingual data. In this work, we focus on the problem of translation, and while some multilingual LLMs claim to support for hundreds of languages, models often fail to provide high-quality responses for mid- and low-resource languages, le…
▽ More
Large language models (LLMs) have achieved remarkable success across various NLP tasks with a focus on English due to English-centric pre-training and limited multilingual data. In this work, we focus on the problem of translation, and while some multilingual LLMs claim to support for hundreds of languages, models often fail to provide high-quality responses for mid- and low-resource languages, leading to imbalanced performance heavily skewed in favor of high-resource languages. We introduce **X-ALMA**, a model designed to ensure top-tier performance across 50 diverse languages, regardless of their resource levels. X-ALMA surpasses state-of-the-art open-source multilingual LLMs, such as Aya-101 and Aya-23, in every single translation direction on the FLORES-200 and WMT'23 test datasets according to COMET-22. This is achieved by plug-and-play language-specific module architecture to prevent language conflicts during training and a carefully designed training regimen with novel optimization methods to maximize the translation performance. After the final stage of training regimen, our proposed **A**daptive **R**ejection **P**reference **O**ptimization (**ARPO**) surpasses existing preference optimization methods in translation tasks.
△ Less
Submitted 2 March, 2025; v1 submitted 3 October, 2024;
originally announced October 2024.
-
Order Reduction of Exponential Runge--Kutta Methods: Non-Commuting Operators
Authors:
Trung Hau Hoang
Abstract:
Nonlinear parabolic equations are central to numerous applications in science and engineering, posing significant challenges for analytical solutions and necessitating efficient numerical methods. Exponential integrators have recently gained attention for handling stiff differential equations. This paper explores exponential Runge--Kutta methods for solving such equations, focusing on the simplifi…
▽ More
Nonlinear parabolic equations are central to numerous applications in science and engineering, posing significant challenges for analytical solutions and necessitating efficient numerical methods. Exponential integrators have recently gained attention for handling stiff differential equations. This paper explores exponential Runge--Kutta methods for solving such equations, focusing on the simplified form $u^{\prime}(t)+A u(t)=B u(t)$, where $A$ generates an analytic semigroup and $B$ is relatively bounded with respect to $A$. By treating $A$ exactly and $B$ explicitly, we derive error bounds for exponential Runge--Kutta methods up to third order. Our analysis shows that these methods maintain their order under mild regularity conditions on the initial data $u_0$, while also addressing the phenomenon of order reduction in higher-order methods. Through a careful convergence analysis and numerical investigations, this study provides a comprehensive understanding of the applicability and limitations of exponential Runge--Kutta methods in solving linear parabolic equations involving two unbounded and non-commuting operators.
△ Less
Submitted 22 December, 2024; v1 submitted 1 October, 2024;
originally announced October 2024.
-
URIEL+: Enhancing Linguistic Inclusion and Usability in a Typological and Multilingual Knowledge Base
Authors:
Aditya Khan,
Mason Shipton,
David Anugraha,
Kaiyao Duan,
Phuong H. Hoang,
Eric Khiu,
A. Seza Doğruöz,
En-Shiun Annie Lee
Abstract:
URIEL is a knowledge base offering geographical, phylogenetic, and typological vector representations for 7970 languages. It includes distance measures between these vectors for 4005 languages, which are accessible via the lang2vec tool. Despite being frequently cited, URIEL is limited in terms of linguistic inclusion and overall usability. To tackle these challenges, we introduce URIEL+, an enhan…
▽ More
URIEL is a knowledge base offering geographical, phylogenetic, and typological vector representations for 7970 languages. It includes distance measures between these vectors for 4005 languages, which are accessible via the lang2vec tool. Despite being frequently cited, URIEL is limited in terms of linguistic inclusion and overall usability. To tackle these challenges, we introduce URIEL+, an enhanced version of URIEL and lang2vec that addresses these limitations. In addition to expanding typological feature coverage for 2898 languages, URIEL+ improves the user experience with robust, customizable distance calculations to better suit the needs of users. These upgrades also offer competitive performance on downstream tasks and provide distances that better align with linguistic distance studies.
△ Less
Submitted 13 February, 2025; v1 submitted 27 September, 2024;
originally announced September 2024.
-
A novel agent with formal goal-reaching guarantees: an experimental study with a mobile robot
Authors:
Grigory Yaremenko,
Dmitrii Dobriborsci,
Roman Zashchitin,
Ruben Contreras Maestre,
Ngoc Quoc Huy Hoang,
Pavel Osinenko
Abstract:
Reinforcement Learning (RL) has been shown to be effective and convenient for a number of tasks in robotics. However, it requires the exploration of a sufficiently large number of state-action pairs, many of which may be unsafe or unimportant. For instance, online model-free learning can be hazardous and inefficient in the absence of guarantees that a certain set of desired states will be reached…
▽ More
Reinforcement Learning (RL) has been shown to be effective and convenient for a number of tasks in robotics. However, it requires the exploration of a sufficiently large number of state-action pairs, many of which may be unsafe or unimportant. For instance, online model-free learning can be hazardous and inefficient in the absence of guarantees that a certain set of desired states will be reached during an episode. An increasingly common approach to address safety involves the addition of a shielding system that constrains the RL actions to a safe set of actions. In turn, a difficulty for such frameworks is how to effectively couple RL with the shielding system to make sure the exploration is not excessively restricted. This work presents a novel safe model-free RL agent called Critic As Lyapunov Function (CALF) and showcases how CALF can be used to improve upon control baselines in robotics in an efficient and convenient fashion while ensuring guarantees of stable goal reaching. The latter is a crucial part of safety, as seen generally. With CALF all state-action pairs remain explorable and yet reaching of desired goal states is formally guaranteed. Formal analysis is provided that shows the goal stabilization-ensuring properties of CALF and a set of real-world and numerical experiments with a non-holonomic wheeled mobile robot (WMR) TurtleBot3 Burger confirmed the superiority of CALF over such a well-established RL agent as proximal policy optimization (PPO), and a modified version of SARSA in a few-episode setting in terms of attained total cost.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
Three-Loop OPE Wilson Coefficients of Dimension-Four Operators for (Axial-)Vector and (Pseudo-)Scalar Current Correlators
Authors:
Robin Brüser,
André H. Hoang,
Maximilian Stahlhofen
Abstract:
We calculate the three-loop Wilson coefficients of all physically relevant dimension-four operators, i.e. $G_{μν}^a G^{a,μν}$, $m_i\bar q_j q_j$ and $m_i m_j m_k^2$, in the short-distance expansion of the time-ordered product of a pair of gauge-singlet vector, axial-vector, scalar and pseudo-scalar currents. The results are given for a general non-Abelian gauge theory with arbitrary (compact semi-…
▽ More
We calculate the three-loop Wilson coefficients of all physically relevant dimension-four operators, i.e. $G_{μν}^a G^{a,μν}$, $m_i\bar q_j q_j$ and $m_i m_j m_k^2$, in the short-distance expansion of the time-ordered product of a pair of gauge-singlet vector, axial-vector, scalar and pseudo-scalar currents. The results are given for a general non-Abelian gauge theory with arbitrary (compact semi-simple) gauge group and $n_f$ light fermion flavors (quarks) in a common arbitrary representation of the gauge group, which includes QCD as a special case. In particular, we allow for arbitrary flavor contents of each of the currents. For the axial-vector current the included contributions from so-called singlet diagrams are consistent with the one-loop axial anomaly.
△ Less
Submitted 20 December, 2024; v1 submitted 7 August, 2024;
originally announced August 2024.
-
Determining $α_s(m_Z)$ from Thrust with Power Corrections
Authors:
Miguel A. Benitez-Rathgeb,
André H. Hoang,
Vicent Mateu,
Iain W. Stewart,
Gherardo Vita
Abstract:
We update and extend a previous N$^3$LL$^\prime$+${\cal O}(α_s^3)$ strong coupling determination from thrust data. In particular, we carry out a fit with data fully restricted to the dijet region seeking to minimize the potential impact of power corrections that go beyond dijet configurations. In addition, we parametrize deviations from the dijet power correction in order to add an additional sour…
▽ More
We update and extend a previous N$^3$LL$^\prime$+${\cal O}(α_s^3)$ strong coupling determination from thrust data. In particular, we carry out a fit with data fully restricted to the dijet region seeking to minimize the potential impact of power corrections that go beyond dijet configurations. In addition, we parametrize deviations from the dijet power correction in order to add an additional source of uncertainty in the result for $α_s(m_Z)$. We also show that the inclusion of resummation is important to achieve stability with respect to varying the fit region.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Generating all invertible matrices by row operations
Authors:
Petr Gregor,
Hung P. Hoang,
Arturo Merino,
Ondřej Mička
Abstract:
We show that all invertible $n \times n$ matrices over any finite field $\mathbb{F}_q$ can be generated in a Gray code fashion. More specifically, there exists a listing such that (1) each matrix appears exactly once, and (2) two consecutive matrices differ by adding or subtracting one row from a previous or subsequent row, or by multiplying or diving a row by the generator of the multiplicative g…
▽ More
We show that all invertible $n \times n$ matrices over any finite field $\mathbb{F}_q$ can be generated in a Gray code fashion. More specifically, there exists a listing such that (1) each matrix appears exactly once, and (2) two consecutive matrices differ by adding or subtracting one row from a previous or subsequent row, or by multiplying or diving a row by the generator of the multiplicative group of $\mathbb{F}_q$. This even holds if the addition and subtraction of each row is allowed to some specific rows satisfying a certain mild condition. Moreover, we can prescribe the first and the last matrix if $n\ge 3$, or $n=2$ and $q>2$. In other words, the corresponding flip graph on all invertible $n \times n$ matrices over $\mathbb{F}_q$ is Hamilton connected if it is not a cycle. This solves yet another special case of Lovász conjecture on Hamiltonicity of vertex-transitive graphs.
△ Less
Submitted 7 October, 2024; v1 submitted 3 May, 2024;
originally announced May 2024.
-
Parameterized Complexity of Efficient Sortation
Authors:
Robert Ganian,
Hung P. Hoang,
Simon Wietheger
Abstract:
A crucial challenge arising in the design of large-scale logistical networks is to optimize parcel sortation for routing. We study this problem under the recent graph-theoretic formalization of Van Dyk, Klause, Koenemann and Megow (IPCO 2024). The problem asks - given an input digraph D (the fulfillment network) together with a set of commodities represented as source-sink tuples - for a minimum-o…
▽ More
A crucial challenge arising in the design of large-scale logistical networks is to optimize parcel sortation for routing. We study this problem under the recent graph-theoretic formalization of Van Dyk, Klause, Koenemann and Megow (IPCO 2024). The problem asks - given an input digraph D (the fulfillment network) together with a set of commodities represented as source-sink tuples - for a minimum-outdegree subgraph H of the transitive closure of D that contains a source-sink route for each of the commodities. Given the underlying motivation, we study two variants of the problem which differ in whether the routes for the commodities are assumed to be given, or can be chosen arbitrarily.
We perform a thorough parameterized analysis of the complexity of both problems. Our results concentrate on three fundamental parameterizations of the problem: (1) When attempting to parameterize by the target outdegree of H, we show that the problems are paraNP-hard even in highly restricted cases; (2) When parameterizing by the number of commodities, we utilize Ramsey-type arguments and color-coding techniques to obtain parameterized algorithms for both problems; (3) When parameterizing by the structure of D, we establish fixed-parameter tractability for both problems w.r.t. treewidth, maximum degree and the maximum routing length. We combine this with lower bounds which show that omitting any of the three parameters results in paraNP-hardness.
△ Less
Submitted 13 September, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
Adapted Lie splitting method for convection-diffusion problems with singular convective term
Authors:
Thi Tam Dang,
Trung Hau Hoang,
Giandomenico Orlandi
Abstract:
Splitting methods are a widely used numerical scheme for solving convection-diffusion problems. However, they may lose stability in some situations, particularly when applied to convection-diffusion problems in the presence of an unbounded convective term. In this paper, we propose a new splitting method, called the "Adapted Lie splitting method", which successfully overcomes the observed instabil…
▽ More
Splitting methods are a widely used numerical scheme for solving convection-diffusion problems. However, they may lose stability in some situations, particularly when applied to convection-diffusion problems in the presence of an unbounded convective term. In this paper, we propose a new splitting method, called the "Adapted Lie splitting method", which successfully overcomes the observed instability in certain cases. Assuming that the unbounded coefficient belongs to a suitable Lorentz space, we show that the adapted Lie splitting converges to first-order under the analytic semigroup framework. Furthermore, we provide numerical experiments to illustrate our newly proposed splitting approach.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Matching Hadronization and Perturbative Evolution: The Cluster Model in Light of Infrared Shower Cutoff Dependence
Authors:
André H. Hoang,
Oliver L. Jin,
Simon Plätzer,
Daniel Samitz
Abstract:
In the context of Monte Carlo (MC) generators with parton showers that have next-to-leading-logarithmic (NLL) precision, the cutoff $Q_0$ terminating the shower evolution should be viewed as an infrared factorization scale so that parameters or non-perturbative effects of the MC generator may have a field theoretic interpretation with a controllable scheme dependence. This implies that the generat…
▽ More
In the context of Monte Carlo (MC) generators with parton showers that have next-to-leading-logarithmic (NLL) precision, the cutoff $Q_0$ terminating the shower evolution should be viewed as an infrared factorization scale so that parameters or non-perturbative effects of the MC generator may have a field theoretic interpretation with a controllable scheme dependence. This implies that the generator's parton level should be carefully defined within QCD perturbation theory with subleading order precision. Furthermore, it entails that the shower cut $Q_0$ is not treated as one of the generator's tuning parameters, but that the tuning can be carried out reliably for a range of $Q_0$ values and that the hadron level description is $Q_0$-invariant. This in turn imposes non-trival constraints on the behavior of the generator's hadronization model, so that its parameters can adapt accordingly when the $Q_0$ value is changed. We investigate these features using the angular ordered parton shower and the cluster hadronization model implemented in the Herwig~7.2 MC generator focusing in particular on the $e^+e^-$ 2-jettiness distribution, where the shower is known to be NLL precise and where QCD factorization imposes stringent constraints on the hadronization corrections. We show that the Herwig default cluster hadronization model does not exhibit these features or consistency with QCD factorization with a satisfying precision. We design a modification of the cluster hadronization model, where some dynamical parton shower aspects are added that are missing in the default model. For this novel dynamical cluster hadronization model these features and consistency with QCD factorization are realized much more accurately.
△ Less
Submitted 24 June, 2025; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Existence for noncoercive nonlinear elliptic equations with two lower-order terms
Authors:
Thi Tam Dang,
Trung Hau Hoang
Abstract:
This paper considers a class of noncoercive nonlinear elliptic problems with coefficients defined in Marcinkiewicz and Lorentz spaces. We prove the existence of a solution for the corresponding Dirichlet problem and investigate the higher integrability properties of the solution.
This paper considers a class of noncoercive nonlinear elliptic problems with coefficients defined in Marcinkiewicz and Lorentz spaces. We prove the existence of a solution for the corresponding Dirichlet problem and investigate the higher integrability properties of the solution.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Nonparametric density estimation for stationary processes under multiplicative measurement errors
Authors:
Duc Trong Dang,
Van Ha Hoang,
Phuc Hung Thai
Abstract:
This paper focuses on estimating the invariant density function $f_X$ of the strongly mixing stationary process $X_t$ in the multiplicative measurement errors model $Y_t = X_t U_t$, where $U_t$ is also a strongly mixing stationary process. We propose a novel approach to handle non-independent data, typical in real-world scenarios. For instance, data collected from various groups may exhibit interd…
▽ More
This paper focuses on estimating the invariant density function $f_X$ of the strongly mixing stationary process $X_t$ in the multiplicative measurement errors model $Y_t = X_t U_t$, where $U_t$ is also a strongly mixing stationary process. We propose a novel approach to handle non-independent data, typical in real-world scenarios. For instance, data collected from various groups may exhibit interdependencies within each group, resembling data generated from $m$-dependent stationary processes, a subset of stationary processes. This study extends the applicability of the model $Y_t = X_t U_t$ to diverse scientific domains dealing with complex dependent data. The paper outlines our estimation techniques, discusses convergence rates, establishes a lower bound on the minimax risk, and demonstrates the asymptotic normality of the estimator for $f_X$ under smooth error distributions. Through examples and simulations, we showcase the efficacy of our estimator. The paper concludes by providing proofs for the presented theoretical results.v
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning
Authors:
Huy Hoang,
Tien Mai,
Pradeep Varakantham
Abstract:
We focus on offline imitation learning (IL), which aims to mimic an expert's behavior using demonstrations without any interaction with the environment. One of the main challenges in offline IL is the limited support of expert demonstrations, which typically cover only a small fraction of the state-action space. While it may not be feasible to obtain numerous expert demonstrations, it is often pos…
▽ More
We focus on offline imitation learning (IL), which aims to mimic an expert's behavior using demonstrations without any interaction with the environment. One of the main challenges in offline IL is the limited support of expert demonstrations, which typically cover only a small fraction of the state-action space. While it may not be feasible to obtain numerous expert demonstrations, it is often possible to gather a larger set of sub-optimal demonstrations. For example, in treatment optimization problems, there are varying levels of doctor treatments available for different chronic conditions. These range from treatment specialists and experienced general practitioners to less experienced general practitioners. Similarly, when robots are trained to imitate humans in routine tasks, they might learn from individuals with different levels of expertise and efficiency.
In this paper, we propose an offline IL approach that leverages the larger set of sub-optimal demonstrations while effectively mimicking expert trajectories. Existing offline IL methods based on behavior cloning or distribution matching often face issues such as overfitting to the limited set of expert demonstrations or inadvertently imitating sub-optimal trajectories from the larger dataset. Our approach, which is based on inverse soft-Q learning, learns from both expert and sub-optimal demonstrations. It assigns higher importance (through learned weights) to aligning with expert demonstrations and lower importance to aligning with sub-optimal ones. A key contribution of our approach, called SPRINQL, is transforming the offline IL problem into a convex optimization over the space of Q functions. Through comprehensive experimental evaluations, we demonstrate that the SPRINQL algorithm achieves state-of-the-art (SOTA) performance on offline IL benchmarks. Code is available at https://github.com/hmhuy0/SPRINQL.
△ Less
Submitted 10 October, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
The $k$-Opt algorithm for the Traveling Salesman Problem has exponential running time for $k \ge 5$
Authors:
Sophia Heimann,
Hung P. Hoang,
Stefan Hougardy
Abstract:
The $k$-Opt algorithm is a local search algorithm for the Traveling Salesman Problem. Starting with an initial tour, it iteratively replaces at most $k$ edges in the tour with the same number of edges to obtain a better tour. Krentel (FOCS 1989) showed that the Traveling Salesman Problem with the $k$-Opt neighborhood is complete for the class PLS (polynomial time local search) and that the $k$-Opt…
▽ More
The $k$-Opt algorithm is a local search algorithm for the Traveling Salesman Problem. Starting with an initial tour, it iteratively replaces at most $k$ edges in the tour with the same number of edges to obtain a better tour. Krentel (FOCS 1989) showed that the Traveling Salesman Problem with the $k$-Opt neighborhood is complete for the class PLS (polynomial time local search) and that the $k$-Opt algorithm can have exponential running time for any pivot rule. However, his proof requires $k \gg 1000$ and has a substantial gap. We show the two properties above for a much smaller value of $k$, addressing an open question by Monien, Dumrauf, and Tscheuschner (ICALP 2010). In particular, we prove the PLS-completeness for $k \geq 17$ and the exponential running time for $k \geq 5$.
△ Less
Submitted 13 June, 2024; v1 submitted 10 February, 2024;
originally announced February 2024.
-
Changes in heat waves characteristics over Extremadura (SW Spain)
Authors:
F. J. Acero,
M. I. Fernández-Fernández,
V. M. S. Carrasco,
S. Parey,
T. T. Huong Hoang,
D. Dacunha-Castelle,
J. A. García
Abstract:
Heat wave (HW) events are becoming more frequent, and they have important consequences because of the negative effects they can have not only on the human population in health terms, but also on biodiversity and agriculture. This motivated a study of the trends in HW events over Extremadura, a region in the southwest of Spain, with much of its area in summer devoted to the production of irrigated…
▽ More
Heat wave (HW) events are becoming more frequent, and they have important consequences because of the negative effects they can have not only on the human population in health terms, but also on biodiversity and agriculture. This motivated a study of the trends in HW events over Extremadura, a region in the southwest of Spain, with much of its area in summer devoted to the production of irrigated crops such as maize and tomatoes. Heat waves were defined for the study as two consecutive days with temperatures above the 95th percentile of the summer (June-August) maximum temperature (Tmax) time series. Two datasets were used: one consisted of 13 daily temperature records uniformly distributed over the Region, and the other was the SPAIN02 gridded observational dataset, extracting just the points corresponding to Extremadura. The trends studied were in the duration, intensity, and frequency of HW events, and in other parameters such as the mean, low (25th percentile), and high (75th percentile) values. In general terms, the results showed significant positive trends in those parameters over the east, the northwest, and a small area in the south of the Region. In order to study changes in HW characteristics (duration, frequency and intensity) considering different subperiods, a stochastic model was used to generate 1000 time series equivalent to the observed ones. The results showed that there were no significant changes in HW duration in the last 10-year subperiod in comparison with the first. But the results were different for warm events (WE), defined with a lower threshold (the 75th percentile), which are also important for agriculture. For several sites, there were significant changes in WE duration, frequency, and intensity.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Beyond the Narrow-Width Limit for Off-Shell and Boosted Differential Top Quark Decays
Authors:
André H. Hoang,
Simon Plätzer,
Christoph Regner,
Ines Ruffa
Abstract:
The standard approaches for describing top quark production and its decay dynamics are currently mostly either based on the narrow-width (NW) limit or on off-shell fixed-order calculations. In this article we present a factorised approach for boosted top quarks that combines the properties of the NW limit and off-shell computations accounting for the dominant off-shell effects in an expansion in…
▽ More
The standard approaches for describing top quark production and its decay dynamics are currently mostly either based on the narrow-width (NW) limit or on off-shell fixed-order calculations. In this article we present a factorised approach for boosted top quarks that combines the properties of the NW limit and off-shell computations accounting for the dominant off-shell effects in an expansion in $m_t/Q$ with the hard scattering scale $Q$. We discuss the key ideas of our approach and show some preliminary results at tree-level.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning
Authors:
Huy Hoang,
Tien Mai,
Pradeep Varakantham
Abstract:
A popular framework for enforcing safe actions in Reinforcement Learning (RL) is Constrained RL, where trajectory based constraints on expected cost (or other cost measures) are employed to enforce safety and more importantly these constraints are enforced while maximizing expected reward. Most recent approaches for solving Constrained RL convert the trajectory based cost constraint into a surroga…
▽ More
A popular framework for enforcing safe actions in Reinforcement Learning (RL) is Constrained RL, where trajectory based constraints on expected cost (or other cost measures) are employed to enforce safety and more importantly these constraints are enforced while maximizing expected reward. Most recent approaches for solving Constrained RL convert the trajectory based cost constraint into a surrogate problem that can be solved using minor modifications to RL methods. A key drawback with such approaches is an over or underestimation of the cost constraint at each state. Therefore, we provide an approach that does not modify the trajectory based cost constraint and instead imitates ``good'' trajectories and avoids ``bad'' trajectories generated from incrementally improving policies. We employ an oracle that utilizes a reward threshold (which is varied with learning) and the overall cost constraint to label trajectories as ``good'' or ``bad''. A key advantage of our approach is that we are able to work from any starting policy or set of trajectories and improve on it. In an exhaustive set of experiments, we demonstrate that our approach is able to outperform top benchmark approaches for solving Constrained RL problems, with respect to expected cost, CVaR cost, or even unknown cost constraints.
△ Less
Submitted 7 August, 2024; v1 submitted 16 December, 2023;
originally announced December 2023.
-
On-the-Fly Fusion of Large Language Models and Machine Translation
Authors:
Hieu Hoang,
Huda Khayrallah,
Marcin Junczys-Dowmunt
Abstract:
We propose the on-the-fly ensembling of a machine translation model with an LLM, prompted on the same task and input. We perform experiments on 4 language pairs (both directions) with varying data amounts. We find that a slightly weaker-at-translation LLM can improve translations of a NMT model, and ensembling with an LLM can produce better translations than ensembling two stronger MT models. We c…
▽ More
We propose the on-the-fly ensembling of a machine translation model with an LLM, prompted on the same task and input. We perform experiments on 4 language pairs (both directions) with varying data amounts. We find that a slightly weaker-at-translation LLM can improve translations of a NMT model, and ensembling with an LLM can produce better translations than ensembling two stronger MT models. We combine our method with various techniques from LLM prompting, such as in context learning and translation context.
△ Less
Submitted 6 May, 2024; v1 submitted 14 November, 2023;
originally announced November 2023.
-
Raijū: Reinforcement Learning-Guided Post-Exploitation for Automating Security Assessment of Network Systems
Authors:
Van-Hau Pham,
Hien Do Hoang,
Phan Thanh Trung,
Van Dinh Quoc,
Trong-Nghia To,
Phan The Duy
Abstract:
In order to assess the risks of a network system, it is important to investigate the behaviors of attackers after successful exploitation, which is called post-exploitation. Although there are various efficient tools supporting post-exploitation implementation, no application can automate this process. Most of the steps of this process are completed by experts who have profound knowledge of securi…
▽ More
In order to assess the risks of a network system, it is important to investigate the behaviors of attackers after successful exploitation, which is called post-exploitation. Although there are various efficient tools supporting post-exploitation implementation, no application can automate this process. Most of the steps of this process are completed by experts who have profound knowledge of security, known as penetration testers or pen-testers. To this end, our study proposes the Raijū framework, a Reinforcement Learning (RL)-driven automation approach that assists pen-testers in quickly implementing the process of post-exploitation for security-level evaluation in network systems. We implement two RL algorithms, Advantage Actor-Critic (A2C) and Proximal Policy Optimization (PPO), to train specialized agents capable of making intelligent actions, which are Metasploit modules to automatically launch attacks of privileges escalation, gathering hashdump, and lateral movement. By leveraging RL, we aim to empower these agents with the ability to autonomously select and execute actions that can exploit vulnerabilities in target systems. This approach allows us to automate certain aspects of the penetration testing workflow, making it more efficient and responsive to emerging threats and vulnerabilities. The experiments are performed in four real environments with agents trained in thousands of episodes. The agents automatically select actions and launch attacks on the environments and achieve over 84\% of successful attacks with under 55 attack steps given. Moreover, the A2C algorithm has proved extremely effective in the selection of proper actions for automation of post-exploitation.
△ Less
Submitted 27 September, 2023;
originally announced September 2023.
-
On the Effectiveness of Adversarial Samples against Ensemble Learning-based Windows PE Malware Detectors
Authors:
Trong-Nghia To,
Danh Le Kim,
Do Thi Thu Hien,
Nghi Hoang Khoa,
Hien Do Hoang,
Phan The Duy,
Van-Hau Pham
Abstract:
Recently, there has been a growing focus and interest in applying machine learning (ML) to the field of cybersecurity, particularly in malware detection and prevention. Several research works on malware analysis have been proposed, offering promising results for both academic and practical applications. In these works, the use of Generative Adversarial Networks (GANs) or Reinforcement Learning (RL…
▽ More
Recently, there has been a growing focus and interest in applying machine learning (ML) to the field of cybersecurity, particularly in malware detection and prevention. Several research works on malware analysis have been proposed, offering promising results for both academic and practical applications. In these works, the use of Generative Adversarial Networks (GANs) or Reinforcement Learning (RL) can aid malware creators in crafting metamorphic malware that evades antivirus software. In this study, we propose a mutation system to counteract ensemble learning-based detectors by combining GANs and an RL model, overcoming the limitations of the MalGAN model. Our proposed FeaGAN model is built based on MalGAN by incorporating an RL model called the Deep Q-network anti-malware Engines Attacking Framework (DQEAF). The RL model addresses three key challenges in performing adversarial attacks on Windows Portable Executable malware, including format preservation, executability preservation, and maliciousness preservation. In the FeaGAN model, ensemble learning is utilized to enhance the malware detector's evasion ability, with the generated adversarial patterns. The experimental results demonstrate that 100\% of the selected mutant samples preserve the format of executable files, while certain successes in both executability preservation and maliciousness preservation are achieved, reaching a stable success rate.
△ Less
Submitted 24 September, 2023;
originally announced September 2023.
-
Robust Approximation Algorithms for Non-monotone $k$-Submodular Maximization under a Knapsack Constraint
Authors:
Dung T. K. Ha,
Canh V. Pham,
Tan D. Tran,
Huan X. Hoang
Abstract:
The problem of non-monotone $k$-submodular maximization under a knapsack constraint ($\kSMK$) over the ground set size $n$ has been raised in many applications in machine learning, such as data summarization, information propagation, etc. However, existing algorithms for the problem are facing questioning of how to overcome the non-monotone case and how to fast return a good solution in case of th…
▽ More
The problem of non-monotone $k$-submodular maximization under a knapsack constraint ($\kSMK$) over the ground set size $n$ has been raised in many applications in machine learning, such as data summarization, information propagation, etc. However, existing algorithms for the problem are facing questioning of how to overcome the non-monotone case and how to fast return a good solution in case of the big size of data. This paper introduces two deterministic approximation algorithms for the problem that competitively improve the query complexity of existing algorithms.
Our first algorithm, $\LAA$, returns an approximation ratio of $1/19$ within $O(nk)$ query complexity. The second one, $\RLA$, improves the approximation ratio to $1/5-ε$ in $O(nk)$ queries, where $ε$ is an input parameter.
Our algorithms are the first ones that provide constant approximation ratios within only $O(nk)$ query complexity for the non-monotone objective. They, therefore, need fewer the number of queries than state-of-the-the-art ones by a factor of $Ω(\log n)$.
Besides the theoretical analysis, we have evaluated our proposed ones with several experiments in some instances: Influence Maximization and Sensor Placement for the problem. The results confirm that our algorithms ensure theoretical quality as the cutting-edge techniques and significantly reduce the number of queries.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Top Quark Mass Calibration for Monte Carlo Event Generators -- An Update
Authors:
Bahman Dehnadi,
André H. Hoang,
Oliver L. Jin,
Vicent Mateu
Abstract:
We generalize and update our former top quark mass calibration framework for Monte Carlo (MC) event generators based on the $e^+e^-$ hadron-level 2-jettiness $τ_2$ distribution in the resonance region for boosted $t\bar t$ production, that was used to relate the PYTHIA 8.205 top mass parameter $m_t^{\rm MC}$ to the MSR mass $m_t^{\rm MSR}(R)$ and the pole mass $m_t^{\rm pole}$. The current most pr…
▽ More
We generalize and update our former top quark mass calibration framework for Monte Carlo (MC) event generators based on the $e^+e^-$ hadron-level 2-jettiness $τ_2$ distribution in the resonance region for boosted $t\bar t$ production, that was used to relate the PYTHIA 8.205 top mass parameter $m_t^{\rm MC}$ to the MSR mass $m_t^{\rm MSR}(R)$ and the pole mass $m_t^{\rm pole}$. The current most precise direct top mass measurements specifically determine $m_t^{\rm MC}$. The updated framework includes the addition of the shape variables sum of jet masses $τ_s$ and modified jet mass $τ_m$, and the treatment of two more gap subtraction schemes to remove the ${\cal O}(Λ_{\rm QCD})$ renormalon related to large-angle soft radiation. These generalizations entail implementing a more versatile shape-function fit procedure and accounting for a certain type of $(m_t/Q)^2$ power corrections to achieve gap-scheme and observable independent results. The theoretical description employs boosted heavy-quark effective theory (bHQET) at next-to-next-to-logarithmic order (N$^2$LL), matched to soft-collinear effective theory (SCET) at N$^2$LL and full QCD at next-to-leading order (NLO), and includes the dominant top width effects. Furthermore, the software framework has been modernized to use standard file and event record formats. We update the top mass calibration results by applying the new framework to PYTHIA 8.205, HERWIG 7.2 and SHERPA 2.2.11. Even though the hadron-level resonance positions produced by the three generators differ significantly for the same top mass parameter $m_t^{\rm MC}$ value, the calibration shows that these differences arise from the hadronization modeling. Indeed, we find that $m_t^{\rm MC}$ agrees with $m_t^{\rm MSR}(1\,\mbox{GeV})$ within $200$ MeV for the three generators and differs from the pole mass by $350$ to $600$ MeV.
△ Less
Submitted 9 December, 2023; v1 submitted 1 September, 2023;
originally announced September 2023.
-
Bayesian inversion for Electrical Impedance Tomography by sparse interpolation
Authors:
Quang Huy Pham,
Viet Ha Hoang
Abstract:
We study the Electrical Impedance Tomography Bayesian inverse problem for recovering the conductivity given noisy measurements of the voltage on some boundary surface electrodes. The uncertain conductivity depends linearly on a countable number of uniformly distributed random parameters in a compact interval, with the coefficient functions in the linear expansion decaying at an algebraic rate. We…
▽ More
We study the Electrical Impedance Tomography Bayesian inverse problem for recovering the conductivity given noisy measurements of the voltage on some boundary surface electrodes. The uncertain conductivity depends linearly on a countable number of uniformly distributed random parameters in a compact interval, with the coefficient functions in the linear expansion decaying at an algebraic rate. We analyze the surrogate Markov Chain Monte Carlo (MCMC) approach for sampling the posterior probability measure, where the multivariate sparse adaptive interpolation, with interpolating points chosen according to a lower index set, is used for approximating the forward map. The forward equation is approximated once before running the MCMC for all the realizations, using interpolation on the finite element (FE) approximation at the parametric interpolating points. When evaluation of the solution is needed for a realization, we only need to compute a polynomial, thus cutting drastically the computation time. We contribute a rigorous error estimate for the MCMC convergence. In particular, we show that there is a nested sequence of interpolating lower index sets for which we can derive an interpolation error estimate in terms of the cardinality of these sets, uniformly for all the parameter realizations. An explicit convergence rate for the MCMC sampling of the posterior expectation of the conductivity is rigorously derived, in terms of the interpolating point number, the accuracy of the FE approximation of the forward equation, and the MCMC sample number. We perform numerical experiments using an adaptive greedy approach to construct the sets of interpolation points. We show the benefits of this approach over the simple MCMC where the forward equation is repeatedly solved for all the samples and the non-adaptive surrogate MCMC with an isotropic index set treating all the random parameters equally.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
Correlation between Macroscopic and Microscopic Relaxation Dynamics of Water: Evidence for Two Liquid Forms
Authors:
Nguyen Q. Vinh,
Luan C. Doan,
Ngoc L. H. Hoang,
Jiarong R. Cui,
Ben Sindle
Abstract:
Water is vital for life, and without it biomolecules and cells cannot maintain their structures and functions. The remarkable properties of water originate from its ability to form hydrogen-bonding networks and dynamics, which the connectivity constantly alters because of the orientation rotation of individual water molecules. Experimental investigation of the dynamics of water, however, has prove…
▽ More
Water is vital for life, and without it biomolecules and cells cannot maintain their structures and functions. The remarkable properties of water originate from its ability to form hydrogen-bonding networks and dynamics, which the connectivity constantly alters because of the orientation rotation of individual water molecules. Experimental investigation of the dynamics of water, however, has proven challenging due to the strong absorption of water at terahertz frequencies. In response, by employing a high-precision terahertz spectrometer, we have measured and characterized the terahertz dielectric response of water from supercooled liquid to near the boiling point to explore the motions. The response reveals dynamic relaxation processes corresponding to the collective orientation, single-molecule rotation, and structural rearrangements resulting from breaking and reforming hydrogen bonds in water. We have observed the direct relationship between the macroscopic and microscopic relaxation dynamics of water, and the results have provided evidence of two liquid forms in water with different transition temperatures and thermal activation energies. The results reported here thus provide an unprecedented opportunity to directly test microscopic computational models of water dynamics.
△ Less
Submitted 31 May, 2023;
originally announced May 2023.
-
Mathematical Aspects of the Asymptotic Expansion in Contour Improved Perturbation Theory for Hadronic Tau Decays
Authors:
Néstor G. Gracia,
André H. Hoang,
Vicent Mateu
Abstract:
Recently, it was demonstrated that the discrepancy between the fixed-order (FOPT) and contour-improved (CIPT) perturbative expansions for $τ$-lepton decay hadronic spectral function moments, which had been affecting the precision of $α_s$ determinations for many years, is related to the CIPT expansion being inconsistent with the standard formulation of the operator product expansion (OPE). Even th…
▽ More
Recently, it was demonstrated that the discrepancy between the fixed-order (FOPT) and contour-improved (CIPT) perturbative expansions for $τ$-lepton decay hadronic spectral function moments, which had been affecting the precision of $α_s$ determinations for many years, is related to the CIPT expansion being inconsistent with the standard formulation of the operator product expansion (OPE). Even though the problem can be alleviated phenomenologically for the most part by employing a renormalon-free scheme for the gluon-condensate matrix element, the principal inconsistency of CIPT remains. The CIPT expansion is special because it is not a power expansion, but represents an asymptotic expansion in a sequence of functions of the strong coupling. In this article we provide a closer look at the mathematical aspects of the asymptotic sequence of the functions the CIPT method is based on, and we expose the origin of the CIPT inconsistency as well as the reasons for its apparent good convergence at low orders. Our results are of general interest, and may in particular provide a useful tool to check for the consistency of expansion methods that are similar to CIPT.
△ Less
Submitted 25 August, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Advanced Mid-Infrared Plasmonic Waveguides For On-Chip Integrated Photonics
Authors:
Mauro David,
Davide Disnan,
Elena Arigliani,
Anna Lardschneider,
Georg Marschick,
Hanh T. Hoang,
Hermann Detz,
Bernhard Lendl,
Ulrich Schmid,
Gottfried Strasser,
Borislav Hinkov
Abstract:
Long-wave infrared (LWIR, 8-14 um) photonics is a rapidly growing research field within the mid-IR with applications in molecular spectroscopy and optical free-space communication. LWIR-applications are often addressed using rather bulky tabletop-sized free-space optical systems, preventing advanced photonic applications such as rapid-time-scale experiments. Here, device miniaturization into photo…
▽ More
Long-wave infrared (LWIR, 8-14 um) photonics is a rapidly growing research field within the mid-IR with applications in molecular spectroscopy and optical free-space communication. LWIR-applications are often addressed using rather bulky tabletop-sized free-space optical systems, preventing advanced photonic applications such as rapid-time-scale experiments. Here, device miniaturization into photonic integrated circuits (PICs) with maintained optical capabilities is key to revolutionize mid-IR photonics. Sub-wavelength mode confinement in plasmonic structures enabled such miniaturization approaches in the visible-to-near-IR spectral range. However, adopting plasmonics for the LWIR needs suitable low-loss and -dispersion materials with compatible integration strategies to existing mid-IR technology. In this work we further unlock the field of LWIR/mid-IR PICs, by combining photolithographic patterning of organic polymers with dielectric-loaded surface plasmon polariton (DLSPP) waveguides. In particular, polyethylene shows favorable optical properties, including low refractive index and broad transparency between ~2-200 um. We investigate the whole value chain, including design, fabrication, and characterization of polyethylene-based DLSPP waveguides and demonstrate their first-time plasmonic operation and mode guiding capabilities along s-bend structures. Low bending losses of ~1.3 dB and straight-section propagation lengths of ~1 mm, pave the way for unprecedented, complex on-chip mid-IR photonic devices. Moreover, DLSPPs allow full control of the mode parameters (propagation length and guiding capabilities) for precisely addressing advanced sensing and telecommunication applications with chip-scale devices.
△ Less
Submitted 5 May, 2023;
originally announced May 2023.
-
Timescales of Chaos in the Inner Solar System: Lyapunov Spectrum and Quasi-integrals of Motion
Authors:
Federico Mogavero,
Nam H. Hoang,
Jacques Laskar
Abstract:
Numerical integrations of the Solar System reveal a remarkable stability of the orbits of the inner planets over billions of years, in spite of their chaotic variations characterized by a Lyapunov time of only 5 million years and the lack of integrals of motion able to constrain their dynamics. To open a window on such long-term behavior, we compute the entire Lyapunov spectrum of a forced secular…
▽ More
Numerical integrations of the Solar System reveal a remarkable stability of the orbits of the inner planets over billions of years, in spite of their chaotic variations characterized by a Lyapunov time of only 5 million years and the lack of integrals of motion able to constrain their dynamics. To open a window on such long-term behavior, we compute the entire Lyapunov spectrum of a forced secular model of the inner planets. We uncover a hierarchy of characteristic exponents that spans two orders of magnitude, manifesting a slow-fast dynamics with a broad separation of timescales. A systematic analysis of the Fourier harmonics of the Hamiltonian, based on computer algebra, reveals three symmetries that characterize the strongest resonances responsible for the orbital chaos. These symmetries are broken only by weak resonances, leading to the existence of quasi-integrals of motion that are shown to relate to the smallest Lyapunov exponents. A principal component analysis of the orbital solutions independently confirms that the quasi-integrals are among the slowest degrees of freedom of the dynamics. Strong evidence emerges that they effectively constrain the chaotic diffusion of the orbits, playing a crucial role in the statistical stability over the Solar System lifetime.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.