-
Dynamic real-time multi-UAV cooperative mission planning method under multiple constraints
Authors:
Chenglou Liu,
Yufeng Lu,
Fangfang Xie,
Tingwei Ji,
Yao Zheng
Abstract:
As UAV popularity soars, so does the mission planning associated with it. The classical approaches suffer from the triple problems of decoupled of task assignment and path planning, poor real-time performance and limited adaptability. Aiming at these challenges, this paper proposes a dynamic real-time multi-UAV collaborative mission planning algorithm based on Dubins paths under a distributed form…
▽ More
As UAV popularity soars, so does the mission planning associated with it. The classical approaches suffer from the triple problems of decoupled of task assignment and path planning, poor real-time performance and limited adaptability. Aiming at these challenges, this paper proposes a dynamic real-time multi-UAV collaborative mission planning algorithm based on Dubins paths under a distributed formation structure. Dubins path with multiple advantages bridges the gap between task assignment and path planning, leading to a coupled solution for mission planning. Then, a series of acceleration techniques, task clustering preprocessing, highly efficient distance cost functions, low-complexity and less iterative task allocation strategies, are employed to guarantee the real-time performance of the algorithms. To cope with different emergencies and their simultaneous extremes, real-time planning of emerging tasks and mission replanning due to the reduction of available UAVs are appropriately handled. Finally, the developed algorithm is comprehensively exemplified and studied through simulations, highlighting that the proposed method only sacrifices 9.57% of the path length, while achieving a speed improvement of 4-5 orders of magnitude over the simulated annealing method, with a single mission planning of about 0.0003s.
△ Less
Submitted 2 June, 2025;
originally announced June 2025.
-
Exploring the keV-scale physics potential of CUORE
Authors:
CUORE Collaboration,
D. Q. Adams,
C. Alduino,
K. Alfonso,
A. Armatol,
F. T. Avignone III,
O. Azzolini,
G. Bari,
F. Bellini,
G. Benato,
M. Beretta,
M. Biassoni,
A. Branca,
C. Brofferio,
C. Bucci,
J. Camilleri,
A. Caminata,
A. Campani,
J. Cao,
C. Capelli,
S. Capelli,
L. Cappelli,
L. Cardani,
P. Carniti,
N. Casali
, et al. (98 additional authors not shown)
Abstract:
We present the analysis techniques developed to explore the keV-scale energy region of the CUORE experiment, based on more than 2 tonne yr of data collected over 5 years. By prioritizing a stricter selection over a larger exposure, we are able to optimize data selection for thresholds at 10 keV and 3 keV with 691 kg yr and 11 kg yr of data, respectively. We study how the performance varies among t…
▽ More
We present the analysis techniques developed to explore the keV-scale energy region of the CUORE experiment, based on more than 2 tonne yr of data collected over 5 years. By prioritizing a stricter selection over a larger exposure, we are able to optimize data selection for thresholds at 10 keV and 3 keV with 691 kg yr and 11 kg yr of data, respectively. We study how the performance varies among the 988-detector array with different detector characteristics and data taking conditions. We achieve an average baseline resolution of 2.54 $\pm$ 0.14 keV FWHM and 1.18 $\pm$ 0.02 keV FWHM for the data selection at 10 keV and 3 keV, respectively. The analysis methods employed reduce the overall background by about an order of magnitude, reaching 2.06 $\pm$ 0.05 counts/(keV kg days) and 16 $\pm$ 2 counts/(keV kg days) at the thresholds of 10 keV and 3 keV. We evaluate for the first time the near-threshold reconstruction efficiencies of the CUORE experiment, and find these to be 26 $\pm$ 4 \% and 50 $\pm$ 2 \% at 3 keV and 10 keV, respectively. This analysis provides crucial insights into rare decay studies, new physics searches, and keV-scale background modeling with CUORE. We demonstrate that tonne-scale cryogenic calorimeters can operate across a wide energy range, from keV to MeV, establishing their scalability as versatile detectors for rare event and dark matter physics. These findings also inform the optimization of future large mass cryogenic calorimeters to enhance the sensitivity to low-energy phenomena.
△ Less
Submitted 29 May, 2025;
originally announced May 2025.
-
MWA and VLA Observations of Diffuse Radio Lobes in M 87
Authors:
Linhui Wu,
Fu-Guo Xie,
Qian Zheng,
Quan Guo,
Huanyuan Shan,
Dan Hu,
Stefan W. Duchesne,
Nicholas Seymour,
Jingying Wang,
Junhua Gu,
Qingwen Wu,
Zhenghao Zhu,
Melanie Johnston-Hollitt,
Chris Riseley,
Xu-Liang Fan
Abstract:
This study investigates the projected, quasi-symmetric $\sim\rm46\,kpc$-scale diffuse radio lobes surrounding the giant elliptical galaxy M\,87, utilizing well-sampled wideband ($\rm 60\,MHz-10.55\,GHz$) observations from MWA and VLA, supplemented by data from LOFAR and Effelsberg. The observed structures feature sharp edges and filaments, with nearly uniform and moderately steep spectral indices…
▽ More
This study investigates the projected, quasi-symmetric $\sim\rm46\,kpc$-scale diffuse radio lobes surrounding the giant elliptical galaxy M\,87, utilizing well-sampled wideband ($\rm 60\,MHz-10.55\,GHz$) observations from MWA and VLA, supplemented by data from LOFAR and Effelsberg. The observed structures feature sharp edges and filaments, with nearly uniform and moderately steep spectral indices ($α$, mostly within $-1.2\leqα\leq-0.8$), indicating turbulence. Well-sampled radio spectra for the lobes' diffuse region are derived using the continuous injection (CI) model (with $α_{\rm inj}\simeq-0.86$ and $ν_{\rm b}\simeq1.72\rm\,GHz$), and for its three localized regions using the impulsive injection model (e.g., JP model). From energy equipartition analysis, we estimate the typical magnetic field strength in the lobes' diffuse region to be $B_{\rm eq}\simeq10\,μ\rm G$. The age of the lobes is estimated as $\sim30-50\,\rm~Myr$, based on lifetimes derived from the CI and JP models and sound crossing time. Outflow powers of $\sim(0.2-2)\times10^{44}\,\rm erg\,s^{-1}$ for the lobes' diffuse components and $\sim(1-11)\times10^{44}\,\rm erg\,s^{-1}$ for the whole source are calculated. With this power assessment, we conclude that the galactic stellar wind has a negligible effect, the active galactic nucleus (AGN)-driven jet can provide the necessary energy for the whole system. Furthermore, we argue that while the wind driven by current AGN activity is unlikely to power the lobes' diffuse components, an average enhancement of AGN activity by a factor of $\sim 10^2$ over the past $\sim 30-50$ Myr remains plausible.
△ Less
Submitted 27 May, 2025;
originally announced May 2025.
-
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
Authors:
Fei Xie,
Jiahao Nie,
Yujin Tang,
Wenkang Zhang,
Hongshen Zhao
Abstract:
Recent State Space Models (SSM), especially Mamba, have demonstrated impressive performance in visual modeling and possess superior model efficiency. However, the application of Mamba to visual tasks suffers inferior performance due to three main constraints existing in the sequential model: 1) Casual computing is incapable of accessing global context; 2) Long-range forgetting when computing the c…
▽ More
Recent State Space Models (SSM), especially Mamba, have demonstrated impressive performance in visual modeling and possess superior model efficiency. However, the application of Mamba to visual tasks suffers inferior performance due to three main constraints existing in the sequential model: 1) Casual computing is incapable of accessing global context; 2) Long-range forgetting when computing the current hidden states; 3) Weak spatial structural modeling due to the transformed sequential input. To address these issues, we investigate a simple yet powerful vision task Adaptor for Mamba models, which consists of two functional modules: Adaptor-T and Adaptor-S. When solving the hidden states for SSM, we apply a lightweight prediction module Adaptor-T to select a set of learnable locations as memory augmentations to ease long-range forgetting issues. Moreover, we leverage Adapator-S, composed of multi-scale dilated convolutional kernels, to enhance the spatial modeling and introduce the image inductive bias into the feature output. Both modules can enlarge the context modeling in casual computing, as the output is enhanced by the inaccessible features. We explore three usages of Mamba-Adaptor: A general visual backbone for various vision tasks; A booster module to raise the performance of pretrained backbones; A highly efficient fine-tuning module that adapts the base model for transfer learning tasks. Extensive experiments verify the effectiveness of Mamba-Adaptor in three settings. Notably, our Mamba-Adaptor achieves state-of the-art performance on the ImageNet and COCO benchmarks.
△ Less
Submitted 19 May, 2025;
originally announced May 2025.
-
First-ever detection of microseismic activity with a tonne-scale cryogenic experiment
Authors:
D. Q. Adams,
C. Alduino,
K. Alfonso,
A. Armatol,
F. T. Avignone,
O. Azzolini,
G. Bari,
F. Bellini,
G. Benato,
M. Beretta,
M. Biassoni,
A. Branca,
C. Brofferio,
C. Bucci,
J. Camilleri,
A. Caminata,
A. Campani,
J. Cao,
C. Capelli,
S. Capelli,
L. Cappelli,
L. Cardani,
P. Carniti,
N. Casali,
E. Celi
, et al. (105 additional authors not shown)
Abstract:
Vibrations from experimental setups and the environment are a persistent source of noise for low-temperature calorimeters searching for rare events, including neutrinoless double beta ($0νββ$) decay or dark matter interactions. Such noise can significantly limit experimental sensitivity to the physics case under investigation. Here we report the first detection of marine microseismic vibrations us…
▽ More
Vibrations from experimental setups and the environment are a persistent source of noise for low-temperature calorimeters searching for rare events, including neutrinoless double beta ($0νββ$) decay or dark matter interactions. Such noise can significantly limit experimental sensitivity to the physics case under investigation. Here we report the first detection of marine microseismic vibrations using mK-scale calorimeters. This study employs a multi-device analysis correlating data from CUORE, the leading experiment in the search for $0νββ$ decay with mK-scale calorimeters and the Copernicus Earth Observation program, revealing the seasonal impact of Mediterranean Sea activity on CUORE's energy thresholds, resolution, and sensitivity over four years. The detection of marine microseisms underscores the need to address faint environmental noise in ultra-sensitive experiments. Understanding how such noise couples to the detector and developing mitigation strategies is essential for next-generation experiments. We demonstrate one such strategy: a noise decorrelation algorithm implemented in CUORE using auxiliary sensors, which reduces vibrational noise and improves detector performance. Enhancing sensitivity to $0νββ$ decay and to rare events with low-energy signatures requires identifying unresolved noise sources, advancing noise reduction methods, and improving vibration suppression systems, all of which inform the design of next-generation rare event experiments.
△ Less
Submitted 13 May, 2025;
originally announced May 2025.
-
Enhancing Accuracy in Differentially Private Distributed Optimization Through Sensitivity Reduction
Authors:
Furan Xie,
Bing Liu,
Li Chai
Abstract:
In this paper, we investigate the problem of differentially private distributed optimization. Recognizing that lower sensitivity leads to higher accuracy, we analyze the key factors influencing the sensitivity of differentially private distributed algorithms. Building on these insights, we propose a novel differentially private distributed algorithm that enhances optimization accuracy by reducing…
▽ More
In this paper, we investigate the problem of differentially private distributed optimization. Recognizing that lower sensitivity leads to higher accuracy, we analyze the key factors influencing the sensitivity of differentially private distributed algorithms. Building on these insights, we propose a novel differentially private distributed algorithm that enhances optimization accuracy by reducing sensitivity. To ensure practical applicability, we derive a closed-form expression for the noise parameter as a function of the privacy budget. Furthermore, we rigorously prove that the proposed algorithm can achieve arbitrarily rigorous $ε$-differential privacy, establish its convergence in the mean square sense, and provide an upper bound on its optimization accuracy. Finally, extensive comparisons with various privacy-preserving methods validate the effectiveness of our algorithm.
△ Less
Submitted 12 May, 2025;
originally announced May 2025.
-
High optical to X-ray polarization ratio reveals Compton scattering in BL Lacertae's jet
Authors:
Ivan Agudo,
Ioannis Liodakis,
Jorge Otero-Santos,
Riccardo Middei,
Alan Marscher,
Svetlana Jorstad,
Haocheng Zhang,
Hui Li,
Laura Di Gesu,
Roger W. Romani,
Dawoon E. Kim,
Francesco Fenu,
Herman L. Marshall,
Luigi Pacciani,
Juan Escudero Pedrosa,
Francisco Jose Aceituno,
Beatriz Agis-Gonzalez,
Giacomo Bonnoli,
Victor Casanova,
Daniel Morcuende,
Vilppu Piirola,
Alfredo Sota,
Pouya M. Kouch,
Elina Lindfors,
Callum McCall
, et al. (125 additional authors not shown)
Abstract:
Blazars, supermassive black hole systems (SMBHs) with highly relativistic jets aligned with the line of sight, are the most powerful long-lived emitters of electromagnetic emission in the Universe. We report here on a radio to gamma-ray multiwavelength campaign on the blazar BL Lacertae with unprecedented polarimetric coverage from radio to X-ray wavelengths. The observations caught an extraordina…
▽ More
Blazars, supermassive black hole systems (SMBHs) with highly relativistic jets aligned with the line of sight, are the most powerful long-lived emitters of electromagnetic emission in the Universe. We report here on a radio to gamma-ray multiwavelength campaign on the blazar BL Lacertae with unprecedented polarimetric coverage from radio to X-ray wavelengths. The observations caught an extraordinary event on 2023 November 10-18, when the degree of linear polarization of optical synchrotron radiation reached a record value of 47.5%. In stark contrast, the Imaging X-ray Polarimetry Explorer (IXPE) found that the X-ray (Compton scattering or hadron-induced) emission was polarized at less than 7.4% (3sigma confidence level). We argue here that this observational result rules out a hadronic origin of the high energy emission, and strongly favors a leptonic (Compton scattering) origin, thereby breaking the degeneracy between hadronic and leptonic emission models for BL Lacertae and demonstrating the power of multiwavelength polarimetry to address this question. Furthermore, the multiwavelength flux and polarization variability, featuring an extremely prominent rise and decay of the optical polarization degree, is interpreted for the first time by the relaxation of a magnetic "spring" embedded in the newly injected plasma. This suggests that the plasma jet can maintain a predominant toroidal magnetic field component parsecs away from the central engine.
△ Less
Submitted 3 May, 2025;
originally announced May 2025.
-
A polarized view of the young Pulsar Wind Nebula 3C 58 with IXPE
Authors:
N. Bucciantini,
J. Wong,
R. W. Romani,
F. Xie,
C. -Y. Ng,
S. Silvestri,
N. Di Lalla,
Y. -J. Yang,
S. Zhang,
P. Slane,
W. -T. Ye,
M. Pilia,
N. Omodei,
M. Negro
Abstract:
Pulsar Wind nebulae (PWNe), are among the most efficient particle accelerators in the Universe, however understanding the physical conditions and the magnetic geometry in their inner region has always proved elusive. X-ray polarization provides now a unique opportunity to investigate the magnetic field structure and turbulence properties close to where high energy particles are accelerated. Here w…
▽ More
Pulsar Wind nebulae (PWNe), are among the most efficient particle accelerators in the Universe, however understanding the physical conditions and the magnetic geometry in their inner region has always proved elusive. X-ray polarization provides now a unique opportunity to investigate the magnetic field structure and turbulence properties close to where high energy particles are accelerated. Here we report on the recent X-ray polarization measurement of the PWN 3C 58 by the International X-ray Polarimeter Explorer (IXPE). 3C 58 is a young system displaying a characteristic jet-torus structure which, unlike other PWNe, is seen almost edge on. This nebula shows a high level of integrated polarization ~ 22% at an angle ~ 97deg, with an implied magnetic field oriented parallel to the major axis of the inner torus, suggesting a toroidal magnetic geometry with little turbulence in the interior, and an intrinsic level of polarization possibly approaching the theoretical limit for synchrotron emission. No significant detection of a polarized signal from the associated pulsar was found. These results confirm that the internal structure of young PWNe is far less turbulent than previously predicted, and at odds with multidimensional numerical simulations.
△ Less
Submitted 16 May, 2025; v1 submitted 29 April, 2025;
originally announced April 2025.
-
Sensitivity of the CUPID experiment to $0νββ$ decay of $^{100}$Mo
Authors:
K. Alfonso,
A. Armatol,
C. Augier,
F. T. Avignone III,
O. Azzolini,
A. S. Barabash,
G. Bari,
A. Barresi,
D. Baudin,
F. Bellini,
G. Benato,
L. Benussi,
V. Berest,
M. Beretta,
L. Bergé,
M. Bettelli,
M. Biassoni,
J. Billard,
F. Boffelli,
V. Boldrini,
E. D. Brandani,
C. Brofferio,
C. Bucci,
M. Buchynska,
J. Camilleri
, et al. (167 additional authors not shown)
Abstract:
CUPID is a next-generation bolometric experiment to search for neutrinoless double-beta decay ($0νββ$) of $^{100}$Mo using Li$_2$MoO$_4$ scintillating crystals. It will operate 1596 crystals at $\sim$10 mK in the CUORE cryostat at the Laboratori Nazionali del Gran Sasso in Italy. Each crystal will be facing two Ge-based bolometric light detectors for $α$ rejection. We compute the discovery and the…
▽ More
CUPID is a next-generation bolometric experiment to search for neutrinoless double-beta decay ($0νββ$) of $^{100}$Mo using Li$_2$MoO$_4$ scintillating crystals. It will operate 1596 crystals at $\sim$10 mK in the CUORE cryostat at the Laboratori Nazionali del Gran Sasso in Italy. Each crystal will be facing two Ge-based bolometric light detectors for $α$ rejection. We compute the discovery and the exclusion sensitivity of CUPID to $0νββ$ in a Frequentist and a Bayesian framework. This computation is done numerically based on pseudo-experiments. For the CUPID baseline scenario, with a background and an energy resolution of $1.0 \times 10^{-4}$ counts/keV/kg/yr and 5 keV FWHM at the Q-value, respectively, this results in a Bayesian exclusion sensitivity (90% c.i.) of $\hat{T}_{1/2} > 1.6^{+0.6}_{-0.5} \times 10^{27} \ \mathrm{yr}$, corresponding to the effective Majorana neutrino mass of $\hat{m}_{ββ} < \ 9.6$ -- $16.3 \ \mathrm{meV}$. The Frequentist discovery sensitivity (3$σ$) is $\hat{T}_{1/2}= 1.0 \times 10^{27} \ \mathrm{yr}$, corresponding to $\hat{m}_{ββ}= \ 12.2$ -- $20.6 \ \mathrm{meV}$.
△ Less
Submitted 19 April, 2025;
originally announced April 2025.
-
Spatially resolved polarization variation of the Crab Nebula
Authors:
Chao Zuo,
Fei Xie,
Mingyu Ge,
Wei Deng,
Kuan Liu,
Fabio La Monaca,
Alessandro Di Marco,
Wenhao Wei,
Wei Chen
Abstract:
We examined the spatially resolved polarization variations in the Crab Nebula over 2 yr, using observational data from the Imaging X-ray Polarimetry Explorer, and offer key insights into its magnetic field structures and evolution. The results show significant temporal changes in the polarization degree (PD) across three regions of interest in the 2-8 keV energy band. Regions (a) and (b), located…
▽ More
We examined the spatially resolved polarization variations in the Crab Nebula over 2 yr, using observational data from the Imaging X-ray Polarimetry Explorer, and offer key insights into its magnetic field structures and evolution. The results show significant temporal changes in the polarization degree (PD) across three regions of interest in the 2-8 keV energy band. Regions (a) and (b), located in the northern and the southwestern parts of the study area, exhibit PD variations with significance levels greater than 4 sigma and 3 sigma , respectively. Region (c), located in the southwest,shows a notable decrease in PD with a significance greater than 5 sigma. However, no significant variation in the polarization angle was observed. Meanwhile, notable flux variations were detected, likely influenced by dynamic processes such as magnetized turbulence within the nebula.
△ Less
Submitted 13 April, 2025;
originally announced April 2025.
-
Energy-resolved polarisation study of the Crab Nebula with IXPE
Authors:
Wenhao Wei,
Fei Xie,
Fabio La Monaca,
Wei Deng,
Mingyu Ge,
Kuan Liu,
Chao Zuo,
Wei Chen
Abstract:
This work presents a new detailed study on the energy-dependent variation in the X-ray polarisation of the Crab Pulsar Wind Nebula (PWN), obtained using data from the Imaging X-ray Polarimetry Explorer (IXPE). For the entire PWN, we observed a linear variation in polarisation degree (PD), and detected the rotation of the polarisation angle (PA) with the energy at higher than 99.9999\% of the confi…
▽ More
This work presents a new detailed study on the energy-dependent variation in the X-ray polarisation of the Crab Pulsar Wind Nebula (PWN), obtained using data from the Imaging X-ray Polarimetry Explorer (IXPE). For the entire PWN, we observed a linear variation in polarisation degree (PD), and detected the rotation of the polarisation angle (PA) with the energy at higher than 99.9999\% of the confidence level. This energy-dependent polarisation variation is in line with the indication found in Vela PWN by IXPE, and it can be interpreted as the emitting region of the polarised photons shrinks with increasing energy, leading to higher PD because they are less influenced by the turbulence of the magnetic field. We compared the IXPE polarisation results with those of other hard X-ray/gamma observatories (PoGO+, Intregral, AstroSat) for the PWN, finding the same trend from soft-X to hard-X with the PD increasing with the energy and the PA approaching the pulsar's spin axis. In fact, in this wide energy band, the fitting results show an energy trend for the PA compatible with the estimated pulsar's spin axis within 3$σ$ of confidence level.
△ Less
Submitted 13 April, 2025;
originally announced April 2025.
-
Deriving the Gradients of Some Popular Optimal Transport Algorithms
Authors:
Fangzhou Xie
Abstract:
In this note, I review entropy-regularized Monge-Kantorovich problem in Optimal Transport, and derive the gradients of several popular algorithms popular in Computational Optimal Transport, including the Sinkhorn algorithms, Wasserstein Barycenter algorithms, and the Wasserstein Dictionary Learning algorithms.
In this note, I review entropy-regularized Monge-Kantorovich problem in Optimal Transport, and derive the gradients of several popular algorithms popular in Computational Optimal Transport, including the Sinkhorn algorithms, Wasserstein Barycenter algorithms, and the Wasserstein Dictionary Learning algorithms.
△ Less
Submitted 11 April, 2025;
originally announced April 2025.
-
Half-life and precision shape measurement of 2νββ decay of $^{130}$Te
Authors:
D. Q. Adams,
C. Alduino,
K. Alfonso,
F. T. Avignone III,
O. Azzolini,
G. Bari,
F. Bellini,
G. Benato,
M. Beretta,
M. Biassoni,
A. Branca,
C. Brofferio,
C. Bucci,
J. Camilleri,
A. Caminata,
A. Campani,
J. Cao,
C. Capelli,
S. Capelli,
L. Cappelli,
L. Cardani,
P. Carniti,
N. Casali,
E. Celi,
D. Chiesa
, et al. (97 additional authors not shown)
Abstract:
We present a new measurement of the 2nbb half-life of 130Te (T1/2) using the first complete model of the CUORE data, based on 1038 kg yr of collected exposure. Thanks to optimized data selection, we achieve a factor of two improvement in precision, obtaining T1/2 = (9.32 +0.05 -0.04 (stat.) +0.07 -0.07 (syst.)) x10^20 yr. The signal-to-background ratio is increased by 70% compared to our previous…
▽ More
We present a new measurement of the 2nbb half-life of 130Te (T1/2) using the first complete model of the CUORE data, based on 1038 kg yr of collected exposure. Thanks to optimized data selection, we achieve a factor of two improvement in precision, obtaining T1/2 = (9.32 +0.05 -0.04 (stat.) +0.07 -0.07 (syst.)) x10^20 yr. The signal-to-background ratio is increased by 70% compared to our previous results, enabling the first application of the improved 2nbb formalism to 130Te. Within this framework, we determine a credibility interval for the effective axial coupling in the nuclear medium as a function of nuclear matrix elements. We also extract values for the higher-order nuclear matrix element ratios: second-to-first and third-to-first. The second-to-first ratio agrees with nuclear model predictions, while the third-to-first ratio deviates from theoretical expectations. These findings provide essential tests of nuclear models and key inputs for future 0nbb searches.
△ Less
Submitted 31 March, 2025;
originally announced March 2025.
-
Kondo-lattice phenomenology of twisted bilayer WSe$_2$ from compact molecular orbitals of topological bands
Authors:
Fang Xie,
Chenyuan Li,
Jennifer Cano,
Qimiao Si
Abstract:
The discovery of superconductivity and correlated electronic phases in twisted bilayer WSe$_2$ (Xia et al., Nature 2024; Guo et al., Nature 2025) has generated considerable excitement. Accompanying the superconductivity and a correlated insulator phase is the Kondo-lattice-like phenomenology in transport properties. Here we consider how such phenomenology can develop when the combination of the ac…
▽ More
The discovery of superconductivity and correlated electronic phases in twisted bilayer WSe$_2$ (Xia et al., Nature 2024; Guo et al., Nature 2025) has generated considerable excitement. Accompanying the superconductivity and a correlated insulator phase is the Kondo-lattice-like phenomenology in transport properties. Here we consider how such phenomenology can develop when the combination of the active bands are topological. We advance a unique construction of compact molecular orbitals through a partial Wannierization that is symmetry preserving. The resulting Anderson lattice model provides the basis for a microscopic understanding of the experimental observation, including the involved energy scales. Our approach may apply to a broad range of settings where topology and correlations interplay.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
X-ray Polarization of the High-Synchrotron-Peak BL Lacertae Object 1ES 1959+650 during Intermediate and High X-ray Flux States
Authors:
Luigi Pacciani,
Dawoon E. Kim,
Riccardo Middei,
Herman L. Marshall,
Alan P. Marscher,
Ioannis Liodakis,
Iván Agudo,
Svetlana G. Jorstad,
Juri Poutanen,
Manel Errando,
Laura Di Gesu,
Michela Negro,
Fabrizio Tavecchio,
Kinwah Wu,
Chien-Ting Chen,
Fabio Muleri,
Lucio Angelo Antonelli,
Immacolata Donnarumma,
Steven R. Ehlert,
Francesco Massaro,
Stephen L. O'Dell,
Matteo Perri,
Simonetta Puccetti,
Giacomo Bonnoli,
Pouya M. Kouch
, et al. (75 additional authors not shown)
Abstract:
We report the Imaging X-ray Polarimetry Explorer (IXPE) polarimetric and simultaneous multiwavelength observations of the high-energy-peaked BL Lacertae (HBL) object 1ES 1959+650, performed in 2022 October and 2023 August. In 2022 October IXPE measured an average polarization degree $Π_{\rm X}=9.4\;\!\%\pm 1.6\;\!\%$ and an electric-vector position angle $ψ_{\rm X}=53^{\circ}\pm 5^{\circ}$. The po…
▽ More
We report the Imaging X-ray Polarimetry Explorer (IXPE) polarimetric and simultaneous multiwavelength observations of the high-energy-peaked BL Lacertae (HBL) object 1ES 1959+650, performed in 2022 October and 2023 August. In 2022 October IXPE measured an average polarization degree $Π_{\rm X}=9.4\;\!\%\pm 1.6\;\!\%$ and an electric-vector position angle $ψ_{\rm X}=53^{\circ}\pm 5^{\circ}$. The polarized X-ray emission can be decomposed into a constant component, plus a rotating component, with rotation velocity $ω_{\rm EVPA}=(-117\;\!\pm\;\!12)$ ${\rm deg}\;\!{\rm d}^{-1}$. In 2023 August, during a period of pronounced activity of the source, IXPE measured an average $Π_{\rm X}=12.4\;\!\%\pm0.7\;\!\%$ and $ψ_X=20^{\circ}\pm2^{\circ}$, with evidence ($\sim$0.4$\;\!\%$ chance probability) for a rapidly rotating component with $ω_{\rm EVPA}=(1864\;\!\pm\;\!34)$ ${\rm deg}\;\!{\rm d}^{-1}$. These findings suggest the presence of a helical magnetic field in the jet of 1ES 1959+650 or stochastic processes governing the field in turbulent plasma. Our multiwavelength campaigns from radio to X-ray reveal variability in both polarization and flux from optical to X-rays. We interpret the results in terms of a relatively slowly varying component dominating the radio and optical emission, while rapidly variable polarized components dominate the X-ray and provide minor contribution at optical wavelengths. The radio and optical data indicate that on parsec scales the magnetic field is primarily orthogonal to the jet direction. On the contrary, X-ray measurements show a magnetic field almost aligned with the parsec jet direction. Confronting with other IXPE observations, we guess that the magnetic field of HBLs on sub-pc scale should be rather unstable, often changing its direction with respect to the VLBA jet.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
Riemannian Optimization on Relaxed Indicator Matrix Manifold
Authors:
Jinghui Yuan,
Fangyuan Xie,
Feiping Nie,
Xuelong Li
Abstract:
The indicator matrix plays an important role in machine learning, but optimizing it is an NP-hard problem. We propose a new relaxation of the indicator matrix and prove that this relaxation forms a manifold, which we call the Relaxed Indicator Matrix Manifold (RIM manifold). Based on Riemannian geometry, we develop a Riemannian toolbox for optimization on the RIM manifold. Specifically, we provide…
▽ More
The indicator matrix plays an important role in machine learning, but optimizing it is an NP-hard problem. We propose a new relaxation of the indicator matrix and prove that this relaxation forms a manifold, which we call the Relaxed Indicator Matrix Manifold (RIM manifold). Based on Riemannian geometry, we develop a Riemannian toolbox for optimization on the RIM manifold. Specifically, we provide several methods of Retraction, including a fast Retraction method to obtain geodesics. We point out that the RIM manifold is a generalization of the double stochastic manifold, and it is much faster than existing methods on the double stochastic manifold, which has a complexity of \( \mathcal{O}(n^3) \), while RIM manifold optimization is \( \mathcal{O}(n) \) and often yields better results. We conducted extensive experiments, including image denoising, with millions of variables to support our conclusion, and applied the RIM manifold to Ratio Cut, we provide a rigorous convergence proof and achieve clustering results that outperform the state-of-the-art methods. Our Code in \href{https://github.com/Yuan-Jinghui/Riemannian-Optimization-on-Relaxed-Indicator-Matrix-Manifold}{here}.
△ Less
Submitted 11 April, 2025; v1 submitted 26 March, 2025;
originally announced March 2025.
-
FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System
Authors:
Hao-Han Guo,
Yao Hu,
Fei-Yu Shen,
Xu Tang,
Yi-Chen Wu,
Feng-Long Xie,
Kun Xie
Abstract:
In this work, we upgrade FireRedTTS to a new version, FireRedTTS-1S, a high-quality streaming foundation text-to-speech system. FireRedTTS-1S achieves streaming speech generation via two steps: text-to-semantic decoding and semantic-to-acoustic decoding. In text-to-semantic decoding, a semantic-aware speech tokenizer converts the speech signal into semantic tokens, which can be synthesized from th…
▽ More
In this work, we upgrade FireRedTTS to a new version, FireRedTTS-1S, a high-quality streaming foundation text-to-speech system. FireRedTTS-1S achieves streaming speech generation via two steps: text-to-semantic decoding and semantic-to-acoustic decoding. In text-to-semantic decoding, a semantic-aware speech tokenizer converts the speech signal into semantic tokens, which can be synthesized from the text via a language model in an auto-regressive manner. Meanwhile, the semantic-to-acoustic decoding module simultaneously translates generated semantic tokens into the speech signal in a streaming way. We implement two approaches to achieve this module: 1) a chunk-wise streamable flow-matching approach, and 2) a multi-stream language model-based approach. They both present high-quality and streamable speech generation but differ in real-time factor (RTF) and latency. Specifically, flow-matching decoding can generate speech by chunks, presenting a lower RTF of 0.1 but a higher latency of 300ms. Instead, the multi-stream language model generates speech by frames in an autoregressive manner, presenting a higher RTF of 0.3 but a low latency of 150ms. In experiments on zero-shot voice cloning, the objective results validate FireRedTTS-1S as a high-quality foundation model with comparable intelligibility and speaker similarity over industrial baseline systems. Furthermore, the subjective score of FireRedTTS-1S highlights its impressive synthesis performance, achieving comparable quality to the ground-truth recordings. These results validate FireRedTTS-1S as a high-quality streaming foundation TTS system.
△ Less
Submitted 26 May, 2025; v1 submitted 26 March, 2025;
originally announced March 2025.
-
A Revisit of Large-Scale Patterns in Middle Stratospheric Circulation Variations
Authors:
Ningning Tao,
Xiaosong Chen,
Fei Xie,
Yongwen Zhang,
Yan Xia,
Xuan Ma,
Han Huang,
Hongyu Wang
Abstract:
Variations in stratospheric atmospheric circulation significantly influence tropospheric weather and climate, and understanding these variations can guide stratospheric aircraft development and operations. Despite a century of progress, large-scale patterns in stratospheric circulation remain poorly understood due to the stratosphere's complex nature. To address this, we applied the eigen microsta…
▽ More
Variations in stratospheric atmospheric circulation significantly influence tropospheric weather and climate, and understanding these variations can guide stratospheric aircraft development and operations. Despite a century of progress, large-scale patterns in stratospheric circulation remain poorly understood due to the stratosphere's complex nature. To address this, we applied the eigen microstate approach (EMA) to analyze zonal wind from 70-10 hPa using ERA5 reanalysis data from 1980-2022. We focused on the three leading modes, corresponding to the quasi-biennial oscillation (QBO) and stratospheric circulation in the Arctic and Antarctic. After removing high-frequency components, we observed a significant 11-year cycle in the Antarctic stratospheric circulation mode, possibly linked to the solar cycle. In contrast, the Arctic mode showed a 5-6-year cycle without 11-year periodicity. This difference likely arises from the seasonal timing of polar vortex breakdowns: the Antarctic vortex persists into late spring and summer, making it more sensitive to solar radiation, while the Arctic vortex peaks in winter and early spring. The fourth mode showed features of a Southern Hemisphere dipole and was significantly correlated with the Antarctic mode, leading it by about two months. Finally, we developed a linear prediction model that demonstrated predictive skill for the Antarctic polar vortex.
△ Less
Submitted 25 March, 2025;
originally announced March 2025.
-
Correlated flat-band physics in a bilayer kagome metal based on compact molecular orbitals
Authors:
Mounica Mahankali,
Fang Xie,
Yuan Fang,
Lei Chen,
Shouvik Sur,
Silke Paschen,
Jean C. Souza,
Moshe Haim,
Ambikesh Gupta,
Nurit Avraham,
Haim Beidenkopf,
Hengxin Tan,
Binghai Yan,
Qimiao Si
Abstract:
Flat bands, when located close to the Fermi energy, can considerably enhance the influence of electron correlations on the low energy physics in kagome and other frustrated-lattice metals. A major challenge in describing the interaction effects in such bulk materials is that the flat band is often intermixed with a large number of other bands. Here we show that the recently introduced notion of co…
▽ More
Flat bands, when located close to the Fermi energy, can considerably enhance the influence of electron correlations on the low energy physics in kagome and other frustrated-lattice metals. A major challenge in describing the interaction effects in such bulk materials is that the flat band is often intermixed with a large number of other bands. Here we show that the recently introduced notion of compact molecular orbitals (CMOs) enable a path forward in describing the dominant effect of the Coulomb interactions in spite of the complexity of the bandstructure. Our materials-based analysis allows for the understanding of the scanning-tunneling-microscopy experiment [J. C. Souza et al., preprint (2024)] of the bilayer kagome metal Ni$_3$In in terms of the CMO notion. From the resulting CMO, an effective Anderson lattice model can be set up. This CMO-based approach enables the calculation of correlation effects that is difficult to do based on the atomic orbitals. Furthermore, it suggests an enriched phase diagram for the strange metal physics of the kagome metal, which can be tested by future experiments. We discuss the implications of our results for the general correlation physics of flat band systems and beyond.
△ Less
Submitted 12 March, 2025;
originally announced March 2025.
-
Resolving the Kagome Origin of the Strange Metallicity in Ni$_3$In
Authors:
Jean C. Souza,
Moshe Haim,
Ambikesh Gupta,
Mounica Mahankali,
Fang Xie,
Yuan Fang,
Lei Chen,
Shiang Fang,
Hengxin Tan,
Minyong Han,
Caolan John,
Jingxu Zheng,
Yiwen Liu,
Binghai Yan,
Joseph G. Checkelsky,
Qimiao Si,
Nurit Avraham,
Haim Beidenkopf
Abstract:
Strong correlations promote singular properties such as strange metallicity, which shows considerable commonality across quantum materials platforms. Understanding the mechanism for such emerging universality is an outstanding challenge, given that the underlying degrees of freedom can be complex and varied. Progress may be made in flat band systems, especially kagome and other frustrated-lattice…
▽ More
Strong correlations promote singular properties such as strange metallicity, which shows considerable commonality across quantum materials platforms. Understanding the mechanism for such emerging universality is an outstanding challenge, given that the underlying degrees of freedom can be complex and varied. Progress may be made in flat band systems, especially kagome and other frustrated-lattice metals with active flat bands. These systems show strange metal behavior that bears a striking resemblance to what happens in heavy-fermion metals. Here, in scanning tunneling spectroscopy of kagome metal Ni$_3$In, we find a zero-bias peak-dip structure whose variation with magnetic field and temperature tracks the evolution of the strange metal properties. We identify the origin of the peak as compact molecular orbitals formed by destructive interference over the kagome sites, resulting in emergent $f$-shell-like localized moments. Using quasi-particle interference, we visualize their interaction with the Dirac light bands. We thus unveil the essential microscopic ingredients of the $d$-electron-based kagome metals that, while distinct from the atomic orbitals of the $f$-electron-based heavy fermion materials, are responsible for a shared phenomenology between the two types of systems. Our findings provide a new window to uncover and interconnect the essential and yet diverse microscopic building blocks in disparate families of quantum materials that drive a convergence towards a universal understanding in the regime of amplified quantum fluctuations.
△ Less
Submitted 12 March, 2025;
originally announced March 2025.
-
Innovating Bolometers' Mounting: A Gravity-Based Approach
Authors:
The CUPID Collaboration,
K. Alfonso,
A. Armatol,
C. Augier,
F. T. Avignone III,
O. Azzolini,
A. S. Barabash,
G. Bari,
A. Barresi,
D. Baudin,
F. Bellini,
G. Benato,
L. Benussi,
V. Berest,
M. Beretta,
M. Bettelli,
M. Biassoni,
J. Billard,
F. Boffelli,
V. Boldrini,
E. D. Brandani,
C. Brofferio,
C. Bucci,
M. Buchynska,
J. Camilleri
, et al. (168 additional authors not shown)
Abstract:
Cryogenic calorimeters, also known as bolometers, are among the leading technologies for searching for rare events. The CUPID experiment is exploiting this technology to deploy a tonne-scale detector to search for neutrinoless double-beta decay of $^{100}$Mo. The CUPID collaboration proposed an innovative approach to assembling bolometers in a stacked configuration, held in position solely by grav…
▽ More
Cryogenic calorimeters, also known as bolometers, are among the leading technologies for searching for rare events. The CUPID experiment is exploiting this technology to deploy a tonne-scale detector to search for neutrinoless double-beta decay of $^{100}$Mo. The CUPID collaboration proposed an innovative approach to assembling bolometers in a stacked configuration, held in position solely by gravity. This gravity-based assembly method is unprecedented in the field of bolometers and offers several advantages, including relaxed mechanical tolerances and simplified construction. To assess and optimize its performance, we constructed a medium-scale prototype hosting 28 Li$_2$MoO$_4$ crystals and 30 Ge light detectors, both operated as cryogenic calorimeters at the Laboratori Nazionali del Gran Sasso (Italy). Despite an unexpected excess of noise in the light detectors, the results of this test proved (i) a thermal stability better than $\pm$0.5 mK at 10 mK, (ii) a good energy resolution of Li$_2$MoO$_4$ bolometers, (6.6 $\pm$ 2.2) keV FWHM at 2615 keV, and (iii) a Li$_2$MoO$_4$ light yield measured by the closest light detector of 0.36 keV/MeV, sufficient to guarantee the particle identification requested by CUPID.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
CUPID, the CUORE Upgrade with Particle Identification
Authors:
The CUPID Collaboration,
K. Alfonso,
A. Armatol,
C. Augier,
F. T. Avignone III,
O. Azzolini,
A. S. Barabash,
G. Bari,
A. Barresi,
D. Baudin,
F. Bellini,
G. Benato,
L. Benussi,
V. Berest,
M. Beretta,
M. Bettelli,
M. Biassoni,
J. Billard,
F. Boffelli,
V. Boldrini,
E. D. Brandani,
C. Brofferio,
C. Bucci,
M. Buchynska,
J. Camilleri
, et al. (166 additional authors not shown)
Abstract:
CUPID, the CUORE Upgrade with Particle Identification, is a next-generation experiment to search for neutrinoless double beta decay ($0νββ$) and other rare events using enriched Li$_2$$^{100}$MoO$_4$ scintillating bolometers. It will be hosted by the CUORE cryostat located at the Laboratori Nazionali del Gran Sasso in Italy. The main physics goal of CUPID is to search for $0νββ$\ of $^{100}$Mo wit…
▽ More
CUPID, the CUORE Upgrade with Particle Identification, is a next-generation experiment to search for neutrinoless double beta decay ($0νββ$) and other rare events using enriched Li$_2$$^{100}$MoO$_4$ scintillating bolometers. It will be hosted by the CUORE cryostat located at the Laboratori Nazionali del Gran Sasso in Italy. The main physics goal of CUPID is to search for $0νββ$\ of $^{100}$Mo with a discovery sensitivity covering the full neutrino mass regime in the inverted ordering scenario, as well as the portion of the normal ordering regime with lightest neutrino mass larger than 10 meV. With a conservative background index of 10$^{-4}$ cnts/(keV$\cdot$kg$\cdot$yr), 240 kg isotope mass, 5 keV FWHM energy resolution and 10 live-years of data taking, CUPID will have a 90\% C.L. half-life exclusion sensitivity of 1.8 $\cdot$ 10$^{27}$ yr, corresponding to an effective Majorana neutrino mass ($m_{ββ}$) sensitivity of 9--15 meV, and a $3σ$ discovery sensitivity of 1 $\cdot$ 10$^{27}$ yr, corresponding to an $m_{ββ}$ range of 12--21 meV.
△ Less
Submitted 1 March, 2025;
originally announced March 2025.
-
PodAgent: A Comprehensive Framework for Podcast Generation
Authors:
Yujia Xiao,
Lei He,
Haohan Guo,
Fenglong Xie,
Tan Lee
Abstract:
Existing Existing automatic audio generation methods struggle to generate podcast-like audio programs effectively. The key challenges lie in in-depth content generation, appropriate and expressive voice production. This paper proposed PodAgent, a comprehensive framework for creating audio programs. PodAgent 1) generates informative topic-discussion content by designing a Host-Guest-Writer multi-ag…
▽ More
Existing Existing automatic audio generation methods struggle to generate podcast-like audio programs effectively. The key challenges lie in in-depth content generation, appropriate and expressive voice production. This paper proposed PodAgent, a comprehensive framework for creating audio programs. PodAgent 1) generates informative topic-discussion content by designing a Host-Guest-Writer multi-agent collaboration system, 2) builds a voice pool for suitable voice-role matching and 3) utilizes LLM-enhanced speech synthesis method to generate expressive conversational speech. Given the absence of standardized evaluation criteria for podcast-like audio generation, we developed comprehensive assessment guidelines to effectively evaluate the model's performance. Experimental results demonstrate PodAgent's effectiveness, significantly surpassing direct GPT-4 generation in topic-discussion dialogue content, achieving an 87.4% voice-matching accuracy, and producing more expressive speech through LLM-guided synthesis. Demo page: https://podcast-agent.github.io/demo/. Source code: https://github.com/yujxx/PodAgent.
△ Less
Submitted 1 March, 2025;
originally announced March 2025.
-
PI-HMR: Towards Robust In-bed Temporal Human Shape Reconstruction with Contact Pressure Sensing
Authors:
Ziyu Wu,
Yufan Xiong,
Mengting Niu,
Fangting Xie,
Quan Wan,
Qijun Ying,
Boyan Liu,
Xiaohui Cai
Abstract:
Long-term in-bed monitoring benefits automatic and real-time health management within healthcare, and the advancement of human shape reconstruction technologies further enhances the representation and visualization of users' activity patterns. However, existing technologies are primarily based on visual cues, facing serious challenges in non-light-of-sight and privacy-sensitive in-bed scenes. Pres…
▽ More
Long-term in-bed monitoring benefits automatic and real-time health management within healthcare, and the advancement of human shape reconstruction technologies further enhances the representation and visualization of users' activity patterns. However, existing technologies are primarily based on visual cues, facing serious challenges in non-light-of-sight and privacy-sensitive in-bed scenes. Pressure-sensing bedsheets offer a promising solution for real-time motion reconstruction. Yet, limited exploration in model designs and data have hindered its further development. To tackle these issues, we propose a general framework that bridges gaps in data annotation and model design. Firstly, we introduce SMPLify-IB, an optimization method that overcomes the depth ambiguity issue in top-view scenarios through gravity constraints, enabling generating high-quality 3D human shape annotations for in-bed datasets. Then we present PI-HMR, a temporal-based human shape estimator to regress meshes from pressure sequences. By integrating multi-scale feature fusion with high-pressure distribution and spatial position priors, PI-HMR outperforms SOTA methods with 17.01mm Mean-Per-Joint-Error decrease. This work provides a whole
△ Less
Submitted 22 March, 2025; v1 submitted 27 February, 2025;
originally announced March 2025.
-
VRM: Knowledge Distillation via Virtual Relation Matching
Authors:
Weijia Zhang,
Fei Xie,
Weidong Cai,
Chao Ma
Abstract:
Knowledge distillation (KD) aims to transfer the knowledge of a more capable yet cumbersome teacher model to a lightweight student model. In recent years, relation-based KD methods have fallen behind, as their instance-matching counterparts dominate in performance. In this paper, we revive relational KD by identifying and tackling several key issues in relation-based methods, including their susce…
▽ More
Knowledge distillation (KD) aims to transfer the knowledge of a more capable yet cumbersome teacher model to a lightweight student model. In recent years, relation-based KD methods have fallen behind, as their instance-matching counterparts dominate in performance. In this paper, we revive relational KD by identifying and tackling several key issues in relation-based methods, including their susceptibility to overfitting and spurious responses. Specifically, we transfer novelly constructed affinity graphs that compactly encapsulate a wealth of beneficial inter-sample, inter-class, and inter-view correlations by exploiting virtual views and relations as a new kind of knowledge. As a result, the student has access to richer guidance signals and stronger regularisation throughout the distillation process. To further mitigate the adverse impact of spurious responses, we prune the affinity graphs by dynamically detaching redundant and unreliable edges. Extensive experiments on CIFAR-100 and ImageNet datasets demonstrate the superior performance of the proposed virtual relation matching (VRM) method over a range of models, architectures, and set-ups. For instance, VRM for the first time hits 74.0% accuracy for ResNet50-to-MobileNetV2 distillation on ImageNet, and improves DeiT-T by 14.44% on CIFAR-100 with a ResNet56 teacher. Thorough analyses are also conducted to gauge the soundness, properties, and complexity of our designs. Code and models will be released.
△ Less
Submitted 1 April, 2025; v1 submitted 28 February, 2025;
originally announced February 2025.
-
Local and Non-local Entanglement Witnesses of Fermi Liquid
Authors:
Yiming Wang,
Yuan Fang,
Fang Xie,
Qimiao Si
Abstract:
There is a growing interest both in utilizing entanglement means to characterize many-body systems and in uncovering their entanglement depth. Motivated by recent findings that the spin quantum Fisher information witnesses amplified multipartite entanglement of strange metals and characterizes their loss of quasiparticles, we study the quantum Fisher information in various cases of Fermi liquid. W…
▽ More
There is a growing interest both in utilizing entanglement means to characterize many-body systems and in uncovering their entanglement depth. Motivated by recent findings that the spin quantum Fisher information witnesses amplified multipartite entanglement of strange metals and characterizes their loss of quasiparticles, we study the quantum Fisher information in various cases of Fermi liquid. We show that local operators generically do not witness any multipartite entanglement in a Fermi liquid, but non-local many-body operators do. Our results point to novel experimental means to detect the entanglement depth of metallic fermionic systems and, in general, open a new avenue to the emerging exploration of entanglement in quantum materials.
△ Less
Submitted 19 February, 2025;
originally announced February 2025.
-
Comparing observed properties of winds in low-luminosity active galactic nuclei with theoretical predictions
Authors:
Fangzheng Shi,
Feng Yuan,
Francesco Tombesi,
Fu-guo XIe
Abstract:
Theoretical and numerical simulations of black hole hot accretion flows have shown the ubiquitous existence of winds and predicted their properties such as velocity and mass flux. In this paper, we have summarized from literature the physical properties of winds launched from low-luminosity active galactic nuclei (LLAGN), which are believed to be powered by hot accretion flows, and compared them w…
▽ More
Theoretical and numerical simulations of black hole hot accretion flows have shown the ubiquitous existence of winds and predicted their properties such as velocity and mass flux. In this paper, we have summarized from literature the physical properties of winds launched from low-luminosity active galactic nuclei (LLAGN), which are believed to be powered by hot accretion flows, and compared them with theoretical predictions. We infer that for both ultra-fast outflows and hot winds, the observed wind velocity as a function of their launching radius and the ratio between wind mass flux and black hole accretion rate show good consistency with theoretical predictions. For the prototype LLAGN M81* with abundant observational data, we have examined various observed properties of wind in detail, including velocity, mass flux of the wind, the power-law index of the radial profile of inflow rate, and the jet-to-wind power ratio. Good agreements are found with theoretical predictions, providing strong support to the theory of wind launched from hot accretion flows.
△ Less
Submitted 13 April, 2025; v1 submitted 12 February, 2025;
originally announced February 2025.
-
Continually Evolved Multimodal Foundation Models for Cancer Prognosis
Authors:
Jie Peng,
Shuang Zhou,
Longwei Yang,
Yiran Song,
Mohan Zhang,
Kaixiong Zhou,
Feng Xie,
Mingquan Lin,
Rui Zhang,
Tianlong Chen
Abstract:
Cancer prognosis is a critical task that involves predicting patient outcomes and survival rates. To enhance prediction accuracy, previous studies have integrated diverse data modalities, such as clinical notes, medical images, and genomic data, leveraging their complementary information. However, existing approaches face two major limitations. First, they struggle to incorporate newly arrived dat…
▽ More
Cancer prognosis is a critical task that involves predicting patient outcomes and survival rates. To enhance prediction accuracy, previous studies have integrated diverse data modalities, such as clinical notes, medical images, and genomic data, leveraging their complementary information. However, existing approaches face two major limitations. First, they struggle to incorporate newly arrived data with varying distributions into training, such as patient records from different hospitals, thus rendering sub-optimal generalizability and limited utility in real-world applications. Second, most multimodal integration methods rely on simplistic concatenation or task-specific pipelines, which fail to capture the complex interdependencies across modalities. To address these, we propose a continually evolving multi-modal foundation model. Extensive experiments on the TCGA dataset demonstrate the effectiveness of our approach, highlighting its potential to advance cancer prognosis by enabling robust and adaptive multimodal integration.
△ Less
Submitted 31 January, 2025; v1 submitted 30 January, 2025;
originally announced January 2025.
-
Dual-Bounded Nonlinear Optimal Transport for Size Constrained Min Cut Clustering
Authors:
Fangyuan Xie,
Jinghui Yuan,
Feiping Nie,
Xuelong Li
Abstract:
Min cut is an important graph partitioning method. However, current solutions to the min cut problem suffer from slow speeds, difficulty in solving, and often converge to simple solutions. To address these issues, we relax the min cut problem into a dual-bounded constraint and, for the first time, treat the min cut problem as a dual-bounded nonlinear optimal transport problem. Additionally, we dev…
▽ More
Min cut is an important graph partitioning method. However, current solutions to the min cut problem suffer from slow speeds, difficulty in solving, and often converge to simple solutions. To address these issues, we relax the min cut problem into a dual-bounded constraint and, for the first time, treat the min cut problem as a dual-bounded nonlinear optimal transport problem. Additionally, we develop a method for solving dual-bounded nonlinear optimal transport based on the Frank-Wolfe method (abbreviated as DNF). Notably, DNF not only solves the size constrained min cut problem but is also applicable to all dual-bounded nonlinear optimal transport problems. We prove that for convex problems satisfying Lipschitz smoothness, the DNF method can achieve a convergence rate of \(\mathcal{O}(\frac{1}{t})\). We apply the DNF method to the min cut problem and find that it achieves state-of-the-art performance in terms of both the loss function and clustering accuracy at the fastest speed, with a convergence rate of \(\mathcal{O}(\frac{1}{\sqrt{t}})\). Moreover, the DNF method for the size constrained min cut problem requires no parameters and exhibits better stability.
△ Less
Submitted 30 January, 2025;
originally announced January 2025.
-
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration
Authors:
Kai-Tuo Xu,
Feng-Long Xie,
Xu Tang,
Yao Hu
Abstract:
We present FireRedASR, a family of large-scale automatic speech recognition (ASR) models for Mandarin, designed to meet diverse requirements in superior performance and optimal efficiency across various applications. FireRedASR comprises two variants:
FireRedASR-LLM: Designed to achieve state-of-the-art (SOTA) performance and to enable seamless end-to-end speech interaction. It adopts an Encoder…
▽ More
We present FireRedASR, a family of large-scale automatic speech recognition (ASR) models for Mandarin, designed to meet diverse requirements in superior performance and optimal efficiency across various applications. FireRedASR comprises two variants:
FireRedASR-LLM: Designed to achieve state-of-the-art (SOTA) performance and to enable seamless end-to-end speech interaction. It adopts an Encoder-Adapter-LLM framework leveraging large language model (LLM) capabilities. On public Mandarin benchmarks, FireRedASR-LLM (8.3B parameters) achieves an average Character Error Rate (CER) of 3.05%, surpassing the latest SOTA of 3.33% with an 8.4% relative CER reduction (CERR). It demonstrates superior generalization capability over industrial-grade baselines, achieving 24%-40% CERR in multi-source Mandarin ASR scenarios such as video, live, and intelligent assistant.
FireRedASR-AED: Designed to balance high performance and computational efficiency and to serve as an effective speech representation module in LLM-based speech models. It utilizes an Attention-based Encoder-Decoder (AED) architecture. On public Mandarin benchmarks, FireRedASR-AED (1.1B parameters) achieves an average CER of 3.18%, slightly worse than FireRedASR-LLM but still outperforming the latest SOTA model with over 12B parameters. It offers a more compact size, making it suitable for resource-constrained applications.
Moreover, both models exhibit competitive results on Chinese dialects and English speech benchmarks and excel in singing lyrics recognition. To advance research in speech processing, we release our models and inference code at https://github.com/FireRedTeam/FireRedASR.
△ Less
Submitted 24 January, 2025;
originally announced January 2025.
-
AirMorph: Topology-Preserving Deep Learning for Pulmonary Airway Analysis
Authors:
Minghui Zhang,
Chenyu Li,
Fangfang Xie,
Yaoyu Liu,
Hanxiao Zhang,
Junyang Wu,
Chunxi Zhang,
Jie Yang,
Jiayuan Sun,
Guang-Zhong Yang,
Yun Gu
Abstract:
Accurate anatomical labeling and analysis of the pulmonary structure and its surrounding anatomy from thoracic CT is getting increasingly important for understanding the etilogy of abnormalities or supporting targetted therapy and early interventions. Whilst lung and airway cell atlases have been attempted, there is a lack of fine-grained morphological atlases that are clinically deployable. In th…
▽ More
Accurate anatomical labeling and analysis of the pulmonary structure and its surrounding anatomy from thoracic CT is getting increasingly important for understanding the etilogy of abnormalities or supporting targetted therapy and early interventions. Whilst lung and airway cell atlases have been attempted, there is a lack of fine-grained morphological atlases that are clinically deployable. In this work, we introduce AirMorph, a robust, end-to-end deep learning pipeline enabling fully automatic and comprehensive airway anatomical labeling at lobar, segmental, and subsegmental resolutions that can be used to create digital atlases of the lung. Evaluated across large-scale multi-center datasets comprising diverse pulmonary conditions, the AirMorph consistently outperformed existing segmentation and labeling methods in terms of accuracy, topological consistency, and completeness. To simplify clinical interpretation, we further introduce a compact anatomical signature quantifying critical morphological airway features, including stenosis, ectasia, tortuosity, divergence, length, and complexity. When applied to various pulmonary diseases such as pulmonary fibrosis, emphysema, atelectasis, consolidation, and reticular opacities, it demonstrates strong discriminative power, revealing disease-specific morphological patterns with high interpretability and explainability. Additionally, AirMorph supports efficient automated branching pattern analysis, potentially enhancing bronchoscopic navigation planning and procedural safety, offering a valuable clinical tool for improved diagnosis, targeted treatment, and personalized patient care.
△ Less
Submitted 8 May, 2025; v1 submitted 14 December, 2024;
originally announced December 2024.
-
Learning Novel Skills from Language-Generated Demonstrations
Authors:
Ao-Qun Jin,
Tian-Yu Xiang,
Xiao-Hu Zhou,
Mei-Jiang Gui,
Xiao-Liang Xie,
Shi-Qi Liu,
Shuang-Yi Wang,
Yue Cao,
Sheng-Bin Duan,
Fu-Chao Xie,
Zeng-Guang Hou
Abstract:
Robots are increasingly deployed across diverse domains to tackle tasks requiring novel skills. However, current robot learning algorithms for acquiring novel skills often rely on demonstration datasets or environment interactions, resulting in high labor costs and potential safety risks. To address these challenges, this study proposes DemoGen, a skill-learning framework that enables robots to ac…
▽ More
Robots are increasingly deployed across diverse domains to tackle tasks requiring novel skills. However, current robot learning algorithms for acquiring novel skills often rely on demonstration datasets or environment interactions, resulting in high labor costs and potential safety risks. To address these challenges, this study proposes DemoGen, a skill-learning framework that enables robots to acquire novel skills from natural language instructions. DemoGen leverages the vision-language model and the video diffusion model to generate demonstration videos of novel skills, which enabling robots to learn new skills effectively. Experimental evaluations in the MetaWorld simulation environments demonstrate the pipeline's capability to generate high-fidelity and reliable demonstrations. Using the generated demonstrations, various skill learning algorithms achieve an accomplishment rate three times the original on novel tasks. These results highlight a novel approach to robot learning, offering a foundation for the intuitive and intelligent acquisition of novel robotic skills. (Project website: https://aoqunjin.github.io/LNSLGD/)
△ Less
Submitted 20 May, 2025; v1 submitted 12 December, 2024;
originally announced December 2024.
-
Exploring the lifetime frontier with a beam-dump experiment at CiADS
Authors:
Liangwen Chen,
Mingxuan Du,
Zhiyu Sun,
Zeren Simon Wang,
Fang Xie,
Ju-Jun Xie,
Lei Yang,
Pei Yu,
Yu Zhang
Abstract:
We propose a beam-dump experiment (BDE) at the upcoming facility of China initiative Accelerator Driven System (CiADS), called CiADS-BDE, in order to search for long-lived particles (LLPs) predicted in various beyond-the-Standard-Model (BSM) theories. The experiment is to be located in the forward direction of the incoming low-energy proton beam at CiADS, leveraging the strong forward boost of the…
▽ More
We propose a beam-dump experiment (BDE) at the upcoming facility of China initiative Accelerator Driven System (CiADS), called CiADS-BDE, in order to search for long-lived particles (LLPs) predicted in various beyond-the-Standard-Model (BSM) theories. The experiment is to be located in the forward direction of the incoming low-energy proton beam at CiADS, leveraging the strong forward boost of the produced particles at the beam dump in general. The space between the dump and the detector is largely available, allowing for installation of veto materials and hence low levels of background events. We elaborate on the detector setup, and choose dark photon as a benchmark model for sensitivity study. We restrict ourselves to the signature of an electron-positron pair, and find that with 5 years' operation, unique, currently unexcluded parts of the parameter space for $\mathcal{O}(100)$ MeV dark-photon masses and $\mathcal{O}(10^{-9}\text{--}10^{-8})$ kinetic mixing can be probed at the CiADS-BDE. Furthermore, considering that there is no need to set up a proton beam specifically for this experiment and that the detector system requires minimal instrumentation, the experiment is supposed to be relatively cost-effective. Therefore, we intend this work to promote studies on the sensitivity reach of the proposed experiment to additional LLP scenarios, and in the end, the realization of the experiment. Incidentally, we study the sensitivity of the same BDE setups at the High Intensity Heavy-ion Accelerator Facility (HIAF), presently under construction near the CiADS program site, and conclude that HIAF-BDE could probe new parameter regions, too.
△ Less
Submitted 12 December, 2024;
originally announced December 2024.
-
Morphological-Symmetry-Equivariant Heterogeneous Graph Neural Network for Robotic Dynamics Learning
Authors:
Fengze Xie,
Sizhe Wei,
Yue Song,
Yisong Yue,
Lu Gan
Abstract:
We present a morphological-symmetry-equivariant heterogeneous graph neural network, namely MS-HGNN, for robotic dynamics learning, that integrates robotic kinematic structures and morphological symmetries into a single graph network. These structural priors are embedded into the learning architecture as constraints, ensuring high generalizability, sample and model efficiency. The proposed MS-HGNN…
▽ More
We present a morphological-symmetry-equivariant heterogeneous graph neural network, namely MS-HGNN, for robotic dynamics learning, that integrates robotic kinematic structures and morphological symmetries into a single graph network. These structural priors are embedded into the learning architecture as constraints, ensuring high generalizability, sample and model efficiency. The proposed MS-HGNN is a versatile and general architecture that is applicable to various multi-body dynamic systems and a wide range of dynamics learning problems. We formally prove the morphological-symmetry-equivariant property of our MS-HGNN and validate its effectiveness across multiple quadruped robot learning problems using both real-world and simulated data. Our code is made publicly available at https://github.com/lunarlab-gatech/MorphSym-HGNN/.
△ Less
Submitted 14 May, 2025; v1 submitted 2 December, 2024;
originally announced December 2024.
-
IXPE Observation of the Low-Synchrotron Peaked Blazar S4 0954+65 During An Optical-X-ray Flare
Authors:
Pouya M. Kouch,
Ioannis Liodakis,
Francesco Fenu,
Haocheng Zhang,
Stella Boula,
Riccardo Middei,
Laura Di Gesu,
Georgios F. Paraschos,
Iván Agudo,
Svetlana G. Jorstad,
Elina Lindfors,
Alan P. Marscher,
Henric Krawczynski,
Michela Negro,
Kun Hu,
Dawoon E. Kim,
Elisabetta Cavazzuti,
Manel Errando,
Dmitry Blinov,
Anastasia Gourni,
Sebastian Kiehlmann,
Angelos Kourtidis,
Nikos Mandarakas,
Nikolaos Triantafyllou,
Anna Vervelaki
, et al. (112 additional authors not shown)
Abstract:
The X-ray polarization observations made possible with the Imaging X-ray Polarimetry Explorer (IXPE) offer new ways of probing high-energy emission processes in astrophysical jets from blazars. Here we report on the first X-ray polarization observation of the blazar S4 0954+65 in a high optical and X-ray state. During our multi-wavelength campaign on the source, we detected an optical flare whose…
▽ More
The X-ray polarization observations made possible with the Imaging X-ray Polarimetry Explorer (IXPE) offer new ways of probing high-energy emission processes in astrophysical jets from blazars. Here we report on the first X-ray polarization observation of the blazar S4 0954+65 in a high optical and X-ray state. During our multi-wavelength campaign on the source, we detected an optical flare whose peak coincided with the peak of an X-ray flare. This optical-X-ray flare most likely took place in a feature moving along the parsec-scale jet, imaged at 43 GHz by the Very Long Baseline Array. The 43 GHz polarization angle of the moving component underwent a rotation near the time of the flare. In the optical band, prior to the IXPE observation, we measured the polarization angle to be aligned with the jet axis. In contrast, during the optical flare the optical polarization angle was perpendicular to the jet axis; after the flare, it reverted to being parallel to the jet axis. Due to the smooth behavior of the optical polarization angle during the flare, we favor shocks as the main acceleration mechanism. We also infer that the ambient magnetic field lines in the jet were parallel to the jet position angle. The average degree of optical polarization during the IXPE observation was (14.3$\pm$4.1)%. Despite the flare, we only detected an upper limit of 14% (at 3$σ$ level) on the X-ray polarization degree; although a reasonable assumption on the X-ray polarization angle results in an upper limit of 8.8% ($3σ$). We model the spectral energy distribution (SED) and spectral polarization distribution (SPD) of S4 0954+65 with leptonic (synchrotron self-Compton) and hadronic (proton and pair synchrotron) models. The constraints we obtain with our combined multi-wavelength polarization observations and SED modeling tentatively disfavor hadronic models for the X-ray emission in S4 0954+65.
△ Less
Submitted 10 March, 2025; v1 submitted 25 November, 2024;
originally announced November 2024.
-
Local Learning for Covariate Selection in Nonparametric Causal Effect Estimation with Latent Variables
Authors:
Zheng Li,
Feng Xie,
Xichen Guo,
Yan Zeng,
Hao Zhang,
Zhi Geng
Abstract:
Estimating causal effects from nonexperimental data is a fundamental problem in many fields of science. A key component of this task is selecting an appropriate set of covariates for confounding adjustment to avoid bias. Most existing methods for covariate selection often assume the absence of latent variables and rely on learning the global network structure among variables. However, identifying…
▽ More
Estimating causal effects from nonexperimental data is a fundamental problem in many fields of science. A key component of this task is selecting an appropriate set of covariates for confounding adjustment to avoid bias. Most existing methods for covariate selection often assume the absence of latent variables and rely on learning the global network structure among variables. However, identifying the global structure can be unnecessary and inefficient, especially when our primary interest lies in estimating the effect of a treatment variable on an outcome variable. To address this limitation, we propose a novel local learning approach for covariate selection in nonparametric causal effect estimation, which accounts for the presence of latent variables. Our approach leverages testable independence and dependence relationships among observed variables to identify a valid adjustment set for a target causal relationship, ensuring both soundness and completeness under standard assumptions. We validate the effectiveness of our algorithm through extensive experiments on both synthetic and real-world data.
△ Less
Submitted 19 May, 2025; v1 submitted 25 November, 2024;
originally announced November 2024.
-
Testability of Instrumental Variables in Additive Nonlinear, Non-Constant Effects Models
Authors:
Xichen Guo,
Zheng Li,
Biwei Huang,
Yan Zeng,
Zhi Geng,
Feng Xie
Abstract:
We address the issue of the testability of instrumental variables derived from observational data. Most existing testable implications are centered on scenarios where the treatment is a discrete variable, e.g., instrumental inequality (Pearl, 1995), or where the effect is assumed to be constant, e.g., instrumental variables condition based on the principle of independent mechanisms (Burauel, 2023)…
▽ More
We address the issue of the testability of instrumental variables derived from observational data. Most existing testable implications are centered on scenarios where the treatment is a discrete variable, e.g., instrumental inequality (Pearl, 1995), or where the effect is assumed to be constant, e.g., instrumental variables condition based on the principle of independent mechanisms (Burauel, 2023). However, treatments can often be continuous variables, such as drug dosages or nutritional content levels, and non-constant effects may occur in many real-world scenarios. In this paper, we consider an additive nonlinear, non-constant effects model with unmeasured confounders, in which treatments can be either discrete or continuous, and propose an Auxiliary-based Independence Test (AIT) condition to test whether a variable is a valid instrument. We first show that if the candidate instrument is valid, then the AIT condition holds. Moreover, we illustrate the implications of the AIT condition and demonstrate that, in certain conditions, AIT conditions are necessary and sufficient to detect all invalid IVs. We also extend the AIT condition to include covariates and introduce a practical testing algorithm. Experimental results on both synthetic and three different real-world datasets show the effectiveness of our proposed condition.
△ Less
Submitted 18 November, 2024;
originally announced November 2024.
-
Detecting Multi-Parameter Constraint Inconsistencies in Python Data Science Libraries
Authors:
Xiufeng Xu,
Fuman Xie,
Chenguang Zhu,
Guangdong Bai,
Sarfraz Khurshid,
Yi Li
Abstract:
Modern AI- and Data-intensive software systems rely heavily on data science and machine learning libraries that provide essential algorithmic implementations and computational frameworks. These libraries expose complex APIs whose correct usage has to follow constraints among multiple interdependent parameters. Developers using these APIs are expected to learn about the constraints through the prov…
▽ More
Modern AI- and Data-intensive software systems rely heavily on data science and machine learning libraries that provide essential algorithmic implementations and computational frameworks. These libraries expose complex APIs whose correct usage has to follow constraints among multiple interdependent parameters. Developers using these APIs are expected to learn about the constraints through the provided documentations and any discrepancy may lead to unexpected behaviors. However, maintaining correct and consistent multi-parameter constraints in API documentations remains a significant challenge for API compatibility and reliability. To address this challenge, we propose MPDetector, for detecting inconsistencies between code and documentation, specifically focusing on multi-parameter constraints. MPDetector identifies these constraints at the code level by exploring execution paths through symbolic execution and further extracts corresponding constraints from documentation using large language models (LLMs). We propose a customized fuzzy constraint logic to reconcile the unpredictability of LLM outputs and detects logical inconsistencies between the code and documentation constraints. We collected and constructed two datasets from four popular data science libraries and evaluated MPDetector on them. The results demonstrate that MPDetector can effectively detect inconsistency issues with the precision of 92.8%. We further reported 14 detected inconsistency issues to the library developers, who have confirmed 11 issues at the time of writing.
△ Less
Submitted 19 November, 2024; v1 submitted 18 November, 2024;
originally announced November 2024.
-
Evidence for a shock-compressed magnetic field in the northwestern rim of Vela Jr. from X-ray polarimetry
Authors:
Dmitry A. Prokhorov,
Yi-Jung Yang,
Riccardo Ferrazzoli,
Jacco Vink,
Patrick Slane,
Enrico Costa,
Stefano Silvestri,
Ping Zhou,
Niccolò Bucciantini,
Alessandro Di Marco,
Martin C. Weisskopf,
Luca Baldini,
Victor Doroshenko,
Steven R. Ehlert,
Jeremy Heyl,
Philip Kaaret,
Dawoon E. Kim,
Frédéric Marin,
Tsunefumi Mizuno,
Chi-Yung Ng,
Melissa Pesce-Rollins,
Carmelo Sgrò,
Paolo Soffitta,
Douglas A. Swartz,
Toru Tamagawa
, et al. (75 additional authors not shown)
Abstract:
Synchrotron X-ray emission has been detected from nearly a dozen young supernova remnants (SNRs). X-rays of synchrotron origin exhibit linear polarization in a regular, non-randomly oriented magnetic field. The significant polarized X-ray emission from four such SNRs has already been reported on the basis of observations with the Imaging X-ray Polarimetry Explorer (IXPE). The magnetic-field struct…
▽ More
Synchrotron X-ray emission has been detected from nearly a dozen young supernova remnants (SNRs). X-rays of synchrotron origin exhibit linear polarization in a regular, non-randomly oriented magnetic field. The significant polarized X-ray emission from four such SNRs has already been reported on the basis of observations with the Imaging X-ray Polarimetry Explorer (IXPE). The magnetic-field structure as derived from IXPE observations is radial for Cassiopeia A, Tycho's SNR, and SN 1006, and tangential for RX J1713.7-3946. The latter together with the recent detection of a tangential magnetic field in SNR 1E 0102.2-7219 by the Australia Telescope Compact Array in the radio band shows that tangential magnetic fields can also be present in young SNRs. Thus, the dichotomy in polarization between young and middle-aged SNRs (radial magnetic fields in young SNRs, but tangential magnetic fields in middle-aged SNRs), previously noticed in the radio band, deserves additional attention. The present analysis of IXPE observations determines, for the first time, a magnetic-field structure in the northwestern rim of Vela Jr, also known as RX J0852.0-4622, and provides a new example of a young SNR with a tangential magnetic field.
△ Less
Submitted 27 October, 2024;
originally announced October 2024.
-
A Two-Week $IXPE$ Monitoring Campaign on Mrk 421
Authors:
W. Peter Maksym,
Ioannis Liodakis,
M. Lynne Saade,
Dawoon E. Kim,
Riccardo Middei,
Laura Di Gesu,
Sebastian Kiehlmann,
Gabriele Matzeu,
Iván Agudo,
Alan P. Marscher,
Steven R. Ehlert,
Svetlana G. Jorstad,
Philip Kaaret,
Herman L. Marshall,
Luigi Pacciani,
Matteo Perri,
Simonetta Puccetti,
Pouya M. Kouch,
Elina Lindfors,
Francisco José Aceituno,
Giacomo Bonnoli,
Víctor Casanova,
Juan Escudero,
Beatriz Agís-González,
César Husillos
, et al. (131 additional authors not shown)
Abstract:
X-ray polarization is a unique new probe of the particle acceleration in astrophysical jets made possible through the Imaging X-ray Polarimetry Explorer. Here we report on the first dense X-ray polarization monitoring campaign on the blazar Mrk 421. Our observations were accompanied by an even denser radio and optical polarization campaign. We find significant short-timescale variability in both X…
▽ More
X-ray polarization is a unique new probe of the particle acceleration in astrophysical jets made possible through the Imaging X-ray Polarimetry Explorer. Here we report on the first dense X-ray polarization monitoring campaign on the blazar Mrk 421. Our observations were accompanied by an even denser radio and optical polarization campaign. We find significant short-timescale variability in both X-ray polarization degree and angle, including a $\sim90^\circ$ angle rotation about the jet axis. We attribute this to random variations of the magnetic field, consistent with the presence of turbulence but also unlikely to be explained by turbulence alone. At the same time, the degree of lower-energy polarization is significantly lower and shows no more than mild variability. Our campaign provides further evidence for a scenario in which energy-stratified shock-acceleration of relativistic electrons, combined with a turbulent magnetic field, is responsible for optical to X-ray synchrotron emission in blazar jets.
△ Less
Submitted 25 October, 2024;
originally announced October 2024.
-
From Real Artifacts to Virtual Reference: A Robust Framework for Translating Endoscopic Images
Authors:
Junyang Wu,
Fangfang Xie,
Jiayuan Sun,
Yun Gu,
Guang-Zhong Yang
Abstract:
Domain adaptation, which bridges the distributions across different modalities, plays a crucial role in multimodal medical image analysis. In endoscopic imaging, combining pre-operative data with intra-operative imaging is important for surgical planning and navigation. However, existing domain adaptation methods are hampered by distribution shift caused by in vivo artifacts, necessitating robust…
▽ More
Domain adaptation, which bridges the distributions across different modalities, plays a crucial role in multimodal medical image analysis. In endoscopic imaging, combining pre-operative data with intra-operative imaging is important for surgical planning and navigation. However, existing domain adaptation methods are hampered by distribution shift caused by in vivo artifacts, necessitating robust techniques for aligning noisy and artifact abundant patient endoscopic videos with clean virtual images reconstructed from pre-operative tomographic data for pose estimation during intraoperative guidance. This paper presents an artifact-resilient image translation method and an associated benchmark for this purpose. The method incorporates a novel ``local-global'' translation framework and a noise-resilient feature extraction strategy. For the former, it decouples the image translation process into a local step for feature denoising, and a global step for global style transfer. For feature extraction, a new contrastive learning strategy is proposed, which can extract noise-resilient features for establishing robust correspondence across domains. Detailed validation on both public and in-house clinical datasets has been conducted, demonstrating significantly improved performance compared to the current state-of-the-art.
△ Less
Submitted 23 October, 2024; v1 submitted 14 October, 2024;
originally announced October 2024.
-
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model
Authors:
Fei Xie,
Weijia Zhang,
Zhongdao Wang,
Chao Ma
Abstract:
Recent advancements in State Space Models, notably Mamba, have demonstrated superior performance over the dominant Transformer models, particularly in reducing the computational complexity from quadratic to linear. Yet, difficulties in adapting Mamba from language to vision tasks arise due to the distinct characteristics of visual data, such as the spatial locality and adjacency within images and…
▽ More
Recent advancements in State Space Models, notably Mamba, have demonstrated superior performance over the dominant Transformer models, particularly in reducing the computational complexity from quadratic to linear. Yet, difficulties in adapting Mamba from language to vision tasks arise due to the distinct characteristics of visual data, such as the spatial locality and adjacency within images and large variations in information granularity across visual tokens. Existing vision Mamba approaches either flatten tokens into sequences in a raster scan fashion, which breaks the local adjacency of images, or manually partition tokens into windows, which limits their long-range modeling and generalization capabilities. To address these limitations, we present a new vision Mamba model, coined QuadMamba, that effectively captures local dependencies of varying granularities via quadtree-based image partition and scan. Concretely, our lightweight quadtree-based scan module learns to preserve the 2D locality of spatial regions within learned window quadrants. The module estimates the locality score of each token from their features, before adaptively partitioning tokens into window quadrants. An omnidirectional window shifting scheme is also introduced to capture more intact and informative features across different local regions. To make the discretized quadtree partition end-to-end trainable, we further devise a sequence masking strategy based on Gumbel-Softmax and its straight-through gradient estimator. Extensive experiments demonstrate that QuadMamba achieves state-of-the-art performance in various vision tasks, including image classification, object detection, instance segmentation, and semantic segmentation. The code is in https://github.com/VISION-SJTU/QuadMamba.
△ Less
Submitted 10 October, 2024; v1 submitted 9 October, 2024;
originally announced October 2024.
-
Meta-Learning Augmented MPC for Disturbance-Aware Motion Planning and Control of Quadrotors
Authors:
Dženan Lapandić,
Fengze Xie,
Christos K. Verginis,
Soon-Jo Chung,
Dimos V. Dimarogonas,
Bo Wahlberg
Abstract:
A major challenge in autonomous flights is unknown disturbances, which can jeopardize safety and lead to collisions, especially in obstacle-rich environments. This paper presents a disturbance-aware motion planning and control framework designed for autonomous aerial flights. The framework is composed of two key components: a disturbance-aware motion planner and a tracking controller. The disturba…
▽ More
A major challenge in autonomous flights is unknown disturbances, which can jeopardize safety and lead to collisions, especially in obstacle-rich environments. This paper presents a disturbance-aware motion planning and control framework designed for autonomous aerial flights. The framework is composed of two key components: a disturbance-aware motion planner and a tracking controller. The disturbance-aware motion planner consists of a predictive control scheme and a learned model of disturbances that is adapted online. The tracking controller is designed using contraction control methods to provide safety bounds on the quadrotor behaviour in the vicinity of the obstacles with respect to the disturbance-aware motion plan. Finally, the algorithm is tested in simulation scenarios with a quadrotor facing strong crosswind and ground-induced disturbances.
△ Less
Submitted 16 December, 2024; v1 submitted 8 October, 2024;
originally announced October 2024.
-
Persistent flat band splitting and strong selective band renormalization in a kagome magnet thin film
Authors:
Zheng Ren,
Jianwei Huang,
Hengxin Tan,
Ananya Biswas,
Aki Pulkkinen,
Yichen Zhang,
Yaofeng Xie,
Ziqin Yue,
Lei Chen,
Fang Xie,
Kevin Allen,
Han Wu,
Qirui Ren,
Anil Rajapitamahuni,
Asish Kundu,
Elio Vescovo,
Junichiro Kono,
Emilia Morosan,
Pengcheng Dai,
Jian-Xin Zhu,
Qimiao Si,
Ján Minár,
Binghai Yan,
Ming Yi
Abstract:
Magnetic kagome materials provide a fascinating playground for exploring the interplay of magnetism, correlation and topology. Many magnetic kagome systems have been reported including the binary FemXn (X=Sn, Ge; m:n = 3:1, 3:2, 1:1) family and the rare earth RMn6Sn6 (R = rare earth) family, where their kagome flat bands are calculated to be near the Fermi level in the paramagnetic phase. While pa…
▽ More
Magnetic kagome materials provide a fascinating playground for exploring the interplay of magnetism, correlation and topology. Many magnetic kagome systems have been reported including the binary FemXn (X=Sn, Ge; m:n = 3:1, 3:2, 1:1) family and the rare earth RMn6Sn6 (R = rare earth) family, where their kagome flat bands are calculated to be near the Fermi level in the paramagnetic phase. While partially filling a kagome flat band is predicted to give rise to a Stoner-type ferromagnetism, experimental visualization of the magnetic splitting across the ordering temperature has not been reported for any of these systems due to the high ordering temperatures, hence leaving the nature of magnetism in kagome magnets an open question. Here, we probe the electronic structure with angle-resolved photoemission spectroscopy in a kagome magnet thin film FeSn synthesized using molecular beam epitaxy. We identify the exchange-split kagome flat bands, whose splitting persists above the magnetic ordering temperature, indicative of a local moment picture. Such local moments in the presence of the topological flat band are consistent with the compact molecular orbitals predicted in theory. We further observe a large spin-orbital selective band renormalization in the Fe d_xy+d_(x^2-y^2 ) spin majority channel reminiscent of the orbital selective correlation effects in the iron-based superconductors. Our discovery of the coexistence of local moments with topological flat bands in a kagome system echoes similar findings in magic-angle twisted bilayer graphene, and provides a basis for theoretical effort towards modeling correlation effects in magnetic flat band systems.
△ Less
Submitted 8 October, 2024;
originally announced October 2024.
-
Comparison between the emission torus and the measured toroidal magnetic field for the Crab and Vela nebula
Authors:
Wei Deng,
Fei Xie,
Kuan Liu,
Mingyu Ge,
Youli Tuo,
Fabio La Monaca,
Alessandro Di Marco,
En-wei Liang
Abstract:
Polarization measurements provide insight into the magnetic field, a critical aspect of the dynamics and emission properties around the compact object. In this paper, we present the polarized magnetic field of the Crab outer torus and the Vela arc utilizing Imaging X-ray Polarimetry Explorer observation data. The polarization angle (PA) measured for the Crab outer torus exhibits two monotonic evol…
▽ More
Polarization measurements provide insight into the magnetic field, a critical aspect of the dynamics and emission properties around the compact object. In this paper, we present the polarized magnetic field of the Crab outer torus and the Vela arc utilizing Imaging X-ray Polarimetry Explorer observation data. The polarization angle (PA) measured for the Crab outer torus exhibits two monotonic evolutions along the azimuth angle, which are consistent with the normal line of the elliptical ring. There is a slight increase in PA along the azimuth angle for both the inner arc and the outer arc of the Vela nebula. The polarized magnetic vector along the outer torus of the Crab nebula shows the polarized magnetic field aligns with Crab outer torus structure. The PA variation along the Crab outer torus suggests a bulk flow speed of 0.8c. Meanwhile, the Vela nebula polarized magnetic field does not exactly align with the Vela arc structure. We noted that the Crab nebula possesses a polarized toroidal magnetic field, where as the Vela nebula exhibits an incomplete toroidal magnetic field.
△ Less
Submitted 7 October, 2024;
originally announced October 2024.
-
Video Prediction Transformers without Recurrence or Convolution
Authors:
Yujin Tang,
Lu Qi,
Fei Xie,
Xiangtai Li,
Chao Ma,
Ming-Hsuan Yang
Abstract:
Video prediction has witnessed the emergence of RNN-based models led by ConvLSTM, and CNN-based models led by SimVP. Following the significant success of ViT, recent works have integrated ViT into both RNN and CNN frameworks, achieving improved performance. While we appreciate these prior approaches, we raise a fundamental question: Is there a simpler yet more effective solution that can eliminate…
▽ More
Video prediction has witnessed the emergence of RNN-based models led by ConvLSTM, and CNN-based models led by SimVP. Following the significant success of ViT, recent works have integrated ViT into both RNN and CNN frameworks, achieving improved performance. While we appreciate these prior approaches, we raise a fundamental question: Is there a simpler yet more effective solution that can eliminate the high computational cost of RNNs while addressing the limited receptive fields and poor generalization of CNNs? How far can it go with a simple pure transformer model for video prediction? In this paper, we propose PredFormer, a framework entirely based on Gated Transformers. We provide a comprehensive analysis of 3D Attention in the context of video prediction. Extensive experiments demonstrate that PredFormer delivers state-of-the-art performance across four standard benchmarks. The significant improvements in both accuracy and efficiency highlight the potential of PredFormer as a strong baseline for real-world video prediction applications. The source code and trained models will be released at https://github.com/yyyujintang/PredFormer.
△ Less
Submitted 30 March, 2025; v1 submitted 6 October, 2024;
originally announced October 2024.
-
X-ray spectro-polarimetric characterization of GX 340+0 in the horizontal branch: a highly inclined source?
Authors:
Fabio La Monaca,
Alessandro Di Marco,
Renee M. Ludlam,
Anna Bobrikova,
Juri Poutanen,
Songwei Li,
Fei Xie
Abstract:
We report the first detection of X-ray polarization in the horizontal branch for GX 340+0 as obtained by Imaging X-ray Polarimetry Explorer (IXPE). A polarization degree of 4.3%$\pm$0.3% is obtained. This value is in agreement with the previous polarization measurements of Z-sources in the horizontal branch. Spectro-polarimetric analysis, performed using a broad-band spectral model obtained by NIC…
▽ More
We report the first detection of X-ray polarization in the horizontal branch for GX 340+0 as obtained by Imaging X-ray Polarimetry Explorer (IXPE). A polarization degree of 4.3%$\pm$0.3% is obtained. This value is in agreement with the previous polarization measurements of Z-sources in the horizontal branch. Spectro-polarimetric analysis, performed using a broad-band spectral model obtained by NICER and NuSTAR quasi-simultaneous observations, allowed us to constrain the polarization for the soft and hard spectral components typical to these sources. The polarization angle for the two components differs by ${\sim}40°$. This result could be explained by a misalignment of the NS rotations axis with respect to the accretion disk axis. We provide a comparison of the results with polarization expected in different models. Theoretical expectations for the polarization of the disk and the Comptonization components favor an orbital inclination for GX 340+0 higher than 60°, as expected for Cyg-like sources, in contrast with results we report for the reflection component using broad-band spectrum.
△ Less
Submitted 20 November, 2024; v1 submitted 1 October, 2024;
originally announced October 2024.
-
Speaking from Coarse to Fine: Improving Neural Codec Language Model via Multi-Scale Speech Coding and Generation
Authors:
Haohan Guo,
Fenglong Xie,
Dongchao Yang,
Xixin Wu,
Helen Meng
Abstract:
The neural codec language model (CLM) has demonstrated remarkable performance in text-to-speech (TTS) synthesis. However, troubled by ``recency bias", CLM lacks sufficient attention to coarse-grained information at a higher temporal scale, often producing unnatural or even unintelligible speech. This work proposes CoFi-Speech, a coarse-to-fine CLM-TTS approach, employing multi-scale speech coding…
▽ More
The neural codec language model (CLM) has demonstrated remarkable performance in text-to-speech (TTS) synthesis. However, troubled by ``recency bias", CLM lacks sufficient attention to coarse-grained information at a higher temporal scale, often producing unnatural or even unintelligible speech. This work proposes CoFi-Speech, a coarse-to-fine CLM-TTS approach, employing multi-scale speech coding and generation to address this issue. We train a multi-scale neural codec, CoFi-Codec, to encode speech into a multi-scale discrete representation, comprising multiple token sequences with different time resolutions. Then, we propose CoFi-LM that can generate this representation in two modes: the single-LM-based chain-of-scale generation and the multiple-LM-based stack-of-scale generation. In experiments, CoFi-Speech significantly outperforms single-scale baseline systems on naturalness and speaker similarity in zero-shot TTS. The analysis of multi-scale coding demonstrates the effectiveness of CoFi-Codec in learning multi-scale discrete speech representations while keeping high-quality speech reconstruction. The coarse-to-fine multi-scale generation, especially for the stack-of-scale approach, is also validated as a crucial approach in pursuing a high-quality neural codec language model for TTS.
△ Less
Submitted 17 September, 2024;
originally announced September 2024.
-
Intelligent LiDAR Navigation: Leveraging External Information and Semantic Maps with LLM as Copilot
Authors:
Fujing Xie,
Jiajie Zhang,
Sören Schwertfeger
Abstract:
Traditional robot navigation systems primarily utilize occupancy grid maps and laser-based sensing technologies, as demonstrated by the popular move_base package in ROS. Unlike robots, humans navigate not only through spatial awareness and physical distances but also by integrating external information, such as elevator maintenance updates from public notification boards and experiential knowledge…
▽ More
Traditional robot navigation systems primarily utilize occupancy grid maps and laser-based sensing technologies, as demonstrated by the popular move_base package in ROS. Unlike robots, humans navigate not only through spatial awareness and physical distances but also by integrating external information, such as elevator maintenance updates from public notification boards and experiential knowledge, like the need for special access through certain doors. With the development of Large Language Models (LLMs), which possesses text understanding and intelligence close to human performance, there is now an opportunity to infuse robot navigation systems with a level of understanding akin to human cognition. In this study, we propose using osmAG (Area Graph in OpensStreetMap textual format), an innovative semantic topometric hierarchical map representation, to bridge the gap between the capabilities of ROS move_base and the contextual understanding offered by LLMs. Our methodology employs LLMs as an actual copilot in robot navigation, enabling the integration of a broader range of informational inputs while maintaining the robustness of traditional robotic navigation systems. Our code, demo, map, experiment results can be accessed at https://github.com/xiexiexiaoxiexie/Intelligent-LiDAR-Navigation-LLM-as-Copilot.
△ Less
Submitted 23 March, 2025; v1 submitted 12 September, 2024;
originally announced September 2024.
-
Contrastive Learning-based User Identification with Limited Data on Smart Textiles
Authors:
Yunkang Zhang,
Ziyu Wu,
Zhen Liang,
Fangting Xie,
Quan Wan,
Mingjie Zhao,
Xiaohui Cai
Abstract:
Pressure-sensitive smart textiles are widely applied in the fields of healthcare, sports monitoring, and intelligent homes. The integration of devices embedded with pressure sensing arrays is expected to enable comprehensive scene coverage and multi-device integration. However, the implementation of identity recognition, a fundamental function in this context, relies on extensive device-specific d…
▽ More
Pressure-sensitive smart textiles are widely applied in the fields of healthcare, sports monitoring, and intelligent homes. The integration of devices embedded with pressure sensing arrays is expected to enable comprehensive scene coverage and multi-device integration. However, the implementation of identity recognition, a fundamental function in this context, relies on extensive device-specific datasets due to variations in pressure distribution across different devices. To address this challenge, we propose a novel user identification method based on contrastive learning. We design two parallel branches to facilitate user identification on both new and existing devices respectively, employing supervised contrastive learning in the feature space to promote domain unification. When encountering new devices, extensive data collection efforts are not required; instead, user identification can be achieved using limited data consisting of only a few simple postures. Through experimentation with two 8-subject pressure datasets (BedPressure and ChrPressure), our proposed method demonstrates the capability to achieve user identification across 12 sitting scenarios using only a dataset containing 2 postures. Our average recognition accuracy reaches 79.05%, representing an improvement of 2.62% over the best baseline model.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.