-
Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting
Authors:
Tianjiao Yu,
Vedant Shah,
Muntasir Wahed,
Ying Shen,
Kiet A. Nguyen,
Ismini Lourentzou
Abstract:
Articulated objects are common in the real world, yet modeling their structure and motion remains a challenging task for 3D reconstruction methods. In this work, we introduce Part$^{2}$GS, a novel framework for modeling articulated digital twins of multi-part objects with high-fidelity geometry and physically consistent articulation. Part$^{2}$GS leverages a part-aware 3D Gaussian representation t…
▽ More
Articulated objects are common in the real world, yet modeling their structure and motion remains a challenging task for 3D reconstruction methods. In this work, we introduce Part$^{2}$GS, a novel framework for modeling articulated digital twins of multi-part objects with high-fidelity geometry and physically consistent articulation. Part$^{2}$GS leverages a part-aware 3D Gaussian representation that encodes articulated components with learnable attributes, enabling structured, disentangled transformations that preserve high-fidelity geometry. To ensure physically consistent motion, we propose a motion-aware canonical representation guided by physics-based constraints, including contact enforcement, velocity consistency, and vector-field alignment. Furthermore, we introduce a field of repel points to prevent part collisions and maintain stable articulation paths, significantly improving motion coherence over baselines. Extensive evaluations on both synthetic and real-world datasets show that Part$^{2}$GS consistently outperforms state-of-the-art methods by up to 10$\times$ in Chamfer Distance for movable parts.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Lower Bounds against the Ideal Proof System in Finite Fields
Authors:
Tal Elbaz,
Nashlen Govindasamy,
Jiaqi Lu,
Iddo Tzameret
Abstract:
Lower bounds against strong algebraic proof systems and specifically fragments of the Ideal Proof System (IPS), have been obtained in an ongoing line of work. All of these bounds, however, are proved only over large (or characteristic $0$) fields, yet finite fields are the more natural setting for propositional proof complexity, especially for progress toward lower bounds for Frege systems such as…
▽ More
Lower bounds against strong algebraic proof systems and specifically fragments of the Ideal Proof System (IPS), have been obtained in an ongoing line of work. All of these bounds, however, are proved only over large (or characteristic $0$) fields, yet finite fields are the more natural setting for propositional proof complexity, especially for progress toward lower bounds for Frege systems such as $AC^0[p]$-Frege. This work establishes lower bounds against fragments of IPS over fixed finite fields. Specifically, we show that a variant of the knapsack instance studied by Govindasamy, Hakoniemi, and Tzameret (FOCS'22) has no polynomial-size IPS refutation over finite fields when the refutation is multilinear and written as a constant-depth circuit. The key ingredient of our argument is the recent set-multilinearization result of Forbes (CCC'24), which extends the earlier result of Limaye, Srinivasan, and Tavenas (FOCS'21) to all fields, and an extension of the techniques of Govindasamy, Hakoniemi, and Tzameret to finite fields. We also separate this proof system from the one studied by Govindasamy, Hakoniemi, and Tzameret.
In addition, we present new lower bounds for read-once algebraic branching program refutations, roABP-IPS, in finite fields, extending results of Forbes, Shpilka, Tzameret, and Wigderson (Theor. of Comput.'21) and Hakoniemi, Limaye, and Tzameret (STOC'24).
Finally, we show that any lower bound against any proof system at least as strong as (non-multilinear) constant-depth IPS over finite fields for any instance, even a purely algebraic instance (i.e., not a translation of a Boolean formula or CNF), implies a hard CNF formula for the respective IPS fragment, and hence an $AC^0[p]$-Frege lower bound by known simulations over finite fields (Grochow and Pitassi (J. ACM'18)).
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
Authors:
Teng Li,
Quanfeng Lu,
Lirui Zhao,
Hao Li,
Xizhou Zhu,
Yu Qiao,
Jun Zhang,
Wenqi Shao
Abstract:
Unified image understanding and generation has emerged as a promising paradigm in multimodal artificial intelligence. Despite recent progress, the optimal architectural design for such unified models remains an open challenge. In this work, we start by analyzing the modality alignment behaviors of task-specific expert models for understanding and generation, as well as current unified models. Our…
▽ More
Unified image understanding and generation has emerged as a promising paradigm in multimodal artificial intelligence. Despite recent progress, the optimal architectural design for such unified models remains an open challenge. In this work, we start by analyzing the modality alignment behaviors of task-specific expert models for understanding and generation, as well as current unified models. Our analysis reveals a crucial observation: understanding tasks benefit from a progressively increasing modality alignment across network depth, which helps build up semantic information for better comprehension; In contrast, generation tasks follow a different trend: modality alignment increases in the early layers but decreases in the deep layers to recover spatial details. These divergent alignment patterns create a fundamental conflict in fully shared Transformer backbones, where a uniform representational flow often leads to performance compromises across two tasks. Motivated by this finding, we introduce UniFork, a novel Y-shaped architecture that shares the shallow layers for cross-task representation learning, while employing task-specific branches in deeper layers to avoid task interference. This design effectively balances shared learning and task specialization. Through extensive ablation experiments, we demonstrate that Unifork consistently outperforms conventional fully shared Transformer architectures, and achieves performance on par with or better than task-specific models.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
Authors:
Jiaqi Li,
Junshu Tang,
Zhiyong Xu,
Longhuang Wu,
Yuan Zhou,
Shuai Shao,
Tianbao Yu,
Zhiguo Cao,
Qinglin Lu
Abstract:
Recent advances in diffusion-based and controllable video generation have enabled high-quality and temporally coherent video synthesis, laying the groundwork for immersive interactive gaming experiences. However, current methods face limitations in dynamics, generality, long-term consistency, and efficiency, which limit the ability to create various gameplay videos. To address these gaps, we intro…
▽ More
Recent advances in diffusion-based and controllable video generation have enabled high-quality and temporally coherent video synthesis, laying the groundwork for immersive interactive gaming experiences. However, current methods face limitations in dynamics, generality, long-term consistency, and efficiency, which limit the ability to create various gameplay videos. To address these gaps, we introduce Hunyuan-GameCraft, a novel framework for high-dynamic interactive video generation in game environments. To achieve fine-grained action control, we unify standard keyboard and mouse inputs into a shared camera representation space, facilitating smooth interpolation between various camera and movement operations. Then we propose a hybrid history-conditioned training strategy that extends video sequences autoregressively while preserving game scene information. Additionally, to enhance inference efficiency and playability, we achieve model distillation to reduce computational overhead while maintaining consistency across long temporal sequences, making it suitable for real-time deployment in complex interactive environments. The model is trained on a large-scale dataset comprising over one million gameplay recordings across over 100 AAA games, ensuring broad coverage and diversity, then fine-tuned on a carefully annotated synthetic dataset to enhance precision and control. The curated game scene data significantly improves the visual fidelity, realism and action controllability. Extensive experiments demonstrate that Hunyuan-GameCraft significantly outperforms existing models, advancing the realism and playability of interactive game video generation.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Operation and performance of the CMS silicon strip tracker with proton-proton collisions at the CERN LHC
Authors:
CMS Collaboration
Abstract:
Salient aspects of the commissioning, calibration, and performance of the CMS silicon strip tracker are discussed, drawing on experience during operation with proton-proton collisions delivered by the CERN LHC. The data were obtained with a variety of luminosities. The operating temperature of the strip tracker was changed several times during this period and results are shown as a function of tem…
▽ More
Salient aspects of the commissioning, calibration, and performance of the CMS silicon strip tracker are discussed, drawing on experience during operation with proton-proton collisions delivered by the CERN LHC. The data were obtained with a variety of luminosities. The operating temperature of the strip tracker was changed several times during this period and results are shown as a function of temperature in several cases. Details of the system performance are presented, including occupancy, signal-to-noise ratio, Lorentz angle, and single-hit spatial resolution. Saturation effects in the APV25 readout chip preamplifier observed during early Run 2 are presented, showing the effect on various observables and the subsequent remedy. Studies of radiation effects on the strip tracker are presented both for the optical readout links and the silicon sensors. The observed effects are compared to simulation, where available, and they generally agree well with expectations.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Tensor network calculation of boundary and corner magnetization
Authors:
Roman Krcmar,
Jozef Genzor,
Andrej Gendiar,
Tomotoshi Nishino
Abstract:
The Corner Transfer Matrix Renormalization Group (CTMRG) algorithm is modified to measure the magnetization at the boundary of the system, including the corners of the square-shaped lattice. Using automatic differentiation, we calculate the magnetization's first derivative, allowing us to determine the boundary critical exponent $β$ accurately.
The Corner Transfer Matrix Renormalization Group (CTMRG) algorithm is modified to measure the magnetization at the boundary of the system, including the corners of the square-shaped lattice. Using automatic differentiation, we calculate the magnetization's first derivative, allowing us to determine the boundary critical exponent $β$ accurately.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Facial Landmark Visualization and Emotion Recognition Through Neural Networks
Authors:
Israel Juárez-Jiménez,
Tiffany Guadalupe Martínez Paredes,
Jesús García-Ramírez,
Eric Ramos Aguilar
Abstract:
Emotion recognition from facial images is a crucial task in human-computer interaction, enabling machines to learn human emotions through facial expressions. Previous studies have shown that facial images can be used to train deep learning models; however, most of these studies do not include a through dataset analysis. Visualizing facial landmarks can be challenging when extracting meaningful dat…
▽ More
Emotion recognition from facial images is a crucial task in human-computer interaction, enabling machines to learn human emotions through facial expressions. Previous studies have shown that facial images can be used to train deep learning models; however, most of these studies do not include a through dataset analysis. Visualizing facial landmarks can be challenging when extracting meaningful dataset insights; to address this issue, we propose facial landmark box plots, a visualization technique designed to identify outliers in facial datasets. Additionally, we compare two sets of facial landmark features: (i) the landmarks' absolute positions and (ii) their displacements from a neutral expression to the peak of an emotional expression. Our results indicate that a neural network achieves better performance than a random forest classifier.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset
Authors:
Rachel Hong,
Jevan Hutson,
William Agnew,
Imaad Huda,
Tadayoshi Kohno,
Jamie Morgenstern
Abstract:
We investigate the contents of web-scraped data for training AI systems, at sizes where human dataset curators and compilers no longer manually annotate every sample. Building off of prior privacy concerns in machine learning models, we ask: What are the legal privacy implications of web-scraped machine learning datasets? In an empirical study of a popular training dataset, we find significant pre…
▽ More
We investigate the contents of web-scraped data for training AI systems, at sizes where human dataset curators and compilers no longer manually annotate every sample. Building off of prior privacy concerns in machine learning models, we ask: What are the legal privacy implications of web-scraped machine learning datasets? In an empirical study of a popular training dataset, we find significant presence of personally identifiable information despite sanitization efforts. Our audit provides concrete evidence to support the concern that any large-scale web-scraped dataset may contain personal data. We use these findings of a real-world dataset to inform our legal analysis with respect to existing privacy and data protection laws. We surface various privacy risks of current data curation practices that may propagate personal information to downstream models. From our findings, we argue for reorientation of current frameworks of "publicly available" information to meaningfully limit the development of AI built upon indiscriminate scraping of the internet.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
The effect of target orientation on the mean first passage time of a Brownian particle to a small elliptical absorber
Authors:
Sanchita Chakraborty,
Theodore Kolokolnikov,
Alan E. Lindsay
Abstract:
We develop a high order asymptotic expansion for the mean first passage time (MFPT) of the capture of Brownian particles by a small elliptical trap in a bounded two dimensional region. This new result describes the effect that trap orientation plays on the capture rate and extends existing results that give information only on the role of trap position on the capture rate. Our results are validate…
▽ More
We develop a high order asymptotic expansion for the mean first passage time (MFPT) of the capture of Brownian particles by a small elliptical trap in a bounded two dimensional region. This new result describes the effect that trap orientation plays on the capture rate and extends existing results that give information only on the role of trap position on the capture rate. Our results are validated against numerical simulations which confirm the accuracy of the asymptotic approximation. In the case of the unit disk domain, we identify a bifurcation such that the high order correction to the global MFPT (GMFPT) is minimized when the trap is orientated in the radial direction for traps centered at $0<r<r_c :=\sqrt{2-\sqrt{2}}$. When centered at position $r_c<r<1$, the GMFPT correction is minimized by orientating the trap in the angular direction. In the scenario of a general two-dimensional geometry, we identify the orientation that minimizes the GMFPT in terms of the regular part of the Neumann Green's function. This theory is demonstrated on several regular domains such as disks, ellipses and rectangles.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Feedback cooling scheme for an optically levitated oscillator with controlled cross-talk
Authors:
J. M. H. Gosling,
A. Pontin,
F. Alder,
M. Rademacher,
T. S. Monteiro,
P. F. Barker
Abstract:
Levitated optical mechanical systems have demonstrated excellent force and impulse sensitivity and are currently being developed for the creation of non-classical states of motion in these new quantum systems. An important requirement in the design of these systems is the ability to independently control and cool all three translational degrees of freedom. Here we describe the design and implement…
▽ More
Levitated optical mechanical systems have demonstrated excellent force and impulse sensitivity and are currently being developed for the creation of non-classical states of motion in these new quantum systems. An important requirement in the design of these systems is the ability to independently control and cool all three translational degrees of freedom. Here we describe the design and implementation of a stable and robust 3D velocity feedback cooling scheme with particular emphasis on creating minimal cross-talk between the independent oscillatory modes when cooling.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Structured Harmonic Generation via Geometric Phase Enabled Pump Shaping
Authors:
Ting-Ting Liu,
Shi-Hui Ding,
Chun-Yu Li,
Hui Liu,
Zhi-Han Zhu,
Peng Chen,
Yan-Qing Lu
Abstract:
Nonlinear optics is crucial for shaping the spatial structure of shortwave light and its interactions with matter, but achieving this through simple harmonic generation with a single pump is challenging. This study demonstrates nonlinear spin-orbit conversion using spin-dependent pump shaping via geometric phase, allowing the direct creation of desired structured harmonic waves from a Gaussian pum…
▽ More
Nonlinear optics is crucial for shaping the spatial structure of shortwave light and its interactions with matter, but achieving this through simple harmonic generation with a single pump is challenging. This study demonstrates nonlinear spin-orbit conversion using spin-dependent pump shaping via geometric phase, allowing the direct creation of desired structured harmonic waves from a Gaussian pump beam. By using the liquid-crystal flat optical elements fabricated with photoalignment, we experimentally produce higher-order cylindrically vectorial modes in second harmonic fields. We examine the vectorial spatial wavefunctions, their propagation invariance, and nonlinear spin-orbit conversion. Our results provide an efficient method for full structuring nonlinear light in broader harmonic systems, with significant applications in laser micromachining and high-energy physics.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Higher dimensional Sacks-Uhlenbeck-type functionals and applications
Authors:
Gianmichele Di Matteo,
Tobias Lamm
Abstract:
In this work, we generalize Sacks-Uhlenbeck's existence result for harmonic spheres, constructing for $n \ge 2$, regular, non-trivial, $n$-harmonic $n$-spheres into suitable target manifolds. We obtain an infinite family of new null-homotopic such maps. The proof follows a similar perturbative argument, which in high dimensions leads to a degenerate and double-phase-type Euler-Lagrange system, mak…
▽ More
In this work, we generalize Sacks-Uhlenbeck's existence result for harmonic spheres, constructing for $n \ge 2$, regular, non-trivial, $n$-harmonic $n$-spheres into suitable target manifolds. We obtain an infinite family of new null-homotopic such maps. The proof follows a similar perturbative argument, which in high dimensions leads to a degenerate and double-phase-type Euler-Lagrange system, making the uniform regularity needed to formalize the bubbling harder to achieve. Then, we develop a refined neck-analysis leading to an energy identity along the approximation, assuming a suitable Struwe-type entropy bound along a sequence of critical points. Finally, we combine these results to solve quite general min-max problems for the $n$-energy modulo bubbling.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Proportional Sensitivity in Generative Adversarial Network (GAN)-Augmented Brain Tumor Classification Using Convolutional Neural Network
Authors:
Mahin Montasir Afif,
Abdullah Al Noman,
K. M. Tahsin Kabir,
Md. Mortuza Ahmmed,
Md. Mostafizur Rahman,
Mufti Mahmud,
Md. Ashraful Babu
Abstract:
Generative Adversarial Networks (GAN) have shown potential in expanding limited medical imaging datasets. This study explores how different ratios of GAN-generated and real brain tumor MRI images impact the performance of a CNN in classifying healthy vs. tumorous scans. A DCGAN was used to create synthetic images which were mixed with real ones at various ratios to train a custom CNN. The CNN was…
▽ More
Generative Adversarial Networks (GAN) have shown potential in expanding limited medical imaging datasets. This study explores how different ratios of GAN-generated and real brain tumor MRI images impact the performance of a CNN in classifying healthy vs. tumorous scans. A DCGAN was used to create synthetic images which were mixed with real ones at various ratios to train a custom CNN. The CNN was then evaluated on a separate real-world test set. Our results indicate that the model maintains high sensitivity and precision in tumor classification, even when trained predominantly on synthetic data. When only a small portion of GAN data was added, such as 900 real images and 100 GAN images, the model achieved excellent performance, with test accuracy reaching 95.2%, and precision, recall, and F1-score all exceeding 95%. However, as the proportion of GAN images increased further, performance gradually declined. This study suggests that while GANs are useful for augmenting limited datasets especially when real data is scarce, too much synthetic data can introduce artifacts that affect the model's ability to generalize to real world cases.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
$^{50}$Cr and $^{53}$Cr neutron capture cross sections measurement at the n_TOF facility at CERN
Authors:
P. Pérez-Maroto,
C. Guerrero,
A. Casanovas,
B. Fernández,
E. Mendoza,
V. Alcayne,
J. Lerendegui-Marco,
C. Domingo-Pardo,
J. M. Quesada,
R. Capote,
the n_TOF Collaboration
Abstract:
$^{50,53}$Cr are very relevant in criticality safety benchmarks related to nuclear reactors. The discrepancies between the neutron capture cross section evaluations have an important effect on the $k_{eff}$ and $k_{\infty}$ in criticality benchmarks particularly sensitive to chromium. The $^{50,53}$Cr(n,$γ…
▽ More
$^{50,53}$Cr are very relevant in criticality safety benchmarks related to nuclear reactors. The discrepancies between the neutron capture cross section evaluations have an important effect on the $k_{eff}$ and $k_{\infty}$ in criticality benchmarks particularly sensitive to chromium. The $^{50,53}$Cr(n,$γ$) cross sections is to be determined between 1 and 100 keV with an 8-10% accuracy following the requirements of the NEA High Priority Request List (HPRL) to solve the current discrepancies. We have measured the neutron capture cross sections by the time-of-flight technique at the EAR1 experimental area of the n_TOF facility, using an array of four C$_6$D$_6$ detectors with very low neutron sensitivity. The highly-enriched samples used are significantly thinner than in previous measurements, thus minimizing the multiple-scattering effects. We have produced, and analyzed with the R-matrix analysis code SAMMY, capture yields featuring 33 resonances of $^{50}$Cr and 51 of $^{53}$Cr with an accuracy between 5% and 9%, hence fulfilling the requirements made by the NEA. The differential and integral cross sections have been compared to previous data and evaluations. The new $^{50,53}$Cr(n,$γ$) cross sections measured at the CERN n\TOF facility provide a valuable input for upcoming evaluations, which are deemed necessary given that the results presented herein do not support the increase in both cross sections proposed in the recent INDEN evaluation.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Global Microprocessor Correctness in the Presence of Transient Execution
Authors:
Andrew T. Walter,
Konstantinos Athanasiou,
Panagiotis Manolios
Abstract:
Correctness for microprocessors is generally understood to be conformance with the associated instruction set architecture (ISA). This is the basis for one of the most important abstractions in computer science, allowing hardware designers to develop highly-optimized processors that are functionally "equivalent" to an ideal processor that executes instructions atomically. This specification is alm…
▽ More
Correctness for microprocessors is generally understood to be conformance with the associated instruction set architecture (ISA). This is the basis for one of the most important abstractions in computer science, allowing hardware designers to develop highly-optimized processors that are functionally "equivalent" to an ideal processor that executes instructions atomically. This specification is almost always informal, e.g., commercial microprocessors generally do not come with conformance specifications. In this paper, we advocate for the use of formal specifications, using the theory of refinement. We introduce notions of correctness that can be used to deal with transient execution attacks, including Meltdown and Spectre. Such attacks have shown that ubiquitous microprocessor optimizations, appearing in numerous processors for decades, are inherently buggy. Unlike alternative approaches that use non-interference properties, our notion of correctness is global, meaning it is single specification that: formalizes conformance, includes functional correctness and is parameterized by an microarchitecture. We introduce action skipping refinement, a new type of refinement and we describe how our notions of refinement can be decomposed into properties that are more amenable to automated verification using the the concept of shared-resource commitment refinement maps. We do this in the context of formal, fully executable bit- and cycle-accurate models of an ISA and a microprocessor. Finally, we show how light-weight formal methods based on property-based testing can be used to identify transient execution bugs.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Profile monitoring of random functions with Gaussian process basis expansions
Authors:
Takayuki Iguchi,
Jonathan R. Stewart,
Eric Chicken
Abstract:
We consider the problem of online profile monitoring of random functions that admit basis expansions possessing random coefficients for the purpose of out-of-control state detection. Our approach is applicable to a broad class of random functions which feature two sources of variation: additive error and random fluctuations through random coefficients in the basis representation of functions. We f…
▽ More
We consider the problem of online profile monitoring of random functions that admit basis expansions possessing random coefficients for the purpose of out-of-control state detection. Our approach is applicable to a broad class of random functions which feature two sources of variation: additive error and random fluctuations through random coefficients in the basis representation of functions. We focus on a two-phase monitoring problem with a first stage consisting of learning the in-control process and the second stage leveraging the learned process for out-of-control state detection. The foundations of our method are derived under the assumption that the coefficients in the basis expansion are Gaussian random variables, which facilitates the development of scalable and effective monitoring methodology for the observed processes that makes weak functional assumptions on the underlying process. We demonstrate the potential of our method through simulation studies that highlight some of the nuances that emerge in profile monitoring problems with random functions, and through an application.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Sacks-Uhlenbeck type regularity for subcritical generalized $p$-harmonic maps into Homogeneous targets
Authors:
Gianmichele Di Matteo,
Tobias Lamm
Abstract:
Adapting \cite{strz3}, we define generalized $p$-harmonic maps into Riemannian homogeneous targets, a notion of solutions not belonging to the energy space. Restricting our attention to the subcritical range $p$ greater than the domain dimension $n$, we show a uniform $C^{1,α}$-regularity result for a sequence of such maps in the limit $p \searrow n$, assuming a uniform $n$-energy bound on its ele…
▽ More
Adapting \cite{strz3}, we define generalized $p$-harmonic maps into Riemannian homogeneous targets, a notion of solutions not belonging to the energy space. Restricting our attention to the subcritical range $p$ greater than the domain dimension $n$, we show a uniform $C^{1,α}$-regularity result for a sequence of such maps in the limit $p \searrow n$, assuming a uniform $n$-energy bound on its elements. The method of the proof follows the exact same lines as in \cite{strz3} but we need to check uniformity of estimates not previously considered there.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
The Average Soft X-ray Spectra of eROSITA Active Galactic Nuclei
Authors:
Shi-Jiang Chen,
Johannes Buchner,
Teng Liu,
Scott Hagen,
Sophia G. H. Waddell,
Kirpal Nandra,
Mara Salvato,
Zsofi Igo,
Catarina Aydar,
Andrea Merloni,
Qingling Ni,
Jia-Lai Kang,
Zhen-Yi Cai,
Jun-Xian Wang,
Ruancun Li,
Miriam E. Ramos-Ceja,
Jeremy Sanders,
Antonis Georgakakis,
Yi Zhang
Abstract:
Context. AGNs are strong X-ray emitters shaped by disk-corona interactions. The soft excess (0.5-2.0 keV) reveals key information about the "warm corona" bridging the disk and hot corona. Yet, how this feature evolves with accretion properties remains poorly constrained, especially in large samples using spectral stacking. Aims. The eROSITA All-Sky Survey (eRASS:5) provides an unprecedented sample…
▽ More
Context. AGNs are strong X-ray emitters shaped by disk-corona interactions. The soft excess (0.5-2.0 keV) reveals key information about the "warm corona" bridging the disk and hot corona. Yet, how this feature evolves with accretion properties remains poorly constrained, especially in large samples using spectral stacking. Aims. The eROSITA All-Sky Survey (eRASS:5) provides an unprecedented sample. We investigate how the average AGN X-ray spectra evolve with accretion parameters, and explore disk-corona connection by further combining stacked UV data. Methods. We developed Xstack, a novel tool that stacks rest-frame X-ray spectra and responses while preserving spectral shape through optimized weighting. We stack 17929 AGNs ("spec-z" sample, 23 Ms) with similar X-ray loudness alpha_ox, UV luminosity L_UV, and 4159 AGNs ("BH-mass" sample, 3 Ms) with similar Eddington ratio lambda_Edd and black hole mass M_BH. The resulting stacked X-ray spectra are analyzed with a phenomenological model. We further fit the stacked optical-UV-Xray SED with AGNSED model. Results. Soft excess strengthens strongly with alpha_ox and lambda_Edd (~5), while the hard X-ray spectral shape remains largely unchanged, supporting that soft excess is dominated by warm corona rather than reflection. AGNSED modeling reveals that warm corona radius (R_g units) generally increases with lambda_Edd and decreases with M_BH, or equivalently the disk-to-warm-corona transition consistently occurs near 1e4 K. The hot corona contracts with lambda_Edd and is unaffected by M_BH, aligning with disk evaporation predictions. Conclusions. The soft excess likely originates from a warm corona, with the disk to warm corona transition tied to hydrogen ionization near 1e4 K - supporting earlier eFEDS-HSC stacking results (Hagen et al. 2024). This study shows the strength of spectral stacking in probing AGN disk-corona physics.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Quantitative correlation between structural (dis-)order and diffuseness of phase transition in lead scandium tantalate
Authors:
T. Granzow,
A. Aravindhan,
Y. Nouchokgwe,
V. Kovacova,
S. Glinsek,
S. Hirose,
T. Usui,
H. Uršič,
I. Goričan,
W. Jo,
C. -H. Hong,
E. Defay
Abstract:
Ferroelectrics show a phase transition to a paraelectric phase at a well-defined transition temperature. Introducing disorder makes this transition diffuse, and the system becomes a relaxor. Since the degree of (dis-)order is usually manipulated by varying the chemical composition, it is difficult to establish a direct relationship between disorder and the degree of diffuseness. Perovskite structu…
▽ More
Ferroelectrics show a phase transition to a paraelectric phase at a well-defined transition temperature. Introducing disorder makes this transition diffuse, and the system becomes a relaxor. Since the degree of (dis-)order is usually manipulated by varying the chemical composition, it is difficult to establish a direct relationship between disorder and the degree of diffuseness. Perovskite structured lead scandium tantalate (Pb[Sc$_{1/2}$Ta$_{1/2}$]O$_3$, PST) offers the opportunity to tune the character of the transition by thermal annealing without changing the stoichiometry. Here it is demonstrated that there is a linear correlation between the structural ordering, quantified by the intensity ratio $S$ of the pseudocubic (111)/(200) x-ray diffraction peaks, and the diffuseness parameter $γ$ deduced from temperature-dependent dielectric spectroscopy. The relation is universal, independent of whether the sample is a thin film, multilayer capacitor or bulk ceramic, and also independent of the absolute value of the dielectric permittivity.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Large Language Model Unlearning for Source Code
Authors:
Xue Jiang,
Yihong Dong,
Zheng Fang,
Yingwei Ma,
Tangxinyu Wang,
Rongyu Cao,
Binhua Li,
Zhi Jin,
Wenpin Jiao,
Yongbin Li,
Ge Li
Abstract:
LLM4SE has demonstrated significant success, but LLMs' potential memorization of sensitive or outdated training data introduces critical risks to legal compliance, software security, and code quality. LLM unlearning techniques, which can eliminate the influence of undesired data from LLMs in a post-training way, present a promising solution to address these concerns. While recent efforts in LLM un…
▽ More
LLM4SE has demonstrated significant success, but LLMs' potential memorization of sensitive or outdated training data introduces critical risks to legal compliance, software security, and code quality. LLM unlearning techniques, which can eliminate the influence of undesired data from LLMs in a post-training way, present a promising solution to address these concerns. While recent efforts in LLM unlearning show effectiveness in natural language, their applicability to source code remains underexplored. Our empirical study reveals that existing LLM unlearning approaches, when applied to source code, cause severe model utility degradation, rendering models practically unusable for code generation. In this paper, we propose PROD, a novel unlearning approach that enables LLMs to forget undesired code content while effectively preserving their code generation capabilities. PROD suppresses the probability of forget data in LLMs' output distribution while promoting candidate distributional components, enabling the model to jointly learn to forget specific content and retain its general capabilities. To facilitate this study, we establish a benchmark for code unlearning evaluation, which includes three critical downstream tasks: copyrighted code unlearning, insecure code unlearning, and deprecated API unlearning. Our evaluation demonstrates that PROD achieves superior balance between forget quality and model utility compared to existing unlearning approaches across three downstream tasks, while consistently exhibiting improvements when applied to LLMs of varying series. PROD also exhibits superior robustness against adversarial attacks without generating or exposing the data to be forgotten. The results underscore that our approach not only extends the application boundary of unlearning techniques to source code, but also holds significant implications for advancing reliable code generation.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?
Authors:
Adithya Bhaskar,
Alexander Wettig,
Tianyu Gao,
Yihe Dong,
Danqi Chen
Abstract:
Language models handle increasingly long contexts for tasks such as book summarization, but this leads to growing memory costs for the key-value (KV) cache. Many prior works have proposed ways of discarding KVs from memory, but their approaches are tailored to favorable settings, obscuring caveats like high peak memory and performance degradation, and a fair comparison between methods is difficult…
▽ More
Language models handle increasingly long contexts for tasks such as book summarization, but this leads to growing memory costs for the key-value (KV) cache. Many prior works have proposed ways of discarding KVs from memory, but their approaches are tailored to favorable settings, obscuring caveats like high peak memory and performance degradation, and a fair comparison between methods is difficult. In this paper, we propose the *KV footprint* as a unified metric, which accounts for both the amount of KV entries stored and their lifespan in memory. We evaluate methods based on the smallest footprint they attain while preserving performance in both long-context understanding and generation, with context lengths of up to 128K tokens. This metric reveals the high peak memory of prior KV eviction methods. One class of methods -- *post-fill eviction* -- has a high footprint due to being incompatible with eviction during pre-filling. We adapt these methods to be able to evict KVs during pre-filling, achieving substantially lower KV footprints. We then turn to *recency eviction* methods, wherein we propose PruLong, an end-to-end optimization method for learning which attention heads need to retain the full KV cache and which do not. PruLong saves memory while preserving long-context performance, achieving 12% smaller KV footprint than prior methods while retaining performance in challenging recall tasks. Our paper clarifies the complex tangle of long-context inference methods and paves the way for future development to minimize the KV footprint.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking
Authors:
Teng Guo,
Jingjin Yu
Abstract:
We introduce a robust framework, RGBTrack, for real-time 6D pose estimation and tracking that operates solely on RGB data, thereby eliminating the need for depth input for such dynamic and precise object pose tracking tasks. Building on the FoundationPose architecture, we devise a novel binary search strategy combined with a render-and-compare mechanism to efficiently infer depth and generate robu…
▽ More
We introduce a robust framework, RGBTrack, for real-time 6D pose estimation and tracking that operates solely on RGB data, thereby eliminating the need for depth input for such dynamic and precise object pose tracking tasks. Building on the FoundationPose architecture, we devise a novel binary search strategy combined with a render-and-compare mechanism to efficiently infer depth and generate robust pose hypotheses from true-scale CAD models. To maintain stable tracking in dynamic scenarios, including rapid movements and occlusions, RGBTrack integrates state-of-the-art 2D object tracking (XMem) with a Kalman filter and a state machine for proactive object pose recovery. In addition, RGBTrack's scale recovery module dynamically adapts CAD models of unknown scale using an initial depth estimate, enabling seamless integration with modern generative reconstruction techniques. Extensive evaluations on benchmark datasets demonstrate that RGBTrack's novel depth-free approach achieves competitive accuracy and real-time performance, making it a promising practical solution candidate for application areas including robotics, augmented reality, and computer vision.
The source code for our implementation will be made publicly available at https://github.com/GreatenAnoymous/RGBTrack.git.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Closed-Loop Molecular Communication with Local and Global Degradation: Modeling and ISI Analysis
Authors:
Lukas Brand,
Fardad Vakilipoor,
Sören Botsch,
Timo Jakumeit,
Sebastian Lotter,
Robert Schober,
Maximilian Schäfer
Abstract:
This paper presents a novel physics-based model for signal propagation in closed-loop molecular communication (MC) systems, which are particularly relevant for many envisioned biomedical applications, such as health monitoring or drug delivery within the closed-loop human cardiovascular system (CVS). Compared to open-loop systems, which are mostly considered in MC, closed-loop systems exhibit diff…
▽ More
This paper presents a novel physics-based model for signal propagation in closed-loop molecular communication (MC) systems, which are particularly relevant for many envisioned biomedical applications, such as health monitoring or drug delivery within the closed-loop human cardiovascular system (CVS). Compared to open-loop systems, which are mostly considered in MC, closed-loop systems exhibit different characteristic effects influencing signaling molecule (SM) propagation. One key phenomenon are the periodic SM arrivals at the receiver (RX), leading to various types of inter-symbol interference (ISI) inherent to closed-loop system. To capture these characteristic effects, we propose an analytical model for the SM propagation inside closed-loop systems. The model accounts for arbitrary spatio-temporal SM release patterns at the transmitter (TX), and incorporates several environmental effects such as fluid flow, SM diffusion, and SM degradation. Moreover, to capture a wide range of practically relevant degradation and clearance mechanisms, the model includes both local removal (e.g., due to SM absorption into organs) and global removal (e.g., due to chemical degradation) of SMs. The accuracy of the proposed model is validated with three-dimensional (3-D) particle-based simulations (PBSs). Moreover, we utilize the proposed model to develop a rigorous characterization of the various types of ISI encountered in closed-loop MC systems.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping
Authors:
Teng Guo,
Baichuan Huang,
Jingjin Yu
Abstract:
Accurate 6D object pose estimation is a prerequisite for successfully completing robotic prehensile and non-prehensile manipulation tasks. At present, 6D pose estimation for robotic manipulation generally relies on depth sensors based on, e.g., structured light, time-of-flight, and stereo-vision, which can be expensive, produce noisy output (as compared with RGB cameras), and fail to handle transp…
▽ More
Accurate 6D object pose estimation is a prerequisite for successfully completing robotic prehensile and non-prehensile manipulation tasks. At present, 6D pose estimation for robotic manipulation generally relies on depth sensors based on, e.g., structured light, time-of-flight, and stereo-vision, which can be expensive, produce noisy output (as compared with RGB cameras), and fail to handle transparent objects. On the other hand, state-of-the-art monocular depth estimation models (MDEMs) provide only affine-invariant depths up to an unknown scale and shift. Metric MDEMs achieve some successful zero-shot results on public datasets, but fail to generalize. We propose a novel framework, Monocular One-shot Metric-depth Alignment (MOMA), to recover metric depth from a single RGB image, through a one-shot adaptation building on MDEM techniques. MOMA performs scale-rotation-shift alignments during camera calibration, guided by sparse ground-truth depth points, enabling accurate depth estimation without additional data collection or model retraining on the testing setup. MOMA supports fine-tuning the MDEM on transparent objects, demonstrating strong generalization capabilities. Real-world experiments on tabletop 2-finger grasping and suction-based bin-picking applications show MOMA achieves high success rates in diverse tasks, confirming its effectiveness.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Can an Extra Degree of Freedom in Scalar-Tensor Non-Metricity Gravity Account for the Evolution of the Universe?
Authors:
Ghulam Murtaza,
Avik De,
Andronikos Paliathanasis,
Tee-How Loo
Abstract:
We investigate whether the extra scalar degree of freedom that arises in the second connection class of scalar-tensor non-metricity gravity can accurately replicate and potentially enrich the cosmic expansion history. Focusing on a spatially flat FLRW background, we introduce Hubble-normalized variables and recast the field equations into an autonomous dynamical system. Four representative scenari…
▽ More
We investigate whether the extra scalar degree of freedom that arises in the second connection class of scalar-tensor non-metricity gravity can accurately replicate and potentially enrich the cosmic expansion history. Focusing on a spatially flat FLRW background, we introduce Hubble-normalized variables and recast the field equations into an autonomous dynamical system. Four representative scenarios are analyzed comprehensively. Phase-space research reveals a rich hierarchy of critical points: matter-dominated, stiff-fluid, and de Sitter solutions, together with asymptotic trajectories leading to Big-Crunch/Rip singularities and transient, unstable matter epochs. With suitable parameter choices, the standard $Λ$CDM sequence is reinstated; however, novel late-time and high-curvature regimes arise exclusively from the non-metricity sector. A systematic comparison of metric scalar-tensor and teleparallel scalar-torsion theories reveals unique stability characteristics and potential observational discriminants. Our findings indicate that the additional time-dependent function inherent to scalar-tensor non-metricity gravity can effectively explain the Universe's evolution while providing new phenomenology that can be tested by upcoming surveys.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Chain-of-Thought Prompting Obscures Hallucination Cues in Large Language Models: An Empirical Evaluation
Authors:
Jiahao Cheng,
Tiancheng Su,
Jia Yuan,
Guoxiu He,
Jiawei Liu,
Xinqi Tao,
Jingwen Xie,
Huaxia Li
Abstract:
Large Language Models (LLMs) often exhibit \textit{hallucinations}, generating factually incorrect or semantically irrelevant content in response to prompts. Chain-of-Thought (CoT) prompting can mitigate hallucinations by encouraging step-by-step reasoning, but its impact on hallucination detection remains underexplored. To bridge this gap, we conduct a systematic empirical evaluation. We begin wi…
▽ More
Large Language Models (LLMs) often exhibit \textit{hallucinations}, generating factually incorrect or semantically irrelevant content in response to prompts. Chain-of-Thought (CoT) prompting can mitigate hallucinations by encouraging step-by-step reasoning, but its impact on hallucination detection remains underexplored. To bridge this gap, we conduct a systematic empirical evaluation. We begin with a pilot experiment, revealing that CoT reasoning significantly affects the LLM's internal states and token probability distributions. Building on this, we evaluate the impact of various CoT prompting methods on mainstream hallucination detection methods across both instruction-tuned and reasoning-oriented LLMs. Specifically, we examine three key dimensions: changes in hallucination score distributions, variations in detection accuracy, and shifts in detection confidence. Our findings show that while CoT prompting helps reduce hallucination frequency, it also tends to obscure critical signals used for detection, impairing the effectiveness of various detection methods. Our study highlights an overlooked trade-off in the use of reasoning. Code is publicly available at: https://anonymous.4open.science/r/cot-hallu-detect.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs
Authors:
Ricardo Rei,
Nuno M. Guerreiro,
José Pombal,
João Alves,
Pedro Teixeirinha,
Amin Farajian,
André F. T. Martins
Abstract:
Fine-tuning pretrained LLMs has been shown to be an effective strategy for reaching state-of-the-art performance on specific tasks like machine translation. However, this process of adaptation often implies sacrificing general-purpose capabilities, such as conversational reasoning and instruction-following, hampering the utility of the system in real-world applications that require a mixture of sk…
▽ More
Fine-tuning pretrained LLMs has been shown to be an effective strategy for reaching state-of-the-art performance on specific tasks like machine translation. However, this process of adaptation often implies sacrificing general-purpose capabilities, such as conversational reasoning and instruction-following, hampering the utility of the system in real-world applications that require a mixture of skills. In this paper, we introduce Tower+, a suite of models designed to deliver strong performance across both translation and multilingual general-purpose text capabilities. We achieve a Pareto frontier between translation specialization and multilingual general-purpose capabilities by introducing a novel training recipe that builds on Tower (Alves et al., 2024), comprising continued pretraining, supervised fine-tuning, preference optimization, and reinforcement learning with verifiable rewards. At each stage of training, we carefully generate and curate data to strengthen performance on translation as well as general-purpose tasks involving code generation, mathematics problem solving, and general instruction-following. We develop models at multiple scales: 2B, 9B, and 72B. Our smaller models often outperform larger general-purpose open-weight and proprietary LLMs (e.g., Llama 3.3 70B, GPT-4o). Our largest model delivers best-in-class translation performance for high-resource languages and top results in multilingual Arena Hard evaluations and in IF-MT, a benchmark we introduce for evaluating both translation and instruction-following. Our findings highlight that it is possible to rival frontier models in general capabilities, while optimizing for specific business domains, such as translation and localization.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Empowering Near-Field Communications in Low-Altitude Economy with LLM: Fundamentals, Potentials, Solutions, and Future Directions
Authors:
Zhuo Xu,
Tianyue Zheng,
Linglong Dai
Abstract:
The low-altitude economy (LAE) is gaining significant attention from academia and industry. Fortunately, LAE naturally aligns with near-field communications in extremely large-scale MIMO (XL-MIMO) systems. By leveraging near-field beamfocusing, LAE can precisely direct beam energy to unmanned aerial vehicles, while the additional distance dimension boosts overall spectrum efficiency. However, near…
▽ More
The low-altitude economy (LAE) is gaining significant attention from academia and industry. Fortunately, LAE naturally aligns with near-field communications in extremely large-scale MIMO (XL-MIMO) systems. By leveraging near-field beamfocusing, LAE can precisely direct beam energy to unmanned aerial vehicles, while the additional distance dimension boosts overall spectrum efficiency. However, near-field communications in LAE still face several challenges, such as the increase in signal processing complexity and the necessity of distinguishing between far and near-field users. Inspired by the large language models (LLM) with powerful ability to handle complex problems, we apply LLM to solve challenges of near-field communications in LAE. The objective of this article is to provide a comprehensive analysis and discussion on LLM-empowered near-field communications in LAE. Specifically, we first introduce fundamentals of LLM and near-field communications, including the key advantages of LLM and key characteristics of near-field communications. Then, we reveal the opportunities and challenges of near-field communications in LAE. To address these challenges, we present a LLM-based scheme for near-field communications in LAE, and provide a case study which jointly distinguishes far and near-field users and designs multi-user precoding matrix. Finally, we outline and highlight several future research directions and open issues.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Behavior Driven Development for 3D Games
Authors:
Fernando Pastor Ricós,
Beatriz Marín,
I. S. W. B. Prasetya,
Tanja E. J. Vos,
Joseph Davidson,
Karel Hovorka
Abstract:
Computer 3D games are complex software environments that require novel testing processes to ensure high-quality standards. The Intelligent Verification/Validation for Extended Reality Based Systems (iv4XR) framework addresses this need by enabling the implementation of autonomous agents to automate game testing scenarios. This framework facilitates the automation of regression test cases for compl…
▽ More
Computer 3D games are complex software environments that require novel testing processes to ensure high-quality standards. The Intelligent Verification/Validation for Extended Reality Based Systems (iv4XR) framework addresses this need by enabling the implementation of autonomous agents to automate game testing scenarios. This framework facilitates the automation of regression test cases for complex 3D games like Space Engineers. Nevertheless, the technical expertise required to define test scripts using iv4XR can constrain seamless collaboration between developers and testers. This paper reports how integrating a Behavior-driven Development (BDD) approach with the iv4XR framework allows the industrial company behind Space Engineers to automate regression testing. The success of this industrial collaboration has inspired the iv4XR team to integrate the BDD approach to improve the automation of play-testing for the experimental 3D game LabRecruits. Furthermore, the iv4XR framework has been extended with tactical programming to enable the automation of long-play test scenarios in Space Engineers. These results underscore the versatility of the iv4XR framework in supporting diverse testing approaches while showcasing how BDD empowers users to create, manage, and execute automated game tests using comprehensive and human-readable statements.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Phase Transition of the Ising Model on a 3-Dimensional Fractal Lattice
Authors:
Jozef Genzor,
Roman Krčmár,
Hiroshi Ueda,
Denis Kochan,
Andrej Gendiar,
Tomotoshi Nishino
Abstract:
The critical behavior of the classical Ising model on a three-dimensional fractal lattice with Hausdorff dimension $d_H = \ln32 / \ln4 = 2.5$ is investigated using the higher-order tensor renormalization group (HOTRG) method. We determine the critical temperature $T_c \approx 2.65231$ and the critical exponents for magnetization $β\approx 0.059$ and field response $δ\approx 35$. Unlike a previousl…
▽ More
The critical behavior of the classical Ising model on a three-dimensional fractal lattice with Hausdorff dimension $d_H = \ln32 / \ln4 = 2.5$ is investigated using the higher-order tensor renormalization group (HOTRG) method. We determine the critical temperature $T_c \approx 2.65231$ and the critical exponents for magnetization $β\approx 0.059$ and field response $δ\approx 35$. Unlike a previously studied 2D fractal with $d_H \approx 1.792$, the specific heat for this 3D fractal exhibits a divergent singularity at $T_c$. The results are compared with those for regular lattices and other fractal structures to elucidate the role of dimensionality in critical phenomena.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Exploring MAXI J1744-294: IXPE insights into a Newly Discovered X-ray Transient
Authors:
Lorenzo Marra,
Romana Mikušincová,
Federico M. Vincentelli,
Fiamma Capitanio,
Melania Del Santo,
Sergio Fabiani,
Shifra Mandel,
Fabio Muleri,
Maxime Parra,
Paolo Soffitta,
Antonella Tarana,
M. Cristina Baglio,
Stefano Bianchi,
Enrico Costa,
Antonino D'Aì,
Barbara De Marco,
Michal Dovčiak,
Vittoria Elvezia Gianolli,
Andrea Gnarini,
Maitrayee Gupta,
Adam Ingram,
Guglielmo Mastroserio,
Giorgio Matt,
Kaya Mori,
Pierre-Olivier Petrucci
, et al. (11 additional authors not shown)
Abstract:
We present the first IXPE spectro-polarimetric observation of the black hole candidate MAXI J1744$-$294, a transient X-ray source discovered during a bright 2025 outburst in the Galactic center region. During the $\sim$150 ks observation, the source was found in the soft state, and its spectrum was well described by an absorbed multicolor disk with a minor high-energy tail. No significant polariza…
▽ More
We present the first IXPE spectro-polarimetric observation of the black hole candidate MAXI J1744$-$294, a transient X-ray source discovered during a bright 2025 outburst in the Galactic center region. During the $\sim$150 ks observation, the source was found in the soft state, and its spectrum was well described by an absorbed multicolor disk with a minor high-energy tail. No significant polarization was detected, and we derived a 3$σ$ upper limit on the polarization degree of $1.3\%$ in the 2--8 keV energy band. This result is consistent with previous findings for soft-state black hole binaries observed at low to intermediate inclination angles. By comparing the polarization degree upper limit with theoretical predictions for standard accretion disk emission, we constrain the disk inclination to $i \lesssim 38^\circ$--$71^\circ$, depending on the black hole spin and the disk atmosphere albedo.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Magnetic braking and dynamo evolution of $β$ Hydri
Authors:
A. R. G. Santos,
T. S. Metcalfe,
O. Kochukhov,
T. R. Ayres,
R. Gafeira,
T. L. Campante
Abstract:
The evolution of magnetic braking and dynamo processes in subgiant stars is essential for understanding how these stars lose angular momentum. We investigate the magnetic braking and dynamo evolution of $β$ Hydri, a G-type subgiant, to test the hypothesis of weakened magnetic braking and the potential rejuvenation of large-scale magnetic fields. We analyze spectropolarimetric observations from HAR…
▽ More
The evolution of magnetic braking and dynamo processes in subgiant stars is essential for understanding how these stars lose angular momentum. We investigate the magnetic braking and dynamo evolution of $β$ Hydri, a G-type subgiant, to test the hypothesis of weakened magnetic braking and the potential rejuvenation of large-scale magnetic fields. We analyze spectropolarimetric observations from HARPS (HARPSpol; polarimetric mode of High Accuracy Radial velocity Planet Searcher), and combine them with archival X-ray data and asteroseismic properties from TESS (Transiting Exoplanet Survey Satellite) to estimate the current wind braking torque of $β$ Hydri. Despite experiencing weakened magnetic braking during the second half of its main-sequence lifetime, our results indicate that $β$ Hydri has regained significant magnetic activity and a large-scale magnetic field. This observation aligns with the "born-again" dynamo hypothesis. Furthermore, our estimated wind braking torque is considerably stronger than what would be expected for a star in the weakened magnetic braking regime. This suggests that subgiants with extended convective zones can temporarily re-establish large-scale dynamo action. These results provide critical constraints on stellar rotation models and improve our understanding of the interplay between magnetic field structure, stellar activity cycles, and angular momentum evolution in old solar-type stars.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Navigating the Deep: Signature Extraction on Deep Neural Networks
Authors:
Haolin Liu,
Adrien Siproudhis,
Samuel Experton,
Peter Lorenz,
Christina Boura,
Thomas Peyrin
Abstract:
Neural network model extraction has emerged in recent years as an important security concern, as adversaries attempt to recover a network's parameters via black-box queries. A key step in this process is signature extraction, which aims to recover the absolute values of the network's weights layer by layer. Prior work, notably by Carlini et al. (2020), introduced a technique inspired by differenti…
▽ More
Neural network model extraction has emerged in recent years as an important security concern, as adversaries attempt to recover a network's parameters via black-box queries. A key step in this process is signature extraction, which aims to recover the absolute values of the network's weights layer by layer. Prior work, notably by Carlini et al. (2020), introduced a technique inspired by differential cryptanalysis to extract neural network parameters. However, their method suffers from several limitations that restrict its applicability to networks with a few layers only. Later works focused on improving sign extraction, but largely relied on the assumption that signature extraction itself was feasible.
In this work, we revisit and refine the signature extraction process by systematically identifying and addressing for the first time critical limitations of Carlini et al.'s signature extraction method. These limitations include rank deficiency and noise propagation from deeper layers. To overcome these challenges, we propose efficient algorithmic solutions for each of the identified issues, greatly improving the efficiency of signature extraction. Our approach permits the extraction of much deeper networks than was previously possible. We validate our method through extensive experiments on ReLU-based neural networks, demonstrating significant improvements in extraction depth and accuracy. For instance, our extracted network matches the target network on at least 95% of the input space for each of the eight layers of a neural network trained on the CIFAR-10 dataset, while previous works could barely extract the first three layers. Our results represent a crucial step toward practical attacks on larger and more complex neural network architectures.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Volumetric Parameterization for 3-Dimensional Simply-Connected Manifolds
Authors:
Zhiyuan Lyu,
Qiguang Chen,
Gary P. T. Choi,
Lok Ming Lui
Abstract:
With advances in technology, there has been growing interest in developing effective mapping methods for 3-dimensional objects in recent years. Volumetric parameterization for 3D solid manifolds plays an important role in processing 3D data. However, the conventional approaches cannot control the bijectivity and local geometric distortions of the result mappings due to the complex structure of the…
▽ More
With advances in technology, there has been growing interest in developing effective mapping methods for 3-dimensional objects in recent years. Volumetric parameterization for 3D solid manifolds plays an important role in processing 3D data. However, the conventional approaches cannot control the bijectivity and local geometric distortions of the result mappings due to the complex structure of the solid manifolds. Moreover, prior methods mainly focus on one property instead of balancing different properties during the mapping process. In this paper, we propose several novel methods for computing volumetric parameterizations for 3D simply-connected manifolds. Analogous to surface parameterization, our framework incorporates several models designed to preserve geometric structure, achieve density equalization, and optimally balance geometric and density distortions. With these methods, various 3D manifold parameterizations with different desired properties can be achieved. These methods are tested on different examples and manifold remeshing applications, demonstrating their effectiveness and accuracy.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Nanosculpted 3D helices of a magnetic Weyl semimetal with switchable nonreciprocity
Authors:
Max T. Birch,
Yukako Fujishiro,
Ilya Belopolski,
Masataka Mogi,
Yi-Ling Chiew,
Xiuzhen Yu,
Naoto Nagaosa,
Minoru Kawamura,
Yoshinori Tokura
Abstract:
The emergent properties of materials are defined by the symmetries of their underlying atomic, spin and charge order. The explorations of symmetry breaking effects are therefore usually limited by the intrinsic properties of known, stable materials. In recent years, advances in focused ion beam (FIB) fabrication have enabled the nanostructuring of bulk crystals into ultraprecise transport devices…
▽ More
The emergent properties of materials are defined by the symmetries of their underlying atomic, spin and charge order. The explorations of symmetry breaking effects are therefore usually limited by the intrinsic properties of known, stable materials. In recent years, advances in focused ion beam (FIB) fabrication have enabled the nanostructuring of bulk crystals into ultraprecise transport devices [1-4], facilitating the investigation of geometrical effects on mesoscopic length scales. In this work, we expand such explorations into three-dimensional (3D), curvilinear shapes, by sculpting helical nanostructure devices from single crystals of the high-mobility, centrosymmetric magnetic Weyl semimetal Co$_3$Sn$_2$S$_2$ [5,6]. The combination of the imposed chiral geometry and intrinsic ferromagnetism yields nonreciprocal electron transport [7-9]. The high coercivity results in an anomalous, reversable diode effect remnant under zero applied magnetic field, which is orders of magnitude larger than can be explained by a classical self-field mechanism. We argue the enhancement originates from the high carrier mobility and the resulting quasi-ballistic transport: the conduction electron mean free path approaches the length scale of the curvature, resulting in increased asymmetrical scattering at the boundaries. We further demonstrate the inverse effect of the nonreciprocal transport: the field-free, current-induced switching of the magnetisation. The results establish the vast potential of 3D nanosculpting to explore and enrich the functionality of quantum materials.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Instituto de Telecomunicações at IWSLT 2025: Aligning Small-Scale Speech and Language Models for Speech-to-Text Learning
Authors:
Giuseppe Attanasio,
Sonal Sannigrahi,
Ben Peters,
André F. T. Martins
Abstract:
This paper presents the IT-IST submission to the IWSLT 2025 Shared Task on Instruction Following Speech Processing. We submit results for the Short Track, i.e., speech recognition, translation, and spoken question answering. Our model is a unified speech-to-text model that integrates a pre-trained continuous speech encoder and text decoder through a first phase of modality alignment and a second p…
▽ More
This paper presents the IT-IST submission to the IWSLT 2025 Shared Task on Instruction Following Speech Processing. We submit results for the Short Track, i.e., speech recognition, translation, and spoken question answering. Our model is a unified speech-to-text model that integrates a pre-trained continuous speech encoder and text decoder through a first phase of modality alignment and a second phase of instruction fine-tuning. Crucially, we focus on using small-scale language model backbones (< 2B) and restrict to high-quality, CC-BY data along with synthetic data generation to supplement existing resources.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Simulating Correlated Electrons with Symmetry-Enforced Normalizing Flows
Authors:
Dominic Schuh,
Janik Kreit,
Evan Berkowitz,
Lena Funcke,
Thomas Luu,
Kim A. Nicoli,
Marcel Rodekamp
Abstract:
We present the first proof of principle that normalizing flows can accurately learn the Boltzmann distribution of the fermionic Hubbard model - a key framework for describing the electronic structure of graphene and related materials. State-of-the-art methods like Hybrid Monte Carlo often suffer from ergodicity issues near the time-continuum limit, leading to biased estimates. Leveraging symmetry-…
▽ More
We present the first proof of principle that normalizing flows can accurately learn the Boltzmann distribution of the fermionic Hubbard model - a key framework for describing the electronic structure of graphene and related materials. State-of-the-art methods like Hybrid Monte Carlo often suffer from ergodicity issues near the time-continuum limit, leading to biased estimates. Leveraging symmetry-aware architectures as well as independent and identically distributed sampling, our approach resolves these issues and achieves significant speed-ups over traditional methods.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Low-Complexity Receiver Design for Affine Filter Bank Modulation
Authors:
Kuranage Roche Rayan Ranasinghe,
Bruno S. Chang,
Giuseppe Thadeu Freitas de Abreu
Abstract:
We propose a low-complexity receiver structure for the recently introduced Affine Filter Bank Modulation (AFBM) scheme, which is a novel waveform designed for integrated sensing and communications (ISAC) systems operating in doubly-dispersive (DD) channels. The proposed receiver structure is based on the Gaussian Belief Propagation (GaBP) framework, making use of only element-wise scalar operation…
▽ More
We propose a low-complexity receiver structure for the recently introduced Affine Filter Bank Modulation (AFBM) scheme, which is a novel waveform designed for integrated sensing and communications (ISAC) systems operating in doubly-dispersive (DD) channels. The proposed receiver structure is based on the Gaussian Belief Propagation (GaBP) framework, making use of only element-wise scalar operations to perform detection of the transmitted symbols. Simulation results demonstrate that AFBM in conjunction with GaBP outperforms affine frequency division multiplexing (AFDM) in terms of bit error rates (BERs) in DD channels, while achieving very low out-of-band emissions (OOBE) in high-mobility scenarios.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Robust Reinforcement Learning for Discrete Compositional Generation via General Soft Operators
Authors:
Marco Jiralerspong,
Esther Derman,
Danilo Vucetic,
Nikolay Malkin,
Bilun Sun,
Tianyu Zhang,
Pierre-Luc Bacon,
Gauthier Gidel
Abstract:
A major bottleneck in scientific discovery involves narrowing a large combinatorial set of objects, such as proteins or molecules, to a small set of promising candidates. While this process largely relies on expert knowledge, recent methods leverage reinforcement learning (RL) to enhance this filtering. They achieve this by estimating proxy reward functions from available datasets and using regula…
▽ More
A major bottleneck in scientific discovery involves narrowing a large combinatorial set of objects, such as proteins or molecules, to a small set of promising candidates. While this process largely relies on expert knowledge, recent methods leverage reinforcement learning (RL) to enhance this filtering. They achieve this by estimating proxy reward functions from available datasets and using regularization to generate more diverse candidates. These reward functions are inherently uncertain, raising a particularly salient challenge for scientific discovery. In this work, we show that existing methods, often framed as sampling proportional to a reward function, are inadequate and yield suboptimal candidates, especially in large search spaces. To remedy this issue, we take a robust RL approach and introduce a unified operator that seeks robustness to the uncertainty of the proxy reward function. This general operator targets peakier sampling distributions while encompassing known soft RL operators. It also leads us to a novel algorithm that identifies higher-quality, diverse candidates in both synthetic and real-world tasks. Ultimately, our work offers a new, flexible perspective on discrete compositional generation tasks. Code: https://github.com/marcojira/tgm.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
RAGentA: Multi-Agent Retrieval-Augmented Generation for Attributed Question Answering
Authors:
Ines Besrour,
Jingbo He,
Tobias Schreieder,
Michael Färber
Abstract:
We present RAGentA, a multi-agent retrieval-augmented generation (RAG) framework for attributed question answering (QA). With the goal of trustworthy answer generation, RAGentA focuses on optimizing answer correctness, defined by coverage and relevance to the question and faithfulness, which measures the extent to which answers are grounded in retrieved documents. RAGentA uses a multi-agent archit…
▽ More
We present RAGentA, a multi-agent retrieval-augmented generation (RAG) framework for attributed question answering (QA). With the goal of trustworthy answer generation, RAGentA focuses on optimizing answer correctness, defined by coverage and relevance to the question and faithfulness, which measures the extent to which answers are grounded in retrieved documents. RAGentA uses a multi-agent architecture that iteratively filters retrieved documents, generates attributed answers with in-line citations, and verifies completeness through dynamic refinement. Central to the framework is a hybrid retrieval strategy that combines sparse and dense methods, improving Recall@20 by 12.5% compared to the best single retrieval model, resulting in more correct and well-supported answers. Evaluated on a synthetic QA dataset derived from the FineWeb index, RAGentA outperforms standard RAG baselines, achieving gains of 1.09% in correctness and 10.72% in faithfulness. These results demonstrate the effectiveness of the multi-agent architecture and hybrid retrieval in advancing trustworthy QA.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Resonance fluorescence and indistinguishable photons from a coherently driven B centre in hBN
Authors:
Domitille Gérard,
Stéphanie Buil,
Kenji Watanabe,
Takashi Taniguchi,
Jean-Pierre Hermier,
Aymeric Delteil
Abstract:
Optically active defects in hexagonal boron nitride (hBN) have become amongst the most attractive single-photon emitters in the solid state, owing to their high-quality photophysical properties, combined with the unlimited possibilities of integration offered by the host two-dimensional material. In particular, the B centres, with their narrow linewidth, low wavelength spread and controllable posi…
▽ More
Optically active defects in hexagonal boron nitride (hBN) have become amongst the most attractive single-photon emitters in the solid state, owing to their high-quality photophysical properties, combined with the unlimited possibilities of integration offered by the host two-dimensional material. In particular, the B centres, with their narrow linewidth, low wavelength spread and controllable positioning, have raised a particular interest for integrated quantum photonics. However, to date, either their excitation or their detection has been performed non-resonantly due to the difficulty of rejecting the backreflected laser light at the same wavelength, thereby preventing to take full benefit from their high coherence in quantum protocols. Here, we make use of a narrow-linewidth emitter integrated in a hybrid metal-dielectric structure to implement crossed-polarisation laser rejection. This allows us to observe resonantly scattered photons, with associated experimental signatures of optical coherence in both continuous-wave (cw) and pulsed regimes, respectively the Mollow triplet and Hong-Ou-Mandel interference from zero-phonon-line emission. The measured two-photon interference visibility of 0.92 demonstrates the potential of B centres in hBN for applications to integrated quantum information.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Search for the in-situ production of $^{77}$Ge in the GERDA neutrinoless double-beta decay experiment
Authors:
M. Agostini,
A. Alexander,
G. Araujo,
A. M. Bakalyarov,
M. Balata,
I. Barabanov,
L. Baudis,
C. Bauer,
S. Belogurov,
A. Bettini,
L. Bezrukov,
V. Biancacci,
E. Bossio,
V. Bothe,
R. Brugnera,
A. Caldwell,
S. Calgaro,
C. Cattadori,
A. Chernogorov,
P. -J. Chiu,
T. Comellato,
V. D'Andrea,
E. V. Demidova,
N. Di Marco,
E. Doroshkevich
, et al. (86 additional authors not shown)
Abstract:
The beta decay of $^{77}$Ge and $^{77\mathrm{m}}$Ge, both produced by neutron capture on $^{76}$Ge, is a potential background for Germanium based neutrinoless double-beta decay search experiments such as GERDA or the LEGEND experiment. In this work we present a search for $^{77}$Ge decays in the full GERDA Phase II data set. A delayed coincidence method was employed to identify the decay of…
▽ More
The beta decay of $^{77}$Ge and $^{77\mathrm{m}}$Ge, both produced by neutron capture on $^{76}$Ge, is a potential background for Germanium based neutrinoless double-beta decay search experiments such as GERDA or the LEGEND experiment. In this work we present a search for $^{77}$Ge decays in the full GERDA Phase II data set. A delayed coincidence method was employed to identify the decay of $^{77}$Ge via the isomeric state of $^{77}$As (9/2$^+$, 475 keV, ${T_{1/2} = 114}\,μ$s, $^{77\mathrm{m}}$As). New digital signal processing methods were employed to select and analyze pile-up signals. No signal was observed, and an upper limit on the production rate of was set at $<0.216$ nuc/(kg$\cdot$yr) (90% CL). This corresponds to a total production rate of $^{77}$Ge and $^{77\mathrm{m}}$Ge of $<0.38$ nuc/(kg$\cdot$ yr) (90% CL), assuming equal production rates. A previous Monte Carlo study predicted a value for in-situ $^{77}$Ge and $^{77\mathrm{m}}$Ge production of (0.21$\pm$0.07) nuc/(kg$\cdot$yr), a prediction that is now further corroborated by our experimental limit. Moreover, tagging the isomeric state of $^{77\mathrm{m}}$As can be utilised to further suppress the $^{77}$Ge background. Considering the similar experimental configurations of LEGEND-1000 and GERDA, the cosmogenic background in LEGEND-1000 at LNGS is estimated to remain at a sub-dominant level.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
LAION-C: An Out-of-Distribution Benchmark for Web-Scale Vision Models
Authors:
Fanfei Li,
Thomas Klein,
Wieland Brendel,
Robert Geirhos,
Roland S. Zimmermann
Abstract:
Out-of-distribution (OOD) robustness is a desired property of computer vision models. Improving model robustness requires high-quality signals from robustness benchmarks to quantify progress. While various benchmark datasets such as ImageNet-C were proposed in the ImageNet era, most ImageNet-C corruption types are no longer OOD relative to today's large, web-scraped datasets, which already contain…
▽ More
Out-of-distribution (OOD) robustness is a desired property of computer vision models. Improving model robustness requires high-quality signals from robustness benchmarks to quantify progress. While various benchmark datasets such as ImageNet-C were proposed in the ImageNet era, most ImageNet-C corruption types are no longer OOD relative to today's large, web-scraped datasets, which already contain common corruptions such as blur or JPEG compression artifacts. Consequently, these benchmarks are no longer well-suited for evaluating OOD robustness in the era of web-scale datasets. Indeed, recent models show saturating scores on ImageNet-era OOD benchmarks, indicating that it is unclear whether models trained on web-scale datasets truly become better at OOD generalization or whether they have simply been exposed to the test distortions during training. To address this, we introduce LAION-C as a benchmark alternative for ImageNet-C. LAION-C consists of six novel distortion types specifically designed to be OOD, even for web-scale datasets such as LAION. In a comprehensive evaluation of state-of-the-art models, we find that the LAION-C dataset poses significant challenges to contemporary models, including MLLMs such as Gemini and GPT-4o. We additionally conducted a psychophysical experiment to evaluate the difficulty of our corruptions for human observers, enabling a comparison of models to lab-quality human robustness data. We observe a paradigm shift in OOD generalization: from humans outperforming models, to the best models now matching or outperforming the best human observers.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Performance studies of thin gas gap Resistive Plate Chamber prototypes with low Global Warming Potential gases for the ANUBIS experiment
Authors:
Aashaq Shah,
Thomas Adolphus,
Yingchang Zhang,
Oleg Brandt
Abstract:
Resistive Plate Chambers (RPCs) have traditionally operated with high Global Warming Potential (GWP) gas mixtures, adding to the environmental footprint of large-scale physics experiments. In response, efforts are underway to explore environmentally friendly alternatives as a long-term solution and low-GWP as a feasible short- to medium-term replacement for standard RPC gases. This study tests a f…
▽ More
Resistive Plate Chambers (RPCs) have traditionally operated with high Global Warming Potential (GWP) gas mixtures, adding to the environmental footprint of large-scale physics experiments. In response, efforts are underway to explore environmentally friendly alternatives as a long-term solution and low-GWP as a feasible short- to medium-term replacement for standard RPC gases. This study tests a few mixtures in 50 cm $\times$ 50 cm, 1 mm single-gap High-Pressure Laminate (HPL) RPC prototypes, as part of ongoing efforts for the ANUBIS experiment, which will operate with a 9.8 m$^{3}$ active gas volume. Measurements of performance metrics, including current and efficiency, are conducted with both standard and modified mixtures to assess their viability in sustaining detector performance. The results are also relevant for large RPC systems in other experiments at the LHC, such as ATLAS and CMS, as well as in applications beyond the LHC, supporting a shift toward environmentally sustainable gas mixtures in particle physics detectors.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
LunarLoc: Segment-Based Global Localization on the Moon
Authors:
Annika Thomas,
Robaire Galliath,
Aleksander Garbuz,
Luke Anger,
Cormac O'Neill,
Trevor Johst,
Dami Thomas,
George Lordos,
Jonathan P. How
Abstract:
Global localization is necessary for autonomous operations on the lunar surface where traditional Earth-based navigation infrastructure, such as GPS, is unavailable. As NASA advances toward sustained lunar presence under the Artemis program, autonomous operations will be an essential component of tasks such as robotic exploration and infrastructure deployment. Tasks such as excavation and transpor…
▽ More
Global localization is necessary for autonomous operations on the lunar surface where traditional Earth-based navigation infrastructure, such as GPS, is unavailable. As NASA advances toward sustained lunar presence under the Artemis program, autonomous operations will be an essential component of tasks such as robotic exploration and infrastructure deployment. Tasks such as excavation and transport of regolith require precise pose estimation, but proposed approaches such as visual-inertial odometry (VIO) accumulate odometry drift over long traverses. Precise pose estimation is particularly important for upcoming missions such as the ISRU Pilot Excavator (IPEx) that rely on autonomous agents to operate over extended timescales and varied terrain. To help overcome odometry drift over long traverses, we propose LunarLoc, an approach to global localization that leverages instance segmentation for zero-shot extraction of boulder landmarks from onboard stereo imagery. Segment detections are used to construct a graph-based representation of the terrain, which is then aligned with a reference map of the environment captured during a previous session using graph-theoretic data association. This method enables accurate and drift-free global localization in visually ambiguous settings. LunarLoc achieves sub-cm level accuracy in multi-session global localization experiments, significantly outperforming the state of the art in lunar global localization. To encourage the development of further methods for global localization on the Moon, we release our datasets publicly with a playback module: https://github.com/mit-acl/lunarloc-data.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
A deep learning and machine learning approach to predict neonatal death in the context of São Paulo
Authors:
Mohon Raihan,
Plabon Kumar Saha,
Rajan Das Gupta,
A Z M Tahmidul Kabir,
Afia Anjum Tamanna,
Md. Harun-Ur-Rashid,
Adnan Bin Abdus Salam,
Md Tanvir Anjum,
A Z M Ahteshamul Kabir
Abstract:
Neonatal death is still a concerning reality for underdeveloped and even some developed countries. Worldwide data indicate that 26.693 babies out of 1,000 births die, according to Macro Trades. To reduce this number, early prediction of endangered babies is crucial. Such prediction enables the opportunity to take ample care of the child and mother so that early child death can be avoided. In this…
▽ More
Neonatal death is still a concerning reality for underdeveloped and even some developed countries. Worldwide data indicate that 26.693 babies out of 1,000 births die, according to Macro Trades. To reduce this number, early prediction of endangered babies is crucial. Such prediction enables the opportunity to take ample care of the child and mother so that early child death can be avoided. In this context, machine learning was used to determine whether a newborn baby is at risk. To train the predictive model, historical data of 1.4 million newborns was used. Machine learning and deep learning techniques such as logical regression, K-nearest neighbor, random forest classifier, extreme gradient boosting (XGBoost), convolutional neural network, and long short-term memory (LSTM) were implemented using the dataset to identify the most accurate model for predicting neonatal mortality. Among the machine learning algorithms, XGBoost and random forest classifier achieved the best accuracy with 94%, while among the deep learning models, LSTM delivered the highest accuracy with 99%. Therefore, using LSTM appears to be the most suitable approach to predict whether precautionary measures for a child are necessary.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Real-Time Black-Box Optimization for Dynamic Discrete Environments Using Embedded Ising Machines
Authors:
Tomoya Kashimata,
Yohei Hamakawa,
Masaya Yamasaki,
Kosuke Tatsumura
Abstract:
Many real-time systems require the optimization of discrete variables. Black-box optimization (BBO) algorithms and multi-armed bandit (MAB) algorithms perform optimization by repeatedly taking actions and observing the corresponding instant rewards without any prior knowledge. Recently, a BBO method using an Ising machine has been proposed to find the best action that is represented by a combinati…
▽ More
Many real-time systems require the optimization of discrete variables. Black-box optimization (BBO) algorithms and multi-armed bandit (MAB) algorithms perform optimization by repeatedly taking actions and observing the corresponding instant rewards without any prior knowledge. Recently, a BBO method using an Ising machine has been proposed to find the best action that is represented by a combination of discrete values and maximizes the instant reward in static environments. In contrast, dynamic environments, where real-time systems operate, necessitate MAB algorithms that maximize the average reward over multiple trials. However, due to the enormous number of actions resulting from the combinatorial nature of discrete optimization, conventional MAB algorithms cannot effectively optimize dynamic, discrete environments. Here, we show a heuristic MAB method for dynamic, discrete environments by extending the BBO method, in which an Ising machine effectively explores the actions while considering interactions between variables and changes in dynamic environments. We demonstrate the dynamic adaptability of the proposed method in a wireless communication system with moving users.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
On graded and shifted notions, and thick morphisms
Authors:
Theodore Voronov
Abstract:
We consider the notions of $L_{\infty}$-, $P_{\infty}$-, and $S_{\infty}$-algebras (including "shifted" versions) in the ${\mathbb Z_{2}} \times {\mathbb Z}$-graded setting. We also consider thick (microformal) morphisms and show explicitly how they work in such graded context.
We consider the notions of $L_{\infty}$-, $P_{\infty}$-, and $S_{\infty}$-algebras (including "shifted" versions) in the ${\mathbb Z_{2}} \times {\mathbb Z}$-graded setting. We also consider thick (microformal) morphisms and show explicitly how they work in such graded context.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Infalling Ultra Faint Dwarfs as Emissaries of the Axiverse?
Authors:
A. Pozo,
T. Broadhurst,
H. N. Luu,
G. Smoot,
K. Umetsu,
T. Chiueh,
H. -Y. Schive,
R. Emami,
L. Hernquist,
P. Mocz,
M. Vogelsberger
Abstract:
Recent discoveries of Ultra Faint dwarf galaxies (UFG's) infalling onto the Milky Way, namely Leo K \& M at $r \simeq 450$kpc, considerably strengthens the case that UFG's constitute a distinct galaxy class that is inherently smaller, fainter and metal poor compared to the classical dwarf spheroidals (dSph). This distinction is at odds with the inherent continuity of galaxy halo masses formed unde…
▽ More
Recent discoveries of Ultra Faint dwarf galaxies (UFG's) infalling onto the Milky Way, namely Leo K \& M at $r \simeq 450$kpc, considerably strengthens the case that UFG's constitute a distinct galaxy class that is inherently smaller, fainter and metal poor compared to the classical dwarf spheroidals (dSph). This distinction is at odds with the inherent continuity of galaxy halo masses formed under scale-free gravity for any standard dark matter model. Here we show that distinct galaxy classes do evolve in cosmological simulations of multiple light bosons representing the 'Axiverse' proposal of String Theory, where a discrete mass spectrum of axions is generically predicted to span many decades in mass. In this context, the
observed UFG class we show corresponds to a relatively heavy boson of $3\times 10^{-21}$eV, including Leo K \& M, whereas a lighter axion of $10^{-22}$eV comprises the bulk of dark matter in all larger galaxies including the dSph's. Although Leo M is larger in size than Leo K, we predict its velocity dispersion to be smaller $\simeq 1.7$km/s, compared to $\simeq 4.5$km/s for Leo K, since soliton cores are required by the Uncertainty Principle to be wider at lower momentum. This scenario can be definitively tested using millisecond pulsars close to the Galactic center, where the Compton frequencies of the heavy and light bosons imprint monotone timing residuals that may be detected by SKA on timescales of approximately 1 week and 4 months, respectively.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Magnus Methods for Stochastic Delay-Differential Equations
Authors:
Mitchell T. Griggs,
Kevin Burrage,
Pamela M. Burrage
Abstract:
This paper introduces Magnus-based methods for solving stochastic delay-differential equations (SDDEs). We construct Magnus--Euler--Maruyama (MEM) and Magnus--Milstein (MM) schemes by combining stochastic Magnus integrators with Taylor methods for SDDEs. These schemes are applied incrementally between multiples of the delay times. We present proofs of their convergence orders and demonstrate these…
▽ More
This paper introduces Magnus-based methods for solving stochastic delay-differential equations (SDDEs). We construct Magnus--Euler--Maruyama (MEM) and Magnus--Milstein (MM) schemes by combining stochastic Magnus integrators with Taylor methods for SDDEs. These schemes are applied incrementally between multiples of the delay times. We present proofs of their convergence orders and demonstrate these rates through numerical examples and error graphs. Among the examples, we apply the MEM and MM schemes to both linear and nonlinear problems. We also apply the MEM scheme to a stochastic partial delay-differential equation (SPDDE), comparing its performance with the traditional Euler--Maruyama (EM) method. Under fine spatial discretization, the MEM scheme remains numerically stable while the EM method becomes unstable, yielding a significant computational advantage.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.