Search | arXiv e-print repository

Exploration by Random Reward Perturbation

Authors: Haozhe Ma, Guoji Fu, Zhengding Luo, Jiele Wu, Tze-Yun Leong

Abstract: We introduce Random Reward Perturbation (RRP), a novel exploration strategy for reinforcement learning (RL). Our theoretical analyses demonstrate that adding zero-mean noise to environmental rewards effectively enhances policy diversity during training, thereby expanding the range of exploration. RRP is fully compatible with the action-perturbation-based exploration strategies, such as $ε$-greedy,… ▽ More We introduce Random Reward Perturbation (RRP), a novel exploration strategy for reinforcement learning (RL). Our theoretical analyses demonstrate that adding zero-mean noise to environmental rewards effectively enhances policy diversity during training, thereby expanding the range of exploration. RRP is fully compatible with the action-perturbation-based exploration strategies, such as $ε$-greedy, stochastic policies, and entropy regularization, providing additive improvements to exploration effects. It is general, lightweight, and can be integrated into existing RL algorithms with minimal implementation effort and negligible computational overhead. RRP establishes a theoretical connection between reward shaping and noise-driven exploration, highlighting their complementary potential. Experiments show that RRP significantly boosts the performance of Proximal Policy Optimization and Soft Actor-Critic, achieving higher sample efficiency and escaping local optima across various tasks, under both sparse and dense reward scenarios. △ Less

Submitted 10 June, 2025; originally announced June 2025.

arXiv:2505.22648 [pdf, ps, other]

WebDancer: Towards Autonomous Information Seeking Agency

Authors: Jialong Wu, Baixuan Li, Runnan Fang, Wenbiao Yin, Liwen Zhang, Zhengwei Tao, Dingchu Zhang, Zekun Xi, Gang Fu, Yong Jiang, Pengjun Xie, Fei Huang, Jingren Zhou

Abstract: Addressing intricate real-world problems necessitates in-depth information seeking and multi-step reasoning. Recent progress in agentic systems, exemplified by Deep Research, underscores the potential for autonomous multi-step research. In this work, we present a cohesive paradigm for building end-to-end agentic information seeking agents from a data-centric and training-stage perspective. Our app… ▽ More Addressing intricate real-world problems necessitates in-depth information seeking and multi-step reasoning. Recent progress in agentic systems, exemplified by Deep Research, underscores the potential for autonomous multi-step research. In this work, we present a cohesive paradigm for building end-to-end agentic information seeking agents from a data-centric and training-stage perspective. Our approach consists of four key stages: (1) browsing data construction, (2) trajectories sampling, (3) supervised fine-tuning for effective cold start, and (4) reinforcement learning for enhanced generalisation. We instantiate this framework in a web agent based on the ReAct, WebDancer. Empirical evaluations on the challenging information seeking benchmarks, GAIA and WebWalkerQA, demonstrate the strong performance of WebDancer, achieving considerable results and highlighting the efficacy of our training paradigm. Further analysis of agent training provides valuable insights and actionable, systematic pathways for developing more capable agentic models. The codes and demo will be released in https://github.com/Alibaba-NLP/WebAgent. △ Less

Submitted 29 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

arXiv:2505.21358 [pdf, ps, other]

The James Webb Space Telescope NIRSpec-PRISM Transmission Spectrum of the Super-Puff, Kepler-51d

Authors: Jessica E. Libby-Roberts, Aaron Bello-Arufe, Zachory K. Berta-Thompson, Caleb I. Cañas, Yayaati Chachan, Renyu Hu, Yui Kawashima, Catriona Murray, Kazumasa Ohno, Armen Tokadjian, Suvrath Mahadevan, Kento Masuda, Leslie Hebb, Caroline Morley, Guangwei Fu, Peter Gao, Kevin B. Stevenson

Abstract: Kepler-51 is a 500 Myr G dwarf hosting three "super-puffs" and one low-mass non-transiting planet. Kepler-51d, the coolest (T_eq ~ 350 K) transiting planet in this system, is also one of the lowest density super-puffs known to date (rho_p = 0.038 +/- 0.009 g/cm^3). With a planetary mass of Mp = 5.6 +/- 1.2 Earth masses and a radius of Rp = 9.32 +/- 0.18 Earth radii, the observed properties of this… ▽ More Kepler-51 is a 500 Myr G dwarf hosting three "super-puffs" and one low-mass non-transiting planet. Kepler-51d, the coolest (T_eq ~ 350 K) transiting planet in this system, is also one of the lowest density super-puffs known to date (rho_p = 0.038 +/- 0.009 g/cm^3). With a planetary mass of Mp = 5.6 +/- 1.2 Earth masses and a radius of Rp = 9.32 +/- 0.18 Earth radii, the observed properties of this planet are not readily explained by most planet formation theories. Hypotheses explaining Kepler-51d's low density range from a substantial H/He envelope comprising more than 30% of its mass, to a high-altitude haze layer, to a tilted ring system. To test these hypotheses, we present the NIRSpec-PRISM 0.6-5.3 micron transmission spectrum of Kepler-51d observed by the James Webb Space Telescope. We find a spectrum best fit by a sloped line covering the entire wavelength range. Based on forward modeling and atmosphere retrievals, Kepler-51d likely possesses a low-metallicity atmosphere with high-altitude hazes of submicron particle sizes spanning pressures of 1-100 microbars. However, the spectrum could also be explained by a tilted ring with an estimated lifetime on the order of ~0.1 Myr. We also investigate the stellar activity of this young Sun-like star, extracting a spot temperature significantly hotter than sunspots and spot covering fractions on the order of 0.1-10%, depending on the assumed spot parameters. △ Less

Submitted 27 May, 2025; originally announced May 2025.

Comments: 34 pages, 13 figures, submitted to AJ

arXiv:2505.19732 [pdf, ps, other]

Generic effective source for gravitational self-force calculations in Schwarzschild spacetime

Authors: Chao Zhang, Rong-gen Cai, Guoyang Fu, Yungui Gong, Xuchen Lu, Wenting Zhou

Abstract: The numerical calculation of gravitational self-force in extreme mass ratio inspiral systems (EMRIs) is fundamentally challenging due to the singular nature of point-particle sources. This singularity arises from the interaction between the smaller compact object and its own gravitational perturbation. To address these challenges, the effective source method offers an innovative approach. It repla… ▽ More The numerical calculation of gravitational self-force in extreme mass ratio inspiral systems (EMRIs) is fundamentally challenging due to the singular nature of point-particle sources. This singularity arises from the interaction between the smaller compact object and its own gravitational perturbation. To address these challenges, the effective source method offers an innovative approach. It replaces traditional regularization schemes with a reformulation of the problem, utilizing finite and physically meaningful effective sources that inherently incorporate renormalized quantities. This paper presents an analytic framework for constructing finite and continuous effective sources for massive point particles undergoing arbitrary geodesic motion in Schwarzschild spacetime. This represents the first fully analytic treatment of such sources for generic geodesic trajectories. The complete analytic expression for effective sources establishes a critical foundation for computing both self-consistent and second-order gravitational self-forces, thereby enabling accurate waveform modeling of EMRIs in gravitational wave astronomy. △ Less

Submitted 26 May, 2025; originally announced May 2025.

Comments: 29 pages, 6 figures; comments are welcome

arXiv:2505.19538 [pdf, other]

DoctorRAG: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients

Authors: Yuxing Lu, Gecheng Fu, Wei Wu, Xukai Zhao, Sin Yee Goi, Jinzhuo Wang

Abstract: Existing medical RAG systems mainly leverage knowledge from medical knowledge bases, neglecting the crucial role of experiential knowledge derived from similar patient cases -- a key component of human clinical reasoning. To bridge this gap, we propose DoctorRAG, a RAG framework that emulates doctor-like reasoning by integrating both explicit clinical knowledge and implicit case-based experience.… ▽ More Existing medical RAG systems mainly leverage knowledge from medical knowledge bases, neglecting the crucial role of experiential knowledge derived from similar patient cases -- a key component of human clinical reasoning. To bridge this gap, we propose DoctorRAG, a RAG framework that emulates doctor-like reasoning by integrating both explicit clinical knowledge and implicit case-based experience. DoctorRAG enhances retrieval precision by first allocating conceptual tags for queries and knowledge sources, together with a hybrid retrieval mechanism from both relevant knowledge and patient. In addition, a Med-TextGrad module using multi-agent textual gradients is integrated to ensure that the final output adheres to the retrieved knowledge and patient query. Comprehensive experiments on multilingual, multitask datasets demonstrate that DoctorRAG significantly outperforms strong baseline RAG models and gains improvements from iterative refinements. Our approach generates more accurate, relevant, and comprehensive responses, taking a step towards more doctor-like medical reasoning systems. △ Less

Submitted 26 May, 2025; originally announced May 2025.

Comments: 32 pages, 5 figures, 5 tables

arXiv:2505.18092 [pdf, ps, other]

QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization

Authors: Weizhou Shen, Chenliang Li, Fanqi Wan, Shengyi Liao, Shaopeng Lai, Bo Zhang, Yingcheng Shi, Yuning Wu, Gang Fu, Zhansheng Li, Bin Yang, Ji Zhang, Fei Huang, Jingren Zhou, Ming Yan

Abstract: This technical report presents QwenLong-CPRS, a context compression framework designed for explicit long-context optimization, addressing prohibitive computation overhead during the prefill stage and the "lost in the middle" performance degradation of large language models (LLMs) during long sequence processing. Implemented through a novel dynamic context optimization mechanism, QwenLong-CPRS enab… ▽ More This technical report presents QwenLong-CPRS, a context compression framework designed for explicit long-context optimization, addressing prohibitive computation overhead during the prefill stage and the "lost in the middle" performance degradation of large language models (LLMs) during long sequence processing. Implemented through a novel dynamic context optimization mechanism, QwenLong-CPRS enables multi-granularity context compression guided by natural language instructions, achieving both efficiency gains and improved performance. Evolved from the Qwen architecture series, QwenLong-CPRS introduces four key innovations: (1) Natural language-guided dynamic optimization, (2) Bidirectional reasoning layers for enhanced boundary awareness, (3) Token critic mechanisms with language modeling heads, and (4) Window-parallel inference. Comprehensive evaluations across five benchmarks (4K-2M word contexts) demonstrate QwenLong-CPRS's threefold effectiveness: (1) Consistent superiority over other context management methods like RAG and sparse attention in both accuracy and efficiency. (2) Architecture-agnostic integration with all flagship LLMs, including GPT-4o, Gemini2.0-pro, Claude3.7-sonnet, DeepSeek-v3, and Qwen2.5-max, achieves 21.59$\times$ context compression alongside 19.15-point average performance gains; (3) Deployed with Qwen2.5-32B-Instruct, QwenLong-CPRS surpasses leading proprietary LLMs by 4.85 and 10.88 points on Ruler-128K and InfiniteBench, establishing new SOTA performance. △ Less

Submitted 27 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

arXiv:2505.10910 [pdf, ps, other]

Cloudy mornings and clear evenings on a giant extrasolar world

Authors: Sagnick Mukherjee, David K. Sing, Guangwei Fu, Kevin B. Stevenson, Stephen P. Schmidt, Harry Baskett, Patrick McCreery, Natalie H. Allen, Katherine A. Bennett, Duncan A. Christie, Carlos Gascón, Jayesh Goyal, Éric Hébrard, Joshua D. Lothringer, Mercedes López-Morales, Jacob Lustig-Yaeger, Erin M. May, L. C. Mayorga, Nathan Mayne, Lakeisha M. Ramos Rosado, Henrique Reggiani, Zafar Rustamkulov, Kevin C. Schlaufman, K. S. Sotzen, Daniel Thorngren , et al. (2 additional authors not shown)

Abstract: Aerosols are common in exoplanet atmospheres, but their formation-whether through gas condensation or photochemical reactions-remains uncertain. We report a 6$σ$ detection of limb asymmetry in the transmission spectrum of WASP-94A b, revealing a cloud-covered (11$σ$) cooler morning limb and a clear hotter evening limb with strong H$_2$O absorption (10$σ$). Models suggest cloud droplets formed near… ▽ More Aerosols are common in exoplanet atmospheres, but their formation-whether through gas condensation or photochemical reactions-remains uncertain. We report a 6$σ$ detection of limb asymmetry in the transmission spectrum of WASP-94A b, revealing a cloud-covered (11$σ$) cooler morning limb and a clear hotter evening limb with strong H$_2$O absorption (10$σ$). Models suggest cloud droplets formed near mbar pressures are lofted to 0.01 mbar by strong vertical dynamics in the morning limb. They evaporate when circulated to the hotter evening limb, requiring a minimum 280 K (3$σ$) limb-to-limb temperature difference. We confirm that aerosols in hot Jupiters like WASP-94A b can have clouds cycling between day and night sides instead of photochemical hazes. Ignoring these effects severely biases inferred chemical abundances, showing limb-resolved spectroscopy is critical for characterizing the formation mechanisms of transiting exoplanets-from gas giants to terrestrial exoplanets, indicating the need to reassess inferences from a decade's worth of Hubble Space Telescope observations. △ Less

Submitted 16 May, 2025; originally announced May 2025.

Comments: Submitted, 31 pages, 26 Figures, 7 Tables

arXiv:2505.10880 [pdf, ps, other]

Approximation and Generalization Abilities of Score-based Neural Network Generative Models for Sub-Gaussian Distributions

Authors: Guoji Fu, Wee Sun Lee

Abstract: This paper studies the approximation and generalization abilities of score-based neural network generative models (SGMs) in estimating an unknown distribution $P_0$ from $n$ i.i.d. observations in $d$ dimensions. Assuming merely that $P_0$ is $α$-sub-Gaussian, we prove that for any time step $t \in [t_0, n^{O(1)}]$, where $t_0 \geq O(α^2n^{-2/d}\log n)$, there exists a deep ReLU neural network wit… ▽ More This paper studies the approximation and generalization abilities of score-based neural network generative models (SGMs) in estimating an unknown distribution $P_0$ from $n$ i.i.d. observations in $d$ dimensions. Assuming merely that $P_0$ is $α$-sub-Gaussian, we prove that for any time step $t \in [t_0, n^{O(1)}]$, where $t_0 \geq O(α^2n^{-2/d}\log n)$, there exists a deep ReLU neural network with width $\leq O(\log^3n)$ and depth $\leq O(n^{3/d}\log_2n)$ that can approximate the scores with $\tilde{O}(n^{-1})$ mean square error and achieve a nearly optimal rate of $\tilde{O}(n^{-1}t_0^{-d/2})$ for score estimation, as measured by the score matching loss. Our framework is universal and can be used to establish convergence rates for SGMs under milder assumptions than previous work. For example, assuming further that the target density function $p_0$ lies in Sobolev or Besov classes, with an appropriately early stopping strategy, we demonstrate that neural network-based SGMs can attain nearly minimax convergence rates up to logarithmic factors. Our analysis removes several crucial assumptions, such as Lipschitz continuity of the score function or a strictly positive lower bound on the target density. △ Less

Submitted 16 May, 2025; originally announced May 2025.

Comments: 94 pages

arXiv:2505.10353 [pdf, other]

Photomultiplier Requirements and Pre-Calibration for the SABRE South Liquid Scintillator Veto

Authors: L. J. Milligan, P. Urquijo, E. Barberio, V. U. Bashu, L. J. Bignell, I. Bolognino, S. S. Chhun, F. Dastgiri, T. Fruth, G. Fu, G. C. Hill, Y. Hua, R. S. James, K. Janssens, S. Kapoor, G. J. Lane, K. T. Leaver, P. McGee, L. J. McKie, J. McKenzie, P. C. McNamara, W. J. D. Melbourne, M. Mews, W. H. Ng, K. J. Rule , et al. (10 additional authors not shown)

Abstract: We present a study of the oil-proof base Hamamatsu R5912 photomultiplier tubes that will be used in the SABRE South linear-alkylbenzene liquid scintillator veto. SABRE South is a dark matter direct detection experiment at the Stawell Underground Physics Laboratory, aiming to test the DAMA/LIBRA dark matter annual modulation signal. We discuss the requirements of the liquid scintillator system and… ▽ More We present a study of the oil-proof base Hamamatsu R5912 photomultiplier tubes that will be used in the SABRE South linear-alkylbenzene liquid scintillator veto. SABRE South is a dark matter direct detection experiment at the Stawell Underground Physics Laboratory, aiming to test the DAMA/LIBRA dark matter annual modulation signal. We discuss the requirements of the liquid scintillator system and its photomultipliers, outline the methods and analysis used for the characterisation measurements, and results from initial tests. We discuss the impact of these measurements on the performance of the active veto system and explore analysis methods to allow for low threshold operation. Finally, we include results from a small scale liquid scintillator detector prototype used to assess the future performance of pulse shape discrimination in the liquid scintillator veto, and how well accommodated it is by the R5912 PMTs. △ Less

Submitted 19 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

arXiv:2505.07231 [pdf, ps, other]

Mean Field Portfolio Games with Epstein-Zin Preferences

Authors: Guanxing Fu, Ulrich Horst

Abstract: We study mean field portfolio games under Epstein-Zin preferences, which naturally encompass the classical time-additive power utility as a special case. In a general non-Markovian framework, we establish a uniqueness result by proving a one-to-one correspondence between Nash equilibria and the solutions to a class of BSDEs. A key ingredient in our approach is a necessary stochastic maximum princi… ▽ More We study mean field portfolio games under Epstein-Zin preferences, which naturally encompass the classical time-additive power utility as a special case. In a general non-Markovian framework, we establish a uniqueness result by proving a one-to-one correspondence between Nash equilibria and the solutions to a class of BSDEs. A key ingredient in our approach is a necessary stochastic maximum principle tailored to Epstein-Zin utility and a nonlinear transformation. In the deterministic setting, we further derive an explicit closed-form solution for the equilibrium investment and consumption policies. △ Less

Submitted 12 May, 2025; originally announced May 2025.

Comments: 25 pages; comments are welcome

arXiv:2505.05041 [pdf, other]

ADNP-15: An Open-Source Histopathological Dataset for Neuritic Plaque Segmentation in Human Brain Whole Slide Images with Frequency Domain Image Enhancement for Stain Normalization

Authors: Chenxi Zhao, Jianqiang Li, Qing Zhao, Jing Bai, Susana Boluda, Benoit Delatour, Lev Stimmer, Daniel Racoceanu, Gabriel Jimenez, Guanghui Fu

Abstract: Alzheimer's Disease (AD) is a neurodegenerative disorder characterized by amyloid-beta plaques and tau neurofibrillary tangles, which serve as key histopathological features. The identification and segmentation of these lesions are crucial for understanding AD progression but remain challenging due to the lack of large-scale annotated datasets and the impact of staining variations on automated ima… ▽ More Alzheimer's Disease (AD) is a neurodegenerative disorder characterized by amyloid-beta plaques and tau neurofibrillary tangles, which serve as key histopathological features. The identification and segmentation of these lesions are crucial for understanding AD progression but remain challenging due to the lack of large-scale annotated datasets and the impact of staining variations on automated image analysis. Deep learning has emerged as a powerful tool for pathology image segmentation; however, model performance is significantly influenced by variations in staining characteristics, necessitating effective stain normalization and enhancement techniques. In this study, we address these challenges by introducing an open-source dataset (ADNP-15) of neuritic plaques (i.e., amyloid deposits combined with a crown of dystrophic tau-positive neurites) in human brain whole slide images. We establish a comprehensive benchmark by evaluating five widely adopted deep learning models across four stain normalization techniques, providing deeper insights into their influence on neuritic plaque segmentation. Additionally, we propose a novel image enhancement method that improves segmentation accuracy, particularly in complex tissue structures, by enhancing structural details and mitigating staining inconsistencies. Our experimental results demonstrate that this enhancement strategy significantly boosts model generalization and segmentation accuracy. All datasets and code are open-source, ensuring transparency and reproducibility while enabling further advancements in the field. △ Less

Submitted 8 May, 2025; originally announced May 2025.

arXiv:2504.17209 [pdf, ps, other]

Characterisation of Hamamatsu R11065-20 PMTs for use in the SABRE South NaI(Tl) Crystal Detectors

Authors: O. Stanley, W. J. D. Melbourne, P. Urquijo, E. Barberio, V. U. Bashu, L. J. Bignell, I. Bolognino, G. Brooks, S. S. Chhun, F. Dastgiri, M. B. Froehlich, T. Fruth, G. Fu, G. C. Hill, R. S. James, K. Janssens, S. Kapoor, G. J. Lane, K. T. Leaver, P. McGee, P. C. McNamara, J. McKenzie, L. J. McKie, M. Mews, L. J. Milligan , et al. (9 additional authors not shown)

Abstract: The SABRE Experiment is a direct detection dark matter experiment using a target composed of multiple NaI(Tl) crystals. The experiment aims to be an independent check of the DAMA/LIBRA results with a detector in the Northern (Laboratori Nazionali Del Gran Sasso, LNGS) and Southern (Stawell Underground Physics Laboratory, SUPL) hemispheres. The SABRE South photomultiplier tubes (PMTs) will be used… ▽ More The SABRE Experiment is a direct detection dark matter experiment using a target composed of multiple NaI(Tl) crystals. The experiment aims to be an independent check of the DAMA/LIBRA results with a detector in the Northern (Laboratori Nazionali Del Gran Sasso, LNGS) and Southern (Stawell Underground Physics Laboratory, SUPL) hemispheres. The SABRE South photomultiplier tubes (PMTs) will be used near the low energy noise threshold and require a detailed calibration of their performance and contributions to the background in the NaI(Tl) dark matter search, prior to installation. We present the development of the pre-calibration procedures for the R11065-20 Hamamatsu PMTs. These PMTs are directly coupled to the NaI(Tl) crystals within the SABRE South experiment. In this paper we present methodologies to characterise the gain, dark rate, and timing properties of the PMTs. We develop a method for in-situ calibration without a light injection source. Additionally we explore the application of machine learning techniques using a Boosted Decision Tree (BDT) trained on the response of single PMTs to understand the information available for background rejection. Finally, we briefly present the simulation tool used to generate digitised PMT data from optical Monte Carlo simulations. △ Less

Submitted 15 June, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

arXiv:2504.15667 [pdf, other]

Performance Estimation for Supervised Medical Image Segmentation Models on Unlabeled Data Using UniverSeg

Authors: Jingchen Zou, Jianqiang Li, Gabriel Jimenez, Qing Zhao, Daniel Racoceanu, Matias Cosarinsky, Enzo Ferrante, Guanghui Fu

Abstract: The performance of medical image segmentation models is usually evaluated using metrics like the Dice score and Hausdorff distance, which compare predicted masks to ground truth annotations. However, when applying the model to unseen data, such as in clinical settings, it is often impractical to annotate all the data, making the model's performance uncertain. To address this challenge, we propose… ▽ More The performance of medical image segmentation models is usually evaluated using metrics like the Dice score and Hausdorff distance, which compare predicted masks to ground truth annotations. However, when applying the model to unseen data, such as in clinical settings, it is often impractical to annotate all the data, making the model's performance uncertain. To address this challenge, we propose the Segmentation Performance Evaluator (SPE), a framework for estimating segmentation models' performance on unlabeled data. This framework is adaptable to various evaluation metrics and model architectures. Experiments on six publicly available datasets across six evaluation metrics including pixel-based metrics such as Dice score and distance-based metrics like HD95, demonstrated the versatility and effectiveness of our approach, achieving a high correlation (0.956$\pm$0.046) and low MAE (0.025$\pm$0.019) compare with real Dice score on the independent test set. These results highlight its ability to reliably estimate model performance without requiring annotations. The SPE framework integrates seamlessly into any model training process without adding training overhead, enabling performance estimation and facilitating the real-world application of medical image segmentation algorithms. The source code is publicly available △ Less

Submitted 22 April, 2025; originally announced April 2025.

arXiv:2504.14321 [pdf, other]

Multimodal Coreference Resolution for Chinese Social Media Dialogues: Dataset and Benchmark Approach

Authors: Xingyu Li, Chen Gong, Guohong Fu

Abstract: Multimodal coreference resolution (MCR) aims to identify mentions referring to the same entity across different modalities, such as text and visuals, and is essential for understanding multimodal content. In the era of rapidly growing mutimodal content and social media, MCR is particularly crucial for interpreting user interactions and bridging text-visual references to improve communication and p… ▽ More Multimodal coreference resolution (MCR) aims to identify mentions referring to the same entity across different modalities, such as text and visuals, and is essential for understanding multimodal content. In the era of rapidly growing mutimodal content and social media, MCR is particularly crucial for interpreting user interactions and bridging text-visual references to improve communication and personalization. However, MCR research for real-world dialogues remains unexplored due to the lack of sufficient data resources. To address this gap, we introduce TikTalkCoref, the first Chinese multimodal coreference dataset for social media in real-world scenarios, derived from the popular Douyin short-video platform. This dataset pairs short videos with corresponding textual dialogues from user comments and includes manually annotated coreference clusters for both person mentions in the text and the coreferential person head regions in the corresponding video frames. We also present an effective benchmark approach for MCR, focusing on the celebrity domain, and conduct extensive experiments on our dataset, providing reliable benchmark results for this newly constructed dataset. We will release the TikTalkCoref dataset to facilitate future research on MCR for real-world social media dialogues. △ Less

Submitted 19 May, 2025; v1 submitted 19 April, 2025; originally announced April 2025.

arXiv:2504.11284 [pdf, ps, other]

Bipartite Ranking From Multiple Labels: On Loss Versus Label Aggregation

Authors: Michal Lukasik, Lin Chen, Harikrishna Narasimhan, Aditya Krishna Menon, Wittawat Jitkrittum, Felix X. Yu, Sashank J. Reddi, Gang Fu, Mohammadhossein Bateni, Sanjiv Kumar

Abstract: Bipartite ranking is a fundamental supervised learning problem, with the goal of learning a ranking over instances with maximal Area Under the ROC Curve (AUC) against a single binary target label. However, one may often observe multiple binary target labels, e.g., from distinct human annotators. How can one synthesize such labels into a single coherent ranking? In this work, we formally analyze tw… ▽ More Bipartite ranking is a fundamental supervised learning problem, with the goal of learning a ranking over instances with maximal Area Under the ROC Curve (AUC) against a single binary target label. However, one may often observe multiple binary target labels, e.g., from distinct human annotators. How can one synthesize such labels into a single coherent ranking? In this work, we formally analyze two approaches to this problem -- loss aggregation and label aggregation -- by characterizing their Bayes-optimal solutions. We show that while both approaches can yield Pareto-optimal solutions, loss aggregation can exhibit label dictatorship: one can inadvertently (and undesirably) favor one label over others. This suggests that label aggregation can be preferable to loss aggregation, which we empirically verify. △ Less

Submitted 9 June, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

Comments: Accepted by ICML 2025

arXiv:2504.02222 [pdf, other]

APSeg: Auto-Prompt Model with Acquired and Injected Knowledge for Nuclear Instance Segmentation and Classification

Authors: Liying Xu, Hongliang He, Wei Han, Hanbin Huang, Siwei Feng, Guohong Fu

Abstract: Nuclear instance segmentation and classification provide critical quantitative foundations for digital pathology diagnosis. With the advent of the foundational Segment Anything Model (SAM), the accuracy and efficiency of nuclear segmentation have improved significantly. However, SAM imposes a strong reliance on precise prompts, and its class-agnostic design renders its classification results entir… ▽ More Nuclear instance segmentation and classification provide critical quantitative foundations for digital pathology diagnosis. With the advent of the foundational Segment Anything Model (SAM), the accuracy and efficiency of nuclear segmentation have improved significantly. However, SAM imposes a strong reliance on precise prompts, and its class-agnostic design renders its classification results entirely dependent on the provided prompts. Therefore, we focus on generating prompts with more accurate localization and classification and propose \textbf{APSeg}, \textbf{A}uto-\textbf{P}rompt model with acquired and injected knowledge for nuclear instance \textbf{Seg}mentation and classification. APSeg incorporates two knowledge-aware modules: (1) Distribution-Guided Proposal Offset Module (\textbf{DG-POM}), which learns distribution knowledge through density map guided, and (2) Category Knowledge Semantic Injection Module (\textbf{CK-SIM}), which injects morphological knowledge derived from category descriptions. We conducted extensive experiments on the PanNuke and CoNSeP datasets, demonstrating the effectiveness of our approach. The code will be released upon acceptance. △ Less

Submitted 2 April, 2025; originally announced April 2025.

Comments: 10 pages, 3 figures

arXiv:2504.01577 [pdf, other]

Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology

Authors: Lirui Qi, Hongliang He, Tong Wang, Siwei Feng, Guohong Fu

Abstract: Nuclear instance segmentation plays a vital role in disease diagnosis within digital pathology. However, limited labeled data in pathological images restricts the overall performance of nuclear instance segmentation. To tackle this challenge, we propose a novel data augmentation framework Instance Migration Diffusion Model (IM-Diffusion), IM-Diffusion designed to generate more varied pathological… ▽ More Nuclear instance segmentation plays a vital role in disease diagnosis within digital pathology. However, limited labeled data in pathological images restricts the overall performance of nuclear instance segmentation. To tackle this challenge, we propose a novel data augmentation framework Instance Migration Diffusion Model (IM-Diffusion), IM-Diffusion designed to generate more varied pathological images by constructing diverse nuclear layouts and internuclear spatial relationships. In detail, we introduce a Nuclear Migration Module (NMM) which constructs diverse nuclear layouts by simulating the process of nuclear migration. Building on this, we further present an Internuclear-regions Inpainting Module (IIM) to generate diverse internuclear spatial relationships by structure-aware inpainting. On the basis of the above, IM-Diffusion generates more diverse pathological images with different layouts and internuclear spatial relationships, thereby facilitating downstream tasks. Evaluation on the CoNSeP and GLySAC datasets demonstrate that the images generated by IM-Diffusion effectively enhance overall instance segmentation performance. Code will be made public later. △ Less

Submitted 2 April, 2025; originally announced April 2025.

arXiv:2503.04522 [pdf, other]

In-Context Reverse Classification Accuracy: Efficient Estimation of Segmentation Quality without Ground-Truth

Authors: Matias Cosarinsky, Ramiro Billot, Lucas Mansilla, Gabriel Gimenez, Nicolas Gaggión, Guanghui Fu, Enzo Ferrante

Abstract: Assessing the quality of automatic image segmentation is crucial in clinical practice, but often very challenging due to the limited availability of ground truth annotations. In this paper, we introduce In-Context Reverse Classification Accuracy (In-Context RCA), a novel framework for automatically estimating segmentation quality in the absence of ground-truth annotations. By leveraging recent in-… ▽ More Assessing the quality of automatic image segmentation is crucial in clinical practice, but often very challenging due to the limited availability of ground truth annotations. In this paper, we introduce In-Context Reverse Classification Accuracy (In-Context RCA), a novel framework for automatically estimating segmentation quality in the absence of ground-truth annotations. By leveraging recent in-context learning segmentation models and incorporating retrieval-augmentation techniques to select the most relevant reference images, our approach enables efficient quality estimation with minimal reference data. Validated across diverse medical imaging modalities, our method demonstrates robust performance and computational efficiency, offering a promising solution for automated quality control in clinical workflows, where fast and reliable segmentation assessment is essential. The code is available at https://github.com/mcosarinsky/In-Context-RCA. △ Less

Submitted 6 March, 2025; originally announced March 2025.

arXiv:2503.03895 [pdf, other]

doi 10.3847/1538-3881/adc684

HST Transmission Spectra of the Hot-Neptune HD 219666 b: Detection of Water and the Challenge of Constraining Both Water and Methane

Authors: Matthew M. Murphy, Thomas G. Beatty, Luis Welbanks, Guangwei Fu

Abstract: Although Neptunian-sized (2 - 5 R$_{Earth}$) planets appear to be extremely common in the Galaxy, many mysteries remain about their overall nature. To date, only 11 Neptunian-sized planets have had their atmospheres spectroscopically characterized, and these observations hint at interesting diversity within this class of planets. Much of our understanding of these worlds and others derive from tra… ▽ More Although Neptunian-sized (2 - 5 R$_{Earth}$) planets appear to be extremely common in the Galaxy, many mysteries remain about their overall nature. To date, only 11 Neptunian-sized planets have had their atmospheres spectroscopically characterized, and these observations hint at interesting diversity within this class of planets. Much of our understanding of these worlds and others derive from transmission spectroscopy with the Hubble Space Telescope's Wide Field Camera 3 (HST/WFC3). One key outcome of HST/WFC3 observations has been the consistent detection of water but no methane in Neptunian atmospheres, though recent James Webb Space Telescope (JWST) observations are potentially starting to overturn this "missing methane" paradigm. In this work, we present the transmission spectrum of the hot Neptune HD 219666 b from 1.1 - 1.6 $μ$m from two transit observations using HST/WFC3 G141. Our fiducial atmospheric retrieval detects water at ~3-$σ$ in HD 219666 b's atmosphere and prefers no contribution from methane, similar to these previous observations of other planets. Motivated by recent detections of methane in Neptunian atmospheres by JWST, we explore additional models and find that a methane-only scenario could adequately fit the data, though it is not preferred and likely unphysical. We discuss the impact of this methane detection challenge on our understanding of planetary atmospheres based on HST/WFC3 observations alone, and where JWST observations offer a solution. △ Less

Submitted 7 May, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

Comments: This work has been published in the Astronomical Journal. 25 pages including references. 5 figures and 3 tables in Main Text, 5 figures in Appendix

Journal ref: AJ 169 286 (2025)

arXiv:2503.00989 [pdf, other]

A four-field mixed formulation for incompressible finite elasticity

Authors: Guosheng Fu, Michael Neunteufel, Joachim Schöberl, Adam Zdunek

Abstract: In this work, we generalize the mass-conserving mixed stress (MCS) finite element method for Stokes equations [Gopalakrishnan J., Lederer P., and Schöberl J., A mass conserving mixed stress formulation for the Stokes equations, IMA Journal of Numerical Analysis 40(3), 1838-1874 (2019)], involving normal velocity and tangential-normal stress continuous fields, to incompressible finite elasticity. B… ▽ More In this work, we generalize the mass-conserving mixed stress (MCS) finite element method for Stokes equations [Gopalakrishnan J., Lederer P., and Schöberl J., A mass conserving mixed stress formulation for the Stokes equations, IMA Journal of Numerical Analysis 40(3), 1838-1874 (2019)], involving normal velocity and tangential-normal stress continuous fields, to incompressible finite elasticity. By means of the three-field Hu-Washizu principle, introducing the displacement gradient and 1st Piola-Kirchhoff stress tensor as additional fields, we circumvent the inversion of the constitutive law. We lift the arising distributional derivatives of the displacement gradient to a regular auxiliary displacement gradient field. Static condensation can be applied at the element level, providing a global pure displacement problem to be solved. We present a stabilization motivated by Hybrid Discontinuous Galerkin methods. A solving algorithm is discussed, which asserts the solvability of the arising linearized subproblems for problems with physically positive eigenvalues. The excellent performance of the proposed method is corroborated by several numerical experiments. △ Less

Submitted 2 March, 2025; originally announced March 2025.

MSC Class: 74S05 (Primary) 74B20 (Secondary)

arXiv:2502.19794 [pdf, other]

ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model

Authors: Chuanliu Fan, Ziqiang Cao, Zicheng Ma, Nan Yu, Yimin Peng, Jun Zhang, Yiqin Gao, Guohong Fu

Abstract: Goal-oriented de novo molecule design, namely generating molecules with specific property or substructure constraints, is a crucial yet challenging task in drug discovery. Existing methods, such as Bayesian optimization and reinforcement learning, often require training multiple property predictors and struggle to incorporate substructure constraints. Inspired by the success of Large Language Mode… ▽ More Goal-oriented de novo molecule design, namely generating molecules with specific property or substructure constraints, is a crucial yet challenging task in drug discovery. Existing methods, such as Bayesian optimization and reinforcement learning, often require training multiple property predictors and struggle to incorporate substructure constraints. Inspired by the success of Large Language Models (LLMs) in text generation, we propose ChatMol, a novel approach that leverages LLMs for molecule design across diverse constraint settings. Initially, we crafted a molecule representation compatible with LLMs and validated its efficacy across multiple online LLMs. Afterwards, we developed specific prompts geared towards diverse constrained molecule generation tasks to further fine-tune current LLMs while integrating feedback learning derived from property prediction. Finally, to address the limitations of LLMs in numerical recognition, we referred to the position encoding method and incorporated additional encoding for numerical values within the prompt. Experimental results across single-property, substructure-property, and multi-property constrained tasks demonstrate that ChatMol consistently outperforms state-of-the-art baselines, including VAE and RL-based methods. Notably, in multi-objective binding affinity maximization task, ChatMol achieves a significantly lower KD value of 0.25 for the protein target ESR1, while maintaining the highest overall performance, surpassing previous methods by 4.76%. Meanwhile, with numerical enhancement, the Pearson correlation coefficient between the instructed property values and those of the generated molecules increased by up to 0.49. These findings highlight the potential of LLMs as a versatile framework for molecule generation, offering a promising alternative to traditional latent space and RL-based approaches. △ Less

Submitted 27 February, 2025; originally announced February 2025.

Comments: 16 pages, 8 figures,conference

arXiv:2502.10455 [pdf, other]

E2LVLM:Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection

Authors: Junjie Wu, Yumeng Fu, Nan Yu, Guohong Fu

Abstract: Recent studies in Large Vision-Language Models (LVLMs) have demonstrated impressive advancements in multimodal Out-of-Context (OOC) misinformation detection, discerning whether an authentic image is wrongly used in a claim. Despite their success, the textual evidence of authentic images retrieved from the inverse search is directly transmitted to LVLMs, leading to inaccurate or false information i… ▽ More Recent studies in Large Vision-Language Models (LVLMs) have demonstrated impressive advancements in multimodal Out-of-Context (OOC) misinformation detection, discerning whether an authentic image is wrongly used in a claim. Despite their success, the textual evidence of authentic images retrieved from the inverse search is directly transmitted to LVLMs, leading to inaccurate or false information in the decision-making phase. To this end, we present E2LVLM, a novel evidence-enhanced large vision-language model by adapting textual evidence in two levels. First, motivated by the fact that textual evidence provided by external tools struggles to align with LVLMs inputs, we devise a reranking and rewriting strategy for generating coherent and contextually attuned content, thereby driving the aligned and effective behavior of LVLMs pertinent to authentic images. Second, to address the scarcity of news domain datasets with both judgment and explanation, we generate a novel OOC multimodal instruction-following dataset by prompting LVLMs with informative content to acquire plausible explanations. Further, we develop a multimodal instruction-tuning strategy with convincing explanations for beyond detection. This scheme contributes to E2LVLM for multimodal OOC misinformation detection and explanation. A multitude of experiments demonstrate that E2LVLM achieves superior performance than state-of-the-art methods, and also provides compelling rationales for judgments. △ Less

Submitted 11 February, 2025; originally announced February 2025.

arXiv:2502.07867 [pdf]

doi 10.3847/1538-4357/ad9dd3

The Magnetically Induced Radial Velocity Variation of Gliese 341 and an Upper Limit to the Mass of Its Transiting Earth-sized Planet

Authors: Victoria DiTomasso, Mercedes Lopez-Morales, Sarah Peacock, Luca Malavolta, James Kirk, Kevin B. Stevenson, Guangwei Fu, Jacob Lustig-Yaeger

Abstract: The Transiting Exoplanet Survey Satellite (TESS) mission identified a potential 0.88 REarth planet with a period of 7.577 days, orbiting the nearby M1V star GJ 341 (TOI 741.01). This system has already been observed by the James Webb Space Telescope (JWST) to search for presence of an atmosphere on this planet. Here, we present an in-depth analysis of the GJ 341 system using all available public d… ▽ More The Transiting Exoplanet Survey Satellite (TESS) mission identified a potential 0.88 REarth planet with a period of 7.577 days, orbiting the nearby M1V star GJ 341 (TOI 741.01). This system has already been observed by the James Webb Space Telescope (JWST) to search for presence of an atmosphere on this planet. Here, we present an in-depth analysis of the GJ 341 system using all available public data. We provide improved parameters for the host star, an updated value of the planet radius, and support the planetary nature of the object (now GJ 341 b). We use 57 HARPS radial velocities to model the magnetic cycle and activity of the host star, and constrain the mass of GJ 341 b to upper limits of 4.0 MEarth (3 sigma) and 2.9 MEarth (1 sigma). We also rule out the presence of additional companions with M sin i > 15.1 MEarth, and P < 1750 days, and the presence of contaminating background objects during the TESS and JWST observations. These results provide key information to aid the interpretation of the recent JWST atmospheric observations and other future observations of this planet. △ Less

Submitted 11 February, 2025; originally announced February 2025.

Comments: 18 pages, 17 figures

Journal ref: The Astrophysical Journal, Volume 979, Issue 2, id.214, 18 pp. February 2025

arXiv:2502.06785 [pdf, other]

DeepCrossAttention: Supercharging Transformer Residual Connections

Authors: Mike Heddes, Adel Javanmard, Kyriakos Axiotis, Gang Fu, MohammadHossein Bateni, Vahab Mirrokni

Abstract: Transformer networks have achieved remarkable success across diverse domains, leveraging a variety of architectural innovations, including residual connections. However, traditional residual connections, which simply sum the outputs of previous layers, can dilute crucial information. This work introduces DeepCrossAttention (DCA), an approach that enhances residual learning in transformers. DCA emp… ▽ More Transformer networks have achieved remarkable success across diverse domains, leveraging a variety of architectural innovations, including residual connections. However, traditional residual connections, which simply sum the outputs of previous layers, can dilute crucial information. This work introduces DeepCrossAttention (DCA), an approach that enhances residual learning in transformers. DCA employs learnable, input-dependent weights to dynamically combine layer outputs, enabling the model to selectively focus on the most relevant information in any of the previous layers. Furthermore, DCA incorporates depth-wise cross-attention, allowing for richer interactions between layers at different depths. Our language modeling experiments show that DCA achieves improved perplexity for a given training time. Moreover, DCA obtains the same model quality up to 3x faster while adding a negligible number of parameters. Theoretical analysis confirms that DCA provides an improved trade-off between accuracy and model size when the ratio of collective layer ranks to the ambient dimension falls below a critical threshold. △ Less

Submitted 10 February, 2025; originally announced February 2025.

arXiv:2501.08696 [pdf, other]

Deep Learning-Based Feature Fusion for Emotion Analysis and Suicide Risk Differentiation in Chinese Psychological Support Hotlines

Authors: Han Wang, Jianqiang Li, Qing Zhao, Zhonglong Chen, Changwei Song, Jing Tang, Yuning Huang, Wei Zhai, Yongsheng Tong, Guanghui Fu

Abstract: Mental health is a critical global public health issue, and psychological support hotlines play a pivotal role in providing mental health assistance and identifying suicide risks at an early stage. However, the emotional expressions conveyed during these calls remain underexplored in current research. This study introduces a method that combines pitch acoustic features with deep learning-based fea… ▽ More Mental health is a critical global public health issue, and psychological support hotlines play a pivotal role in providing mental health assistance and identifying suicide risks at an early stage. However, the emotional expressions conveyed during these calls remain underexplored in current research. This study introduces a method that combines pitch acoustic features with deep learning-based features to analyze and understand emotions expressed during hotline interactions. Using data from China's largest psychological support hotline, our method achieved an F1-score of 79.13% for negative binary emotion classification.Additionally, the proposed approach was validated on an open dataset for multi-class emotion classification,where it demonstrated better performance compared to the state-of-the-art methods. To explore its clinical relevance, we applied the model to analysis the frequency of negative emotions and the rate of emotional change in the conversation, comparing 46 subjects with suicidal behavior to those without. While the suicidal group exhibited more frequent emotional changes than the non-suicidal group, the difference was not statistically significant.Importantly, our findings suggest that emotional fluctuation intensity and frequency could serve as novel features for psychological assessment scales and suicide risk prediction.The proposed method provides valuable insights into emotional dynamics and has the potential to advance early intervention and improve suicide prevention strategies through integration with clinical tools and assessments The source code is publicly available at https://github.com/Sco-field/Speechemotionrecognition/tree/main. △ Less

Submitted 15 January, 2025; originally announced January 2025.

arXiv:2501.02081 [pdf, other]

Statistical trends in JWST transiting exoplanet atmospheres

Authors: Guangwei Fu, Kevin B. Stevenson, David K. Sing, Sagnick Mukherjee, Luis Welbanks, Daniel Thorngren, Shang-Min Tsai, Peter Gao, Joshua Lothringer, Thomas G. Beatty, Cyril Gapp, Thomas M. Evans-Soma, Romain Allart, Stefan Pelletier, Pa Chia Thao, Andrew W. Mann

Abstract: Our brains are hardwired for pattern recognition as correlations are useful for predicting and understanding nature. As more exoplanet atmospheres are being characterized with JWST, we are starting to unveil their properties on a population level. Here we present a framework for comparing exoplanet transmission spectroscopy from 3 to 5$μ$m with four bands: L (2.9 - 3.7$μ$m), SO$_2$ (3.95 - 4.1$μ$m… ▽ More Our brains are hardwired for pattern recognition as correlations are useful for predicting and understanding nature. As more exoplanet atmospheres are being characterized with JWST, we are starting to unveil their properties on a population level. Here we present a framework for comparing exoplanet transmission spectroscopy from 3 to 5$μ$m with four bands: L (2.9 - 3.7$μ$m), SO$_2$ (3.95 - 4.1$μ$m), CO$_2$ (4.25 - 4.4$μ$m) and CO (4.5 - 4.9$μ$m). Together, the four bands cover the major carbon, oxygen, nitrogen, and sulfur-bearing molecules including H$_2$O, CH$_4$, NH$_3$, H$_2$S, SO$_2$, CO$_2$, and CO. Among the eight high-precision gas giant exoplanet planet spectra we collected, we found strong correlations between the SO$_2$-L index and planet mass (r=-0.41$\pm$0.09) and temperature (r=-0.64$\pm$0.08), indicating SO$_2$ preferably exists (SO$_2$-L$>$-0.5) among low mass ($\sim<$0.3M$_J$) and cooler ($\sim<$1200K) targets. We also observe strong temperature dependency for both CO$_2$-L and CO-L indices. Under equilibrium chemistry and isothermal thermal structure assumptions, we find that the planet sample favors super-solar metallicity and low C/O ratio ($<$0.7). In addition, the presence of a mass-metallicity correlation is favored over uniform metallicity with the eight planets. We further introduce the SO$_2$-L versus CO$_2$-L diagram alike the color-magnitude diagram for stars and brown dwarfs. All reported trends here will be testable and be further quantified with existing and future JWST observations within the next few years. △ Less

Submitted 3 January, 2025; originally announced January 2025.

Comments: Accepted to ApJ, JWST keeps on delivering!

arXiv:2501.01498 [pdf, other]

TOI-421 b: A Hot Sub-Neptune with a Haze-Free, Low Mean Molecular Weight Atmosphere

Authors: Brian Davenport, Eliza M. -R. Kempton, Matthew C. Nixon, Jegug Ih, Drake Deming, Guangwei Fu, E. M. May, Jacob L. Bean, Peter Gao, Leslie Rogers, Matej Malik

Abstract: Common features of sub-Neptunes atmospheres observed to date include signatures of aerosols at moderate equilibrium temperatures (~500-800 K), and a prevalence of high mean molecular weight atmospheres, perhaps indicating novel classes of planets such as water worlds. Here we present a 0.83-5 micron JWST transmission spectrum of the sub-Neptune TOI-421 b. This planet is unique among previously obs… ▽ More Common features of sub-Neptunes atmospheres observed to date include signatures of aerosols at moderate equilibrium temperatures (~500-800 K), and a prevalence of high mean molecular weight atmospheres, perhaps indicating novel classes of planets such as water worlds. Here we present a 0.83-5 micron JWST transmission spectrum of the sub-Neptune TOI-421 b. This planet is unique among previously observed counterparts in its high equilibrium temperature ($T_{eq} \approx 920$) and its Sun-like host star. We find marked differences between the atmosphere of TOI-421 b and those of sub-Neptunes previously characterized with JWST, which all orbit M stars. Specifically, water features in the NIRISS/SOSS bandpass indicate a low mean molecular weight atmosphere consistent with solar metallicity, and no appreciable aerosol coverage. Hints of SO$_2$ and CO (but not CO$_2$ or CH$_4$) also exist in our NIRSpec/G395M observations, but not at sufficient signal-to-noise to draw firm conclusions. Our results support a picture in which sub-Neptunes hotter than ~850 K do not form hydrocarbon hazes due to a lack of methane to photolyze. TOI-421 b additionally fits the paradigm of the radius valley for planets orbiting FGK stars being sculpted by mass loss processes, which would leave behind primordial atmospheres overlying rock/iron interiors. Further observations of TOI-421 b and similar hot sub-Neptunes will confirm whether haze-free atmospheres and low mean molecular weights are universal characteristics of such objects. △ Less

Submitted 2 January, 2025; originally announced January 2025.

Comments: Submitted to ApJ Letters, comments welcome

arXiv:2501.00385 [pdf, ps, other]

Closed-form formulas in number-conserved pairing theory

Authors: G. J. Fu

Abstract: In this work, I present closed-form formulas for the norm and many-body density matrices between general wave functions with exact particle numbers in pairing theory, using properties of the generalized Kronecker delta. These formulas, expressed as sums of minors and Pfaffians, apply to both even and odd particle-number systems and accommodate pair condensate as well as broken-pair configurations.… ▽ More In this work, I present closed-form formulas for the norm and many-body density matrices between general wave functions with exact particle numbers in pairing theory, using properties of the generalized Kronecker delta. These formulas, expressed as sums of minors and Pfaffians, apply to both even and odd particle-number systems and accommodate pair condensate as well as broken-pair configurations. This formalism directly facilitates applications in the generator coordinate method and symmetry restoration techniques, including angular momentum projection. △ Less

Submitted 31 December, 2024; originally announced January 2025.

arXiv:2412.21039 [pdf, other]

A locally-conservative proximal Galerkin method for pointwise bound constraints

Authors: Guosheng Fu, Brendan Keith, Rami Masri

Abstract: We introduce the first-order system proximal Galerkin (FOSPG) method, a locally mass-conserving, hybridizable finite element method for solving heterogeneous anisotropic diffusion and obstacle problems. Like other proximal Galerkin methods, FOSPG finds solutions by solving a recursive sequence of smooth, discretized, nonlinear subproblems. We establish the well-posedness and convergence of these n… ▽ More We introduce the first-order system proximal Galerkin (FOSPG) method, a locally mass-conserving, hybridizable finite element method for solving heterogeneous anisotropic diffusion and obstacle problems. Like other proximal Galerkin methods, FOSPG finds solutions by solving a recursive sequence of smooth, discretized, nonlinear subproblems. We establish the well-posedness and convergence of these nonlinear subproblems along with stability and error estimates under low regularity assumptions for the linearized equations obtained by solving each subproblem using Newton's method. The FOSPG method exhibits several advantages, including high-order accuracy, discrete maximum principle or bound-preserving discrete solutions, and local mass conservation. It also achieves prescribed solution accuracy within asymptotically mesh-independent numbers of subproblems and linear solves per subproblem iteration. Numerical experiments on benchmarks for anisotropic diffusion and obstacle problems confirm these attributes. Furthermore, an open-source implementation of the method is provided to facilitate broader adoption and reproducibility. △ Less

Submitted 30 December, 2024; originally announced December 2024.

MSC Class: 35J86; 49J40; 65N30

arXiv:2412.20457 [pdf, other]

Quasinormal modes of a d-dimensional regular black hole featuring an integrable singularity

Authors: Zhongzhinan Dong, Dan Zhang, Guoyang Fu, Jian-Pin Wu

Abstract: In this paper, we exhaustively investigate the quasinormal modes (QNMs) of a probe scalar field over a d-dimensional regular black hole (BH) characterized by the parameter A. The quasinormal frequencies (QNFs) exhibit different behaviors with respect to the parameter A for d = 4 and d > 4. Firstly, the trends of QNFs with respect to A exhibit completely opposite patterns for the case of d = 4 and… ▽ More In this paper, we exhaustively investigate the quasinormal modes (QNMs) of a probe scalar field over a d-dimensional regular black hole (BH) characterized by the parameter A. The quasinormal frequencies (QNFs) exhibit different behaviors with respect to the parameter A for d = 4 and d > 4. Firstly, the trends of QNFs with respect to A exhibit completely opposite patterns for the case of d = 4 and d > 4. Secondly, in the 4-dimensional regular BH, a non-monotonic behavior with respect to A is observed in the imaginary part of the fundamental modes with vanishing angular quantum number. In contrast, this non-monotonic behavior only appears in the overtones when d > 4. Thirdly, an overtone outburst accompanied by an oscillatory patter is observed only in the case of d > 4, but not in d = 4. △ Less

Submitted 7 March, 2025; v1 submitted 29 December, 2024; originally announced December 2024.

arXiv:2412.20450 [pdf, other]

Observational appearances of an inner extremal regular black hole illuminated by various accretion flows

Authors: Dan Zhang, Guoyang Fu, Xi-Jing Wang, Qiyuan Pan, Xiao-Mei Kuang, Jian-Pin Wu

Abstract: This paper investigates the observational appearances of an inner extremal regular black hole(IERBH) illuminated by various types of accretion models. The study reveals that when the BH is illuminated by specific accretion flows, the effects of quantum gravity become more pronounced,significantly impacting key observational features such as the shadow radius, photon ring, and total observed intens… ▽ More This paper investigates the observational appearances of an inner extremal regular black hole(IERBH) illuminated by various types of accretion models. The study reveals that when the BH is illuminated by specific accretion flows, the effects of quantum gravity become more pronounced,significantly impacting key observational features such as the shadow radius, photon ring, and total observed intensity. Specifically, the introduction of a more realistic radially infalling spherical accretion flow further accentuates these differences. This dynamic flow results in a darker central region in the BH image due to the Doppler effect, which modulates the observed intensity based on the relative motion of the infalling matter. The shadow radius and total observed intensity are notably affected by the quantum correction parameters, providing additional signatures that distinguish regular BHs from their classical counterparts. △ Less

Submitted 29 December, 2024; originally announced December 2024.

arXiv:2412.19058 [pdf, ps, other]

A System of BSDEs with Singular Terminal Values Arising in Optimal Liquidation with Regime Switching

Authors: Guanxing Fu, Xiaomin Shi, Zuo Quan Xu

Abstract: We study a stochastic control problem with regime switching arising in an optimal liquidation problem with dark pools and multiple regimes. The new feature of this model is that it introduces a system of BSDEs with jumps and with singular terminal values, which appears in literature for the first time. The existence result for this system is obtained. As a result, we solve the stochastic control p… ▽ More We study a stochastic control problem with regime switching arising in an optimal liquidation problem with dark pools and multiple regimes. The new feature of this model is that it introduces a system of BSDEs with jumps and with singular terminal values, which appears in literature for the first time. The existence result for this system is obtained. As a result, we solve the stochastic control problem with regime switching. More importantly, the uniqueness result of this system is also obtained, in contrast to merely minimal solutions established in most related literature. △ Less

Submitted 19 January, 2025; v1 submitted 26 December, 2024; originally announced December 2024.

Comments: 19 pages

arXiv:2411.13889 [pdf, other]

doi 10.1088/1748-0221/20/04/T04001

The SABRE South Technical Design Report Executive Summary

Authors: E. Barberio, T. Baroncelli, V. U. Bashu, L. J. Bignell, I. Bolognino, G. Brooks, S. S. Chhun, F. Dastgiri, A. Di Giacinto, G. D'Imperio, A. R. Duffy, M. B. Froehlich, T. Fruth, G. Fu, G. C. Hill, R. S. James, K. Janssens, S. Kapoor, G. J. Lane, K. T. Leaver, A. Mariani, P. McGee, L. J. McKie, P. C. McNamara, J. McKenzie , et al. (20 additional authors not shown)

Abstract: In this technical design report (TDR) executive summary we describe the SABRE South detector to be built at the Stawell Underground Physics Laboratory (SUPL). The SABRE South detector is designed to test the long-standing DAMA/LIBRA signal of an annually modulating rate consistent with dark matter by using the same target material. Located in the Southern Hemisphere, the detector is uniquely posit… ▽ More In this technical design report (TDR) executive summary we describe the SABRE South detector to be built at the Stawell Underground Physics Laboratory (SUPL). The SABRE South detector is designed to test the long-standing DAMA/LIBRA signal of an annually modulating rate consistent with dark matter by using the same target material. Located in the Southern Hemisphere, the detector is uniquely positioned to disentangle modulating seasonal effects. SABRE South uses seven ultra-high purity NaI(Tl) crystals (with a total target mass of either 35 kg or 50 kg), hermetically sealed in copper enclosures that are suspended within a liquid scintillator active veto. High quantum efficiency and low background Hamamatsu R11065 photomultiplier tubes are directly coupled to both ends of the crystal, and enclosed with the crystal in an oxygen free copper enclosure. The active veto system consists of 11.6 kL of linear alkylbenzene (LAB) doped with a mixture of fluorophores and contained in a steel vessel, which is instrumented with at least 18 Hamamatsu R5912 photomultipliers. The active veto tags key radiogenic backgrounds intrinsic to the crystals, such as ${^{40}}$K, and is expected to suppress the total background by 27% in the 1-6 keV region of interest. In addition to the liquid scintillator veto, a muon veto is positioned above the detector shielding. This muon veto consists of eight EJ-200 scintillator modules, with Hamamatsu R13089 photomultipliers coupled to both ends. With an expected total background of 0.72 cpd/kg/keV, SABRE South can test the DAMA/LIBRA signal with 5$σ$ discovery or 3$σ$ exclusion after two years of data taking. △ Less

Submitted 9 April, 2025; v1 submitted 21 November, 2024; originally announced November 2024.

Journal ref: JINST 20 T04001 (2025)

arXiv:2411.09593 [pdf, other]

SMILE-UHURA Challenge -- Small Vessel Segmentation at Mesoscopic Scale from Ultra-High Resolution 7T Magnetic Resonance Angiograms

Authors: Soumick Chatterjee, Hendrik Mattern, Marc Dörner, Alessandro Sciarra, Florian Dubost, Hannes Schnurre, Rupali Khatun, Chun-Chih Yu, Tsung-Lin Hsieh, Yi-Shan Tsai, Yi-Zeng Fang, Yung-Ching Yang, Juinn-Dar Huang, Marshall Xu, Siyu Liu, Fernanda L. Ribeiro, Saskia Bollmann, Karthikesh Varma Chintalapati, Chethan Mysuru Radhakrishna, Sri Chandana Hudukula Ram Kumara, Raviteja Sutrave, Abdul Qayyum, Moona Mazher, Imran Razzak, Cristobal Rodero , et al. (23 additional authors not shown)

Abstract: The human brain receives nutrients and oxygen through an intricate network of blood vessels. Pathology affecting small vessels, at the mesoscopic scale, represents a critical vulnerability within the cerebral blood supply and can lead to severe conditions, such as Cerebral Small Vessel Diseases. The advent of 7 Tesla MRI systems has enabled the acquisition of higher spatial resolution images, maki… ▽ More The human brain receives nutrients and oxygen through an intricate network of blood vessels. Pathology affecting small vessels, at the mesoscopic scale, represents a critical vulnerability within the cerebral blood supply and can lead to severe conditions, such as Cerebral Small Vessel Diseases. The advent of 7 Tesla MRI systems has enabled the acquisition of higher spatial resolution images, making it possible to visualise such vessels in the brain. However, the lack of publicly available annotated datasets has impeded the development of robust, machine learning-driven segmentation algorithms. To address this, the SMILE-UHURA challenge was organised. This challenge, held in conjunction with the ISBI 2023, in Cartagena de Indias, Colombia, aimed to provide a platform for researchers working on related topics. The SMILE-UHURA challenge addresses the gap in publicly available annotated datasets by providing an annotated dataset of Time-of-Flight angiography acquired with 7T MRI. This dataset was created through a combination of automated pre-segmentation and extensive manual refinement. In this manuscript, sixteen submitted methods and two baseline methods are compared both quantitatively and qualitatively on two different datasets: held-out test MRAs from the same dataset as the training data (with labels kept secret) and a separate 7T ToF MRA dataset where both input volumes and labels are kept secret. The results demonstrate that most of the submitted deep learning methods, trained on the provided training dataset, achieved reliable segmentation performance. Dice scores reached up to 0.838 $\pm$ 0.066 and 0.716 $\pm$ 0.125 on the respective datasets, with an average performance of up to 0.804 $\pm$ 0.15. △ Less

Submitted 14 November, 2024; originally announced November 2024.

arXiv:2411.01439 [pdf, other]

Shannon entropy of optimized proton-neutron pair condensates

Authors: Shu-Yuan Liang, Yi Lu, Yang Lei, Calvin W. Johnson, Guan-Jian Fu, Jia Jie Shen

Abstract: Proton-neutron pairing and like-nucleon pairing are two different facets of atomic nuclear configurations. While like-nucleon pair condensates manifest their superfluidic nature in semi magic nuclei, it is not absolutely clear if there exists a T=0 proton-neutron pair condensate phase in $N=Z$ nuclei. With an explicit formalism of general pair condensates with good particle numbers, we optimize pr… ▽ More Proton-neutron pairing and like-nucleon pairing are two different facets of atomic nuclear configurations. While like-nucleon pair condensates manifest their superfluidic nature in semi magic nuclei, it is not absolutely clear if there exists a T=0 proton-neutron pair condensate phase in $N=Z$ nuclei. With an explicit formalism of general pair condensates with good particle numbers, we optimize proton-neutron pair condensates for all $N=Z$ nuclei between $^{16}$O and $^{100}$Sn, given shell model effective interactions. As comparison, we also optimize like-nucleon pair condensates for their semi-magic isotones. Shannon entanglement entropy is a measurement of mixing among pair configurations, and can signal intrinsic phase transition. It turns out the like-nucleon pair condensates for semi-magic nuclei have large entropies signaling an entangled phase, but the proton-neutron pair condensates end up not far from a Hartree-Fock solution, with small entropy. With artificial pairing interaction strengths, we show that the general proton-neutron pair condensate can transit from an entangled T=1 phase to an entangled T=0 phase, i.e. pairing phase transition driven by external parameters. In the T=0 limit, the proton-neutron pair condensate optimized for $^{24}$Mg turns out to be a purely P pair condensate with large entanglement entropy, although such cases may occur in cold atom systems, unlikely in atomic nuclei. △ Less

Submitted 11 November, 2024; v1 submitted 3 November, 2024; originally announced November 2024.

arXiv:2411.00560 [pdf, other]

Topology and Intersection-Union Constrained Loss Function for Multi-Region Anatomical Segmentation in Ocular Images

Authors: Ruiyu Xia, Jianqiang Li, Xi Xu, Guanghui Fu

Abstract: Ocular Myasthenia Gravis (OMG) is a rare and challenging disease to detect in its early stages, but symptoms often first appear in the eye muscles, such as drooping eyelids and double vision. Ocular images can be used for early diagnosis by segmenting different regions, such as the sclera, iris, and pupil, which allows for the calculation of area ratios to support accurate medical assessments. How… ▽ More Ocular Myasthenia Gravis (OMG) is a rare and challenging disease to detect in its early stages, but symptoms often first appear in the eye muscles, such as drooping eyelids and double vision. Ocular images can be used for early diagnosis by segmenting different regions, such as the sclera, iris, and pupil, which allows for the calculation of area ratios to support accurate medical assessments. However, no publicly available dataset and tools currently exist for this purpose. To address this, we propose a new topology and intersection-union constrained loss function (TIU loss) that improves performance using small training datasets. We conducted experiments on a public dataset consisting of 55 subjects and 2,197 images. Our proposed method outperformed two widely used loss functions across three deep learning networks, achieving a mean Dice score of 83.12% [82.47%, 83.81%] with a 95% bootstrap confidence interval. In a low-percentage training scenario (10% of the training data), our approach showed an 8.32% improvement in Dice score compared to the baseline. Additionally, we evaluated the method in a clinical setting with 47 subjects and 501 images, achieving a Dice score of 64.44% [63.22%, 65.62%]. We did observe some bias when applying the model in clinical settings. These results demonstrate that the proposed method is accurate, and our code along with the trained model is publicly available. △ Less

Submitted 1 November, 2024; originally announced November 2024.

Comments: 5 pages, 4 figures, International Symposium on Biomedical Imaging 2025

ACM Class: I.4.6; J.3

arXiv:2410.20669 [pdf, ps, other]

Hyponormal block Toeplitz operators with non-harmonic symbols on the weighted Bergman space

Authors: Guangyang Fu, Jiang Zhou

Abstract: In this paper, we discuss hyponormal block Toeplitz operators $T_Φ$ over the vector-valued weighted Bergman space $A_α^2\left(\mathbb{C}^n\right)$. And two conditions about hyponormal block Toeplitz operators $T_Φ$ on $A_α^2\left(\mathbb{C}^n\right)$ were discussed separately, where $ Φ(z)=A z^p \bar{z}^q + B z^s \bar{z}^t $, $A,B$ are any n-order complex square matrices. In this paper, we discuss hyponormal block Toeplitz operators $T_Φ$ over the vector-valued weighted Bergman space $A_α^2\left(\mathbb{C}^n\right)$. And two conditions about hyponormal block Toeplitz operators $T_Φ$ on $A_α^2\left(\mathbb{C}^n\right)$ were discussed separately, where $ Φ(z)=A z^p \bar{z}^q + B z^s \bar{z}^t $, $A,B$ are any n-order complex square matrices. △ Less

Submitted 27 October, 2024; originally announced October 2024.

arXiv:2410.16920 [pdf, other]

doi 10.1103/PhysRevD.111.064016

Spontaneous Vectorization in the Einstein-Maxwell-Vector Model

Authors: Guang-Zai Ye, Chong-Ye Chen, GuoYang Fu, Chao Niu, Cheng-Yong Zhang, Peng Liu

Abstract: We investigate spontaneous vectorization in the Einstein-Maxwell-Vector (EMV) model, introducing a novel mechanism driven by the interplay between electromagnetic and vector fields. A key innovation in our work is the resolution of an apparent divergence in the vector field near the event horizon, achieved by employing a generalized coordinate transformation. This not only extends the domain of ex… ▽ More We investigate spontaneous vectorization in the Einstein-Maxwell-Vector (EMV) model, introducing a novel mechanism driven by the interplay between electromagnetic and vector fields. A key innovation in our work is the resolution of an apparent divergence in the vector field near the event horizon, achieved by employing a generalized coordinate transformation. This not only extends the domain of existence for vectorized Reissner-Nordström black holes (VRNBHs), but also refines the theoretical understanding of such solutions. We introduce a new concept of combined charge $\sqrt{\tilde{Q}^2 + \tilde{P}^2}$, which better captures the underlying physics of these black holes and provides a unified framework for analyzing thermodynamics and observable phenomena such as light ring structures. Our findings suggest that VRNBHs exhibit enhanced thermodynamic preference and distinctive light ring properties compared to Reissner-Nordström solutions. Moreover, we demonstrate how this combined charge approach reveals connections to two-charge black hole solutions, offering promising avenues for observational verification within the context of effective field theories. △ Less

Submitted 10 April, 2025; v1 submitted 22 October, 2024; originally announced October 2024.

Comments: 31 pages, 11 figures Corrected typos. Revised the wording of the article, but the result unchanged

Journal ref: Phys.Rev.D 111 (2025) 6, 064016

arXiv:2410.11054 [pdf, other]

doi 10.3847/1538-3881/ad9dd1

An HST Transmission Spectrum of the Closest M-Dwarf Transiting Rocky Planet LTT 1445Ab

Authors: Katherine A. Bennett, David K. Sing, Kevin B. Stevenson, Hannah R. Wakeford, Zafar Rustamkulov, Natalie H. Allen, Joshua D. Lothringer, Ryan J. MacDonald, Nathan J. Mayne, Guangwei Fu

Abstract: Which rocky exoplanets have atmospheres? This presumably simply question is the first that must be answered to understand the prevalence of nearby habitable planets. A mere 6.9 pc from Earth, LTT 1445A is the closest transiting M-dwarf system, and its largest known planet, at $\rm 1.31\; R_{\oplus}$ and 424 K, is one of the most promising targets in which to search for an atmosphere. We use HST/WF… ▽ More Which rocky exoplanets have atmospheres? This presumably simply question is the first that must be answered to understand the prevalence of nearby habitable planets. A mere 6.9 pc from Earth, LTT 1445A is the closest transiting M-dwarf system, and its largest known planet, at $\rm 1.31\; R_{\oplus}$ and 424 K, is one of the most promising targets in which to search for an atmosphere. We use HST/WFC3 transmission spectroscopy with the G280 and G141 grisms to study the spectrum of LTT 1445Ab between $\rm 0.2-1.65\;μm$. In doing so, we uncover a UV flare on the neighboring star LTT 1445C that is completely invisible at optical wavelengths; we report one of the first simultaneous near-UV/optical spectra of an M~dwarf flare. The planet spectrum is consistent with a flat line (with median transit depth uncertainties of 128 and 52 ppm for the G280 and G141 observations, respectively), though the infrared portion displays potential features that could be explained by known opacity sources such as HCN. Some atmospheric retrievals weakly favor ($\sim2σ$) an atmosphere, but it remains challenging to discern between stellar contamination, an atmosphere, and a featureless spectrum at this time. We do, however, confidently rule out $\leq100\times$ solar metallicity atmospheres. Although stellar contamination retrievals cannot fit the infrared features well, the overall spectrum is consistent with stellar contamination from hot or cold spots. Based on the UV/optical data, we place limits on the extent of stellar variability expected in the near-infrared ($30-40$ ppm), which will be critical for future JWST observations. △ Less

Submitted 13 December, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

Comments: 26 pages, 13 figures, 7 tables. Accepted for publication in AJ

arXiv:2410.10323 [pdf, other]

MentalGLM Series: Explainable Large Language Models for Mental Health Analysis on Chinese Social Media

Authors: Wei Zhai, Nan Bai, Qing Zhao, Jianqiang Li, Fan Wang, Hongzhi Qi, Meng Jiang, Xiaoqin Wang, Bing Xiang Yang, Guanghui Fu

Abstract: As the prevalence of mental health challenges, social media has emerged as a key platform for individuals to express their emotions.Deep learning tends to be a promising solution for analyzing mental health on social media. However, black box models are often inflexible when switching between tasks, and their results typically lack explanations. With the rise of large language models (LLMs), their… ▽ More As the prevalence of mental health challenges, social media has emerged as a key platform for individuals to express their emotions.Deep learning tends to be a promising solution for analyzing mental health on social media. However, black box models are often inflexible when switching between tasks, and their results typically lack explanations. With the rise of large language models (LLMs), their flexibility has introduced new approaches to the field. Also due to the generative nature, they can be prompted to explain decision-making processes. However, their performance on complex psychological analysis still lags behind deep learning. In this paper, we introduce the first multi-task Chinese Social Media Interpretable Mental Health Instructions (C-IMHI) dataset, consisting of 9K samples, which has been quality-controlled and manually validated. We also propose MentalGLM series models, the first open-source LLMs designed for explainable mental health analysis targeting Chinese social media, trained on a corpus of 50K instructions. The proposed models were evaluated on three downstream tasks and achieved better or comparable performance compared to deep learning models, generalized LLMs, and task fine-tuned LLMs. We validated a portion of the generated decision explanations with experts, showing promising results. We also evaluated the proposed models on a clinical dataset, where they outperformed other LLMs, indicating their potential applicability in the clinical field. Our models show strong performance, validated across tasks and perspectives. The decision explanations enhance usability and facilitate better understanding and practical application of the models. Both the constructed dataset and the models are publicly available via: https://github.com/zwzzzQAQ/MentalGLM. △ Less

Submitted 14 October, 2024; originally announced October 2024.

arXiv:2410.01625 [pdf, other]

A Fourth Planet in the Kepler-51 System Revealed by Transit Timing Variations

Authors: Kento Masuda, Jessica E. Libby-Roberts, John H. Livingston, Kevin B. Stevenson, Peter Gao, Shreyas Vissapragada, Guangwei Fu, Te Han, Michael Greklek-McKeon, Suvrath Mahadevan, Eric Agol, Aaron Bello-Arufe, Zachory Berta-Thompson, Caleb I. Canas, Yayaati Chachan, Leslie Hebb, Renyu Hu, Yui Kawashima, Heather A. Knutson, Caroline V. Morley, Catriona A. Murray, Kazumasa Ohno, Armen Tokadjian, Xi Zhang, Luis Welbanks , et al. (27 additional authors not shown)

Abstract: Kepler-51 is a $\lesssim 1\,\mathrm{Gyr}$-old Sun-like star hosting three transiting planets with radii $\approx 6$-$9\,R_\oplus$ and orbital periods $\approx 45$-$130\,\mathrm{days}$. Transit timing variations (TTVs) measured with past Kepler and Hubble Space Telescope (HST) observations have been successfully modeled by considering gravitational interactions between the three transiting planets,… ▽ More Kepler-51 is a $\lesssim 1\,\mathrm{Gyr}$-old Sun-like star hosting three transiting planets with radii $\approx 6$-$9\,R_\oplus$ and orbital periods $\approx 45$-$130\,\mathrm{days}$. Transit timing variations (TTVs) measured with past Kepler and Hubble Space Telescope (HST) observations have been successfully modeled by considering gravitational interactions between the three transiting planets, yielding low masses and low mean densities ($\lesssim 0.1\,\mathrm{g/cm^3}$) for all three planets. However, the transit time of the outermost transiting planet Kepler-51d recently measured by the James Webb Space Telescope (JWST) 10 years after the Kepler observations is significantly discrepant from the prediction made by the three-planet TTV model, which we confirmed with ground-based and follow-up HST observations. We show that the departure from the three-planet model is explained by including a fourth outer planet, Kepler-51e, in the TTV model. A wide range of masses ($\lesssim M_\mathrm{Jup}$) and orbital periods ($\lesssim 10\,\mathrm{yr}$) are possible for Kepler-51e. Nevertheless, all the coplanar solutions found from our brute-force search imply masses $\lesssim 10\,M_\oplus$ for the inner transiting planets. Thus their densities remain low, though with larger uncertainties than previously estimated. Unlike other possible solutions, the one in which Kepler-51e is around the $2:1$ mean motion resonance with Kepler-51d implies low orbital eccentricities ($\lesssim 0.05$) and comparable masses ($\sim 5\,M_\oplus$) for all four planets, as is seen in other compact multi-planet systems. This work demonstrates the importance of long-term follow-up of TTV systems for probing longer period planets in a system. △ Less

Submitted 4 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

Comments: 48 pages, 26 figures, accepted for publication in AJ

arXiv:2410.00543 [pdf, other]

doi 10.1103/PhysRevD.111.104008

Quasinormal modes of a charged loop quantum black hole

Authors: Li-Gang Zhu, Guoyang Fu, Shulan Li, Dan Zhang, Jian-Pin Wu

Abstract: This study presents a systematic investigation of quasinormal modes (QNMs) for probe fields-massless/massive scalar and Dirac fields-around a charged loop quantum gravity black hole (LQG-BH) characterized by the quantum parameter $b_0$ and the charge parameter $Q$. Through spectral analysis of quasinormal frequencies (QNFs), we uncover a distinct overtone outburst driven by quantum gravity effects… ▽ More This study presents a systematic investigation of quasinormal modes (QNMs) for probe fields-massless/massive scalar and Dirac fields-around a charged loop quantum gravity black hole (LQG-BH) characterized by the quantum parameter $b_0$ and the charge parameter $Q$. Through spectral analysis of quasinormal frequencies (QNFs), we uncover a distinct overtone outburst driven by quantum gravity effects, prominently manifested in the scalar field spectrum with the multipole quantum number $l=0$. Both the outburst and its accompanying oscillatory patterns grow more pronounced with increasing overtone numbers. In contrast, massless scalar fields with $l>1$ and Dirac fields exhibit delayed outburst development, with non-monotonic behavior dominating the first two overtones. Notably, increasing the charge $Q$ universally suppresses quantum-gravity-induced features, including outbursts, non-monotonicity, and oscillations. Furthermore, we present evidence suggesting the presence of quasi-resonances in the massive scalar QNM spectrum, thereby illustrating the potential for the emergence of arbitrarily long-lived modes in this charged LQG spacetime. These findings establish a robust and universal interplay between quantum gravity effects and charge dynamics, providing new insights into the spectral properties of quantum-corrected BHs. △ Less

Submitted 3 May, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

Journal ref: Phys.Rev.D 111 (2025) 10, 104008

arXiv:2409.08138 [pdf, other]

Probing Quantum Gravity Effects with Eccentric Extreme Mass-Ratio Inspirals

Authors: Guoyang Fu, Yunqi Liu, Bin Wang, Jian-Pin Wu, Chao Zhang

Abstract: In this paper, we investigate the impact of loop quantum gravity (LQG) on extreme mass-ratio inspirals (EMRIs), and the results indicate that LQG effects cause the orbital decay to occur faster compared to the Schwarzschild case. Furthermore, we use the augmented analytic kludge approach to generate EMRI waveforms and study the LISA's capability to detect the LQG effect with faithfulness. Addition… ▽ More In this paper, we investigate the impact of loop quantum gravity (LQG) on extreme mass-ratio inspirals (EMRIs), and the results indicate that LQG effects cause the orbital decay to occur faster compared to the Schwarzschild case. Furthermore, we use the augmented analytic kludge approach to generate EMRI waveforms and study the LISA's capability to detect the LQG effect with faithfulness. Additionally, employing the Fisher information matrix method for parameter estimation, we estimate that after one year of observation, the uncertainty in $r_0$ reduces to approximately $6.59\times 10^{-4}$ with a signal-to-noise ratio of $49$. △ Less

Submitted 19 September, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

Comments: 22 pages,7 figures

arXiv:2409.07146 [pdf, other]

Gated Slot Attention for Efficient Linear-Time Sequence Modeling

Authors: Yu Zhang, Songlin Yang, Ruijie Zhu, Yue Zhang, Leyang Cui, Yiqiao Wang, Bolun Wang, Freda Shi, Bailin Wang, Wei Bi, Peng Zhou, Guohong Fu

Abstract: Linear attention Transformers and their gated variants, celebrated for enabling parallel training and efficient recurrent inference, still fall short in recall-intensive tasks compared to traditional Transformers and demand significant resources for training from scratch. This paper introduces Gated Slot Attention (GSA), which enhances Attention with Bounded-memory-Control (ABC) by incorporating a… ▽ More Linear attention Transformers and their gated variants, celebrated for enabling parallel training and efficient recurrent inference, still fall short in recall-intensive tasks compared to traditional Transformers and demand significant resources for training from scratch. This paper introduces Gated Slot Attention (GSA), which enhances Attention with Bounded-memory-Control (ABC) by incorporating a gating mechanism inspired by Gated Linear Attention (GLA). Essentially, GSA comprises a two-layer GLA linked via $\operatorname{softmax}$, utilizing context-aware memory reading and adaptive forgetting to improve memory capacity while maintaining compact recurrent state size. This design greatly enhances both training and inference efficiency through GLA's hardware-efficient training algorithm and reduced state size. Additionally, retaining the $\operatorname{softmax}$ operation is particularly beneficial in "finetuning pretrained Transformers to RNNs" (T2R) settings, reducing the need for extensive training from scratch. Extensive experiments confirm GSA's superior performance in scenarios requiring in-context recall and in T2R settings. △ Less

Submitted 31 October, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

Comments: NeurIPS 2024

arXiv:2409.06164 [pdf, other]

Deep Learning and Large Language Models for Audio and Text Analysis in Predicting Suicidal Acts in Chinese Psychological Support Hotlines

Authors: Yining Chen, Jianqiang Li, Changwei Song, Qing Zhao, Yongsheng Tong, Guanghui Fu

Abstract: Suicide is a pressing global issue, demanding urgent and effective preventive interventions. Among the various strategies in place, psychological support hotlines had proved as a potent intervention method. Approximately two million people in China attempt suicide annually, with many individuals making multiple attempts. Prompt identification and intervention for high-risk individuals are crucial… ▽ More Suicide is a pressing global issue, demanding urgent and effective preventive interventions. Among the various strategies in place, psychological support hotlines had proved as a potent intervention method. Approximately two million people in China attempt suicide annually, with many individuals making multiple attempts. Prompt identification and intervention for high-risk individuals are crucial to preventing tragedies. With the rapid advancement of artificial intelligence (AI), especially the development of large-scale language models (LLMs), new technological tools have been introduced to the field of mental health. This study included 1284 subjects, and was designed to validate whether deep learning models and LLMs, using audio and transcribed text from support hotlines, can effectively predict suicide risk. We proposed a simple LLM-based pipeline that first summarizes transcribed text from approximately one hour of speech to extract key features, and then predict suicidial bahaviours in the future. We compared our LLM-based method with the traditional manual scale approach in a clinical setting and with five advanced deep learning models. Surprisingly, the proposed simple LLM pipeline achieved strong performance on a test set of 46 subjects, with an F1 score of 76\% when combined with manual scale rating. This is 7\% higher than the best speech-based deep learning models and represents a 27.82\% point improvement in F1 score compared to using the manual scale apporach alone. Our study explores new applications of LLMs and demonstrates their potential for future use in suicide prevention efforts. △ Less

Submitted 9 September, 2024; originally announced September 2024.

arXiv:2408.16463 [pdf, other]

An Exploratory Deep Learning Approach for Predicting Subsequent Suicidal Acts in Chinese Psychological Support Hotlines

Authors: Changwei Song, Qing Zhao, Jianqiang Li, Yining Chen, Yongsheng Tong, Guanghui Fu

Abstract: Psychological support hotlines are an effective suicide prevention measure that typically relies on professionals using suicide risk assessment scales to predict individual risk scores. However, the accuracy of scale-based predictive methods for suicide risk assessment can vary widely depending on the expertise of the operator. This limitation underscores the need for more reliable methods, prompt… ▽ More Psychological support hotlines are an effective suicide prevention measure that typically relies on professionals using suicide risk assessment scales to predict individual risk scores. However, the accuracy of scale-based predictive methods for suicide risk assessment can vary widely depending on the expertise of the operator. This limitation underscores the need for more reliable methods, prompting this research's innovative exploration of the use of artificial intelligence to improve the accuracy and efficiency of suicide risk prediction within the context of psychological support hotlines. The study included data from 1,549 subjects from 2015-2017 in China who contacted a psychological support hotline. Each participant was followed for 12 months to identify instances of suicidal behavior. We proposed a novel multi-task learning method that uses the large-scale pre-trained model Whisper for feature extraction and fits psychological scales while predicting the risk of suicide. The proposed method yields a 2.4\% points improvement in F1-score compared to the traditional manual approach based on the psychological scales. Our model demonstrated superior performance compared to the other eight popular models. To our knowledge, this study is the first to apply deep learning to long-term speech data to predict suicide risk in China, indicating grate potential for clinical applications. The source code is publicly available at: \url{https://github.com/songchangwei/Suicide-Risk-Prediction}. △ Less

Submitted 29 August, 2024; originally announced August 2024.

arXiv:2408.15064 [pdf, other]

The constraint on modified black holes with extreme mass ratio inspirals

Authors: Chao Zhang, Guoyang Fu, Yungui Gong

Abstract: The low-energy effective action of String Theory introduces corrections to the dilaton-graviton sector, resulting in deformed black holes beyond general relativity. We analyze extreme mass-ratio inspiral systems (EMRIs), where a stellar-mass object spirals into a slowly rotating supermassive black hole including a distinct deviation parameter. This study examines the effects of this deformation on… ▽ More The low-energy effective action of String Theory introduces corrections to the dilaton-graviton sector, resulting in deformed black holes beyond general relativity. We analyze extreme mass-ratio inspiral systems (EMRIs), where a stellar-mass object spirals into a slowly rotating supermassive black hole including a distinct deviation parameter. This study examines the effects of this deformation on gravitational wave fluxes, orbital evolution, and phase dynamics, incorporating leading-order post-Newtonian corrections. With one-year observations of EMRIs, we employ the Fisher information matrix method to evaluate the potential for detecting deviations from general relativity through space-based gravitational wave detectors that utilize time-delay interferometry to suppress laser noise. The constraint on modified black holes, $Δα\preceq 10^{-5}$, is almost the same with and without the time-delay interferometry combination. This analysis enhances our understanding and underscores the crucial role of observations in advancing gravitational phenomena within String Theory. △ Less

Submitted 28 August, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

Comments: 19 pages, 4 figures; Added some references and revised some sentences; Comments are welcome

arXiv:2408.11233 [pdf, ps, other]

On the Gaussian Kinematic Formula of R. Adler and J. Taylor

Authors: Joseph H. G. Fu

Abstract: We apply methods of algebraic integral geometry to prove a special case of the Gaussian kinematic formula of Adler-Taylor. The idea, suggested already by Adler and Taylor, is to view the GKF as the limit of spherical kinematic formulas for spheres of large dimension $N$ and curvature $\frac 1 N$. We apply methods of algebraic integral geometry to prove a special case of the Gaussian kinematic formula of Adler-Taylor. The idea, suggested already by Adler and Taylor, is to view the GKF as the limit of spherical kinematic formulas for spheres of large dimension $N$ and curvature $\frac 1 N$. △ Less

Submitted 19 September, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

Comments: 13 pages; still more corrections

MSC Class: Primary 53C65; Secondary 53Z50

arXiv:2408.08697 [pdf, other]

The DAMA/LIBRA signal: an induced modulation effect?

Authors: R. S. James, K. Rule, E. Barberio, V. U. Bashu, L. J. Bignell, I. Bolognino, G. Brooks, S. S. Chhun, F. Dastgiri, A. R. Duffy, M. Froehlich, T. M. A. Fruth, G. Fu, G. C. Hill, K. Janssens, S. Kapoor, G. J. Lane, K. T. Leaver, P. McGee, L. J. McKie, P. C. McNamara, J. McKenzie, W. J. D. Melbourne, M. Mews, L. J. Milligan , et al. (14 additional authors not shown)

Abstract: The persistence of the DAMA/LIBRA (DAMA) modulation over the past two decades has been a source of great contention within the dark matter community. The DAMA collaboration reports a persistent, modulating event rate within their setup of NaI(Tl) scintillating crystals at the INFN Laboratori Nazionali del Gran Sasso (LNGS) underground laboratory. A recent work alluded that this signal could have a… ▽ More The persistence of the DAMA/LIBRA (DAMA) modulation over the past two decades has been a source of great contention within the dark matter community. The DAMA collaboration reports a persistent, modulating event rate within their setup of NaI(Tl) scintillating crystals at the INFN Laboratori Nazionali del Gran Sasso (LNGS) underground laboratory. A recent work alluded that this signal could have arisen due to an analysis artefact, caused by DAMA not accounting for time variation of decaying background radioisotopes in their analysis procedure. In this work, we examine in detail this 'induced modulation' effect, arguing that a number of aspects of the DAMA signal are incompatible with an induced modulation arising from decays of background isotopes over the lifetime of the experiment. Using a toy model of the DAMA/LIBRA experiment, we explore the induced modulation effect under different variations of the activities of the relevant isotopes - namely, $^3$H and $^{210}$Pb - highlighting the various inconsistencies between the resultant toy datasets and the DAMA signal. We stress the importance of the SABRE experiment, whose goal is to unambiguously test for the presence of such a modulating signal in an experiment using the same target material and comparable levels of background. △ Less

Submitted 28 March, 2025; v1 submitted 16 August, 2024; originally announced August 2024.

arXiv:2407.19422 [pdf, other]

A Generic Review of Integrating Artificial Intelligence in Cognitive Behavioral Therapy

Authors: Meng Jiang, Qing Zhao, Jianqiang Li, Fan Wang, Tianyu He, Xinyan Cheng, Bing Xiang Yang, Grace W. K. Ho, Guanghui Fu

Abstract: Cognitive Behavioral Therapy (CBT) is a well-established intervention for mitigating psychological issues by modifying maladaptive cognitive and behavioral patterns. However, delivery of CBT is often constrained by resource limitations and barriers to access. Advancements in artificial intelligence (AI) have provided technical support for the digital transformation of CBT. Particularly, the emerge… ▽ More Cognitive Behavioral Therapy (CBT) is a well-established intervention for mitigating psychological issues by modifying maladaptive cognitive and behavioral patterns. However, delivery of CBT is often constrained by resource limitations and barriers to access. Advancements in artificial intelligence (AI) have provided technical support for the digital transformation of CBT. Particularly, the emergence of pre-training models (PTMs) and large language models (LLMs) holds immense potential to support, augment, optimize and automate CBT delivery. This paper reviews the literature on integrating AI into CBT interventions. We begin with an overview of CBT. Then, we introduce the integration of AI into CBT across various stages: pre-treatment, therapeutic process, and post-treatment. Next, we summarized the datasets relevant to some CBT-related tasks. Finally, we discuss the benefits and current limitations of applying AI to CBT. We suggest key areas for future research, highlighting the need for further exploration and validation of the long-term efficacy and clinical utility of AI-enhanced CBT. The transformative potential of AI in reshaping the practice of CBT heralds a new era of more accessible, efficient, and personalized mental health interventions. △ Less

Submitted 28 July, 2024; originally announced July 2024.

Showing 1–50 of 283 results for author: Fu, G