Search | arXiv e-print repository

DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented Generation

Authors: David Osei Opoku, Ming Sheng, Yong Zhang

Abstract: Domain-specific QA systems require not just generative fluency but high factual accuracy grounded in structured expert knowledge. While recent Retrieval-Augmented Generation (RAG) frameworks improve context recall, they struggle with integrating heterogeneous data and maintaining reasoning consistency. To address these challenges, we propose DO-RAG, a scalable and customizable hybrid QA framework that integrates multi-level knowledge graph construction with semantic vector retrieval. Our system employs a novel agentic chain-of-thought architecture to extract structured relationships from unstructured, multimodal documents, constructing dynamic knowledge graphs that enhance retrieval precision. At query time, DO-RAG fuses graph and vector retrieval results to generate context-aware responses, followed by hallucination mitigation via grounded refinement. Experimental evaluations in the database and electrical domains show near-perfect recall and over 94% answer relevancy, with DO-RAG outperforming baseline frameworks by up to 33.38%. By combining traceability, adaptability, and performance efficiency, DO-RAG offers a reliable foundation for multi-domain, high-precision QA at scale.

Submitted 17 May, 2025; originally announced May 2025.

Comments: 6 pages, 5 figures

arXiv:2505.15142 [pdf, ps, other]

On Lan-Sheng-Zuo conjecture

Authors: Bowen Liu, Mao Sheng

Abstract: In this paper we study the Lan-Sheng-Zuo conjecture proposed in arXiv:1210.8280. We prove that the conjecture holds for smooth projective curves with genus $g\leq 1$, and construct explicit counterexamples given by Higgs bundles of arbitrarily large rank (the first example is $p=2,r=3$) over any smooth projective curve with genus $g\ge2$.

Submitted 28 May, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

Comments: 18 pages, modified the assumption of higher dimensional case, all comments are welcome!

MSC Class: 14H99

arXiv:2504.17395 [pdf, other]

SDVPT: Semantic-Driven Visual Prompt Tuning for Open-World Object Counting

Authors: Yiming Zhao, Guorong Li, Laiyun Qing, Amin Beheshti, Jian Yang, Michael Sheng, Yuankai Qi, Qingming Huang

Abstract: Open-world object counting leverages the robust text-image alignment of pre-trained vision-language models (VLMs) to enable counting of arbitrary categories in images specified by textual queries. However, widely adopted naive fine-tuning strategies concentrate exclusively on text-image consistency for categories contained in training, which leads to limited generalizability for unseen categories. In this work, we propose a plug-and-play Semantic-Driven Visual Prompt Tuning framework (SDVPT) that transfers knowledge from the training set to unseen categories with minimal overhead in parameters and inference time. First, we introduce a two-stage visual prompt learning strategy composed of Category-Specific Prompt Initialization (CSPI) and Topology-Guided Prompt Refinement (TGPR). The CSPI generates category-specific visual prompts, and then TGPR distills latent structural patterns from the VLM's text encoder to refine these prompts. During inference, we dynamically synthesize the visual prompts for unseen categories based on the semantic correlation between unseen and training categories, facilitating robust text-image alignment for unseen categories. Extensive experiments integrating SDVPT with all available open-world object counting models demonstrate its effectiveness and adaptability across three widely used datasets: FSC-147, CARPK, and PUCPR+.

Submitted 24 April, 2025; originally announced April 2025.

arXiv:2503.10844 [pdf, other]

doi 10.3847/1538-4357/adc0a2

Cosmic filament spin -- II: filament spin and its impact on galaxy spin-filament alignment in a cosmological simulation

Authors: Peng Wang, Xiao-Xiao Tang, Hao-Da Wang, Noam I. Libeskind, Elmo Tempel, Wei Wang, Youcai Zhang, Ming-Jie Sheng, Hao-Ran Yu, Haojie Xu

Abstract: Observational studies have reported that cosmic filaments on the megaparsec scale exhibit rotational motion. Subsequent simulation studies have shown qualitative agreement with these findings, but quantitative discrepancies remain due to differences in data and methods, which require verification. To address this issue, we adopt the same methodology as used in the observations to identify filament spin from the galaxy distribution constructed from a hydrodynamic simulation. Using the same approach to measure filament spin, we find that the simulation results closely match the observational findings, with only minor discrepancies arising from slight differences in the fraction of filaments classified as dynamically cold or hot based on their dynamic temperature. Additionally, an analysis of how filament spin affects the galaxy spin-filament correlation shows that dynamically cold filaments with strong spin signals have a greater impact on the galaxy spin-filament correlation than dynamically hot filaments with weaker spin signals. These results not only provide further evidence that cosmic filaments exhibit spin, but also highlight the importance of this rotation in the acquisition of angular momentum by individual galaxies. Future studies exploring the influence of filament spin on galaxy spin may shed light on the physical origins of filaments and the angular momentum of galaxies.

Submitted 6 May, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

Comments: 11 pages, 7 figures, accepted for publication in ApJ

arXiv:2503.10841 [pdf, other]

doi 10.3847/1538-4357/adbbd7

Cosmic filament spin -- I: a comparative study in observation

Authors: Xiao-Xiao Tang, Peng Wang, Wei Wang, Ming-Jie Sheng, Hao-Ran Yu, Haojie Xu

Abstract: In the cosmic web, filaments play a crucial role in connecting walls to clusters and also act as an important stage for galaxy formation and evolution. Recent observational studies claim that filaments have spin. In this study, we examined the potential impact of diversity in filament identification algorithms and galaxy survey datasets on the quantification of filament spin. The results of this study demonstrate qualitative agreement with previous research, suggesting that a reliable filament spin signal is detectable when the viewing angle of the filament spine is larger than roughly 80 degrees. The detected filament spin signal is intricately linked to the viewing angle, dynamic temperature, etc. The quantitative difference in filament spin signal among samples depends slightly on the filament identification algorithm, and more strongly on the redshift space distortion effect in the galaxy sample.

Submitted 5 May, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

Comments: 14 pages, 6 figures and 1 table, accepted for publication in ApJ

arXiv:2502.09869 [pdf, other]

Beyond Explicit and Implicit: How Users Provide Feedback to Shape Personalized Recommendation Content

Authors: Wenqi Li, Jui-Ching Kuo, Manyu Sheng, Pengyi Zhang, Qunfang Wu

Abstract: As personalized recommendation algorithms become integral to social media platforms, users are increasingly aware of their ability to influence recommendation content. However, limited research has explored how users provide feedback through their behaviors and platform mechanisms to shape the recommendation content. We conducted semi-structured interviews with 34 active users of algorithmic-driven social media platforms (e.g., Xiaohongshu, Douyin). In addition to explicit and implicit feedback, this study introduced intentional implicit feedback, highlighting the actions users intentionally took to refine recommendation content through perceived feedback mechanisms. Additionally, choices of feedback behaviors were found to align with specific purposes. Explicit feedback was primarily used for feed customization, while unintentional implicit feedback was more linked to content consumption. Intentional implicit feedback was employed for multiple purposes, particularly in increasing content diversity and improving recommendation relevance. This work underscores the user intention dimension in the explicit-implicit feedback dichotomy and offers insights for designing personalized recommendation feedback that better responds to users' needs.

Submitted 13 February, 2025; originally announced February 2025.

Comments: The final version is available at https://doi.org/10.1145/3706598.3713241

arXiv:2502.08351 [pdf, ps, other]

Effects of initial spin orientation on the generation of polarized electron beams from laser wakefield acceleration in plasma

Authors: L. R. Yin, X. F. Li, Y. J. Gu, N. Cao, Q. Kong, M. Buescher, S. M. Weng, M. Chen, Z. M. Sheng

Abstract: The effects of the initial spin orientation on the final electron beam polarization via laser wakefield acceleration in pre-polarized plasma are investigated theoretically and numerically. From a variation of the initial spin direction, the spin dynamics of the electron beam is found to depend on the self-injection mechanism. The effects of wakefields and laser fields are studied using test particle dynamics and particle-in-cell simulation based on the Thomas-Bargmann-Michel-Telegdi equation, respectively. Compared to the case of transverse injection, the scheme of longitudinal injection is more favorable for obtaining a highly polarized electron beam.

Submitted 12 February, 2025; originally announced February 2025.

Comments: 9 pages, 8 figures

arXiv:2412.02934 [pdf, other]

BGTplanner: Maximizing Training Accuracy for Differentially Private Federated Recommenders via Strategic Privacy Budget Allocation

Authors: Xianzhi Zhang, Yipeng Zhou, Miao Hu, Di Wu, Pengshan Liao, Mohsen Guizani, Michael Sheng

Abstract: To mitigate the rising concern about privacy leakage, the federated recommender (FR) paradigm emerges, in which decentralized clients co-train the recommendation model without exposing their raw user-item rating data. The differentially private federated recommender (DPFR) further enhances FR by injecting differentially private (DP) noises into clients. Yet, current DPFRs, suffering from noise distortion, cannot achieve satisfactory accuracy. Various efforts have been dedicated to improving DPFRs by adaptively allocating the privacy budget over the learning process. However, due to the intricate relation between privacy budget allocation and model accuracy, existing works are still far from maximizing DPFR accuracy. To address this challenge, we develop BGTplanner (Budget Planner) to strategically allocate the privacy budget for each round of DPFR training, improving overall training performance. Specifically, we leverage the Gaussian process regression and historical information to predict the change in recommendation accuracy with a certain allocated privacy budget. Additionally, Contextual Multi-Armed Bandit (CMAB) is harnessed to make privacy budget allocation decisions by reconciling the current improvement and long-term privacy constraints. Our extensive experimental results on real datasets demonstrate that BGTplanner achieves an average improvement of 6.76% in training performance compared to state-of-the-art baselines.

Submitted 3 December, 2024; originally announced December 2024.

arXiv:2411.17361 [pdf, other]

Towards Robust Cross-Domain Recommendation with Joint Identifiability of User Preference

Authors: Jing Du, Zesheng Ye, Bin Guo, Zhiwen Yu, Jia Wu, Jian Yang, Michael Sheng, Lina Yao

Abstract: Recent cross-domain recommendation (CDR) studies assume that disentangled domain-shared and domain-specific user representations can mitigate domain gaps and facilitate effective knowledge transfer. However, achieving perfect disentanglement is challenging in practice, because user behaviors in CDR are highly complex, and the true underlying user preferences cannot be fully captured through observed user-item interactions alone. Given this impracticability, we instead propose to model joint identifiability that establishes unique correspondence of user representations across domains, ensuring consistent preference modeling even when user behaviors exhibit shifts in different domains. To achieve this, we introduce a hierarchical user preference modeling framework that organizes user representations by the neural network encoder's depth, allowing separate treatment of shallow and deeper subspaces. In the shallow subspace, our framework models the interest centroids for each user within each domain, probabilistically determining the users' interest belongings and selectively aligning these centroids across domains to ensure fine-grained consistency in domain-irrelevant features. For deeper subspace representations, we enforce joint identifiability by decomposing it into a shared cross-domain stable component and domain-variant components, linked by a bijective transformation for unique correspondence. Empirical studies on real-world CDR tasks with varying domain correlations demonstrate that our method consistently surpasses state-of-the-art, even with weakly correlated tasks, highlighting the importance of joint identifiability in achieving robust CDR.

Submitted 26 November, 2024; originally announced November 2024.

Comments: 12 pages, 6 figures, under review

arXiv:2411.15845 [pdf, other]

Space-ground Fluid AI for 6G Edge Intelligence

Authors: Qian Chen, Zhanwei Wang, Xianhao Chen, Juan Wen, Di Zhou, Sijing Ji, Min Sheng, Kaibin Huang

Abstract: Edge artificial intelligence (AI) and space-ground integrated networks (SGINs) are two main usage scenarios of the sixth-generation (6G) mobile networks. Edge AI supports pervasive low-latency AI services to users, whereas SGINs provide digital services to spatial, aerial, maritime, and ground users. This article advocates the integration of the two technologies by extending edge AI to space, thereby delivering AI services to every corner of the planet. Beyond a simple combination, our novel framework, called Space-ground Fluid AI, leverages the predictive mobility of satellites to facilitate fluid horizontal and vertical task/model migration in the networks. This ensures non-disruptive AI service provisioning in spite of the high mobility of satellite servers. The aim of the article is to introduce the (Space-ground) Fluid AI technology. First, we outline the network architecture and unique characteristics of Fluid AI. Then, we delve into three key components of Fluid AI, i.e., fluid learning, fluid inference, and fluid model downloading. They share the common feature of coping with satellite mobility via inter-satellite and space-ground cooperation to support AI services. Finally, we discuss the considerations for the real-world deployment of Fluid AI and identify further research opportunities.

Submitted 25 February, 2025; v1 submitted 24 November, 2024; originally announced November 2024.

Comments: 7 pages, 4 figures

arXiv:2411.07596 [pdf, ps, other]

The analytic criterion of strict copositivity for a 4th-order 3-dimensional tensor

Authors: Mingjun Sheng, Yisheng Song

Abstract: This paper focuses on the strict copositivity analysis of 4th-order 3-dimensional symmetric tensors. A necessary and sufficient condition is provided for the strict copositivity of a fourth-order symmetric tensor. Subsequently, building upon this conclusion, we discuss the strict copositivity of fourth-order three-dimensional symmetric tensors with entries $\pm 1, 0$, and further build their necessary and sufficient conditions. Utilizing these theorems, we can effectively verify the strict copositivity of general fourth-order three-dimensional symmetric tensors.

Submitted 12 November, 2024; originally announced November 2024.

Comments: 14 pages
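
For readers unfamiliar with the term used in the abstract above, the standard definition of strict copositivity can be written out as follows. This is the usual textbook definition, not the paper's analytic criterion, and the symbols $\mathcal{A}$, $a_{ijkl}$, and $x$ are notation assumed here rather than taken from the listing.

```latex
% Standard definition (assumed notation): a fourth-order, 3-dimensional
% symmetric tensor A = (a_{ijkl}) is strictly copositive when its quartic
% form is positive on every nonzero vector of the nonnegative orthant.
\[
  \mathcal{A}x^{4} \;=\; \sum_{i,j,k,l=1}^{3} a_{ijkl}\, x_i x_j x_k x_l \;>\; 0
  \qquad \text{for all } x \in \mathbb{R}^{3}_{+} \setminus \{0\}.
\]
```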

arXiv:2411.03695 [pdf, other]

AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation

Authors: Mingyu Sheng, Jianan Fan, Dongnan Liu, Ron Kikinis, Weidong Cai

Abstract: Surgical instrument segmentation (SIS) is pivotal for robotic-assisted minimally invasive surgery, assisting surgeons by identifying surgical instruments in endoscopic video frames. Recent unsupervised surgical instrument segmentation (USIS) methods primarily rely on pseudo-labels derived from low-level features such as color and optical flow, but these methods show limited effectiveness and generalizability in complex and unseen endoscopic scenarios. In this work, we propose a label-free unsupervised model featuring a novel module named Multi-View Normalized Cutter (m-NCutter). Different from previous USIS works, our model is trained using a graph-cutting loss function that leverages patch affinities for supervision, eliminating the need for pseudo-labels. The framework adaptively determines which affinities from which levels should be prioritized. Therefore, the low- and high-level features and their affinities are effectively integrated to train a label-free unsupervised model, showing superior effectiveness and generalization ability. We conduct comprehensive experiments across multiple SIS datasets to validate our approach's state-of-the-art (SOTA) performance, robustness, and exceptional potential as a pre-trained model. Our code is released at https://github.com/MingyuShengSMY/AMNCutter.

Submitted 6 November, 2024; v1 submitted 6 November, 2024; originally announced November 2024.

Comments: Accepted by the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2025)

arXiv:2410.20381 [pdf, other]

Efficient and Effective Retrieval of Dense-Sparse Hybrid Vectors using Graph-based Approximate Nearest Neighbor Search

Authors: Haoyu Zhang, Jun Liu, Zhenhua Zhu, Shulin Zeng, Maojia Sheng, Tao Yang, Guohao Dai, Yu Wang

Abstract: ANNS for embedded vector representations of texts is commonly used in information retrieval, with two important information representations being sparse and dense vectors. While it has been shown that combining these representations improves accuracy, the current method of conducting sparse and dense vector searches separately suffers from low scalability and high system complexity. Alternatively, building a unified index faces challenges with accuracy and efficiency. To address these issues, we propose a graph-based ANNS algorithm for dense-sparse hybrid vectors. Firstly, we propose a distribution alignment method to improve accuracy, which pre-samples dense and sparse vectors to analyze their distance distribution statistic, resulting in a 1%$\sim$9% increase in accuracy. Secondly, to improve efficiency, we design an adaptive two-stage computation strategy that initially computes dense distances only and later computes hybrid distances. Further, we prune the sparse vectors to speed up the calculation. Compared to naive implementation, we achieve $\sim2.1\times$ acceleration. Thorough experiments show that our algorithm achieves 8.9x$\sim$11.7x throughput at equal accuracy compared to existing hybrid vector search algorithms.

Submitted 27 October, 2024; originally announced October 2024.

Comments: 8 pages
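
The two-stage idea mentioned in the abstract above (dense scores first, hybrid scores later) can be illustrated with a minimal sketch. This is not the paper's graph-based algorithm: it is a brute-force stand-in, and the inner-product scoring, the `weight` parameter, and all function names are assumptions introduced only for illustration.

```python
import heapq
from typing import Dict, List, Sequence, Tuple

SparseVec = Dict[int, float]  # hypothetical representation: term id -> weight


def dense_ip(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner product of two dense vectors of equal length."""
    return sum(x * y for x, y in zip(a, b))


def sparse_ip(a: SparseVec, b: SparseVec) -> float:
    """Inner product of two sparse vectors, iterating over the smaller one."""
    if len(a) > len(b):
        a, b = b, a
    return sum(v * b.get(k, 0.0) for k, v in a.items())


def two_stage_search(
    q_dense: Sequence[float],
    q_sparse: SparseVec,
    corpus: List[Tuple[Sequence[float], SparseVec]],
    k: int,
    n_candidates: int,
    weight: float = 1.0,  # assumed mixing weight for the sparse part
) -> List[int]:
    """Return indices of the top-k items by hybrid score.

    Stage 1 shortlists items using the dense score only (cheap).
    Stage 2 re-scores the shortlist with dense + weight * sparse.
    """
    shortlist = heapq.nlargest(
        n_candidates,
        range(len(corpus)),
        key=lambda i: dense_ip(q_dense, corpus[i][0]),
    )
    rescored = [
        (dense_ip(q_dense, corpus[i][0]) + weight * sparse_ip(q_sparse, corpus[i][1]), i)
        for i in shortlist
    ]
    rescored.sort(reverse=True)
    return [i for _, i in rescored[:k]]


corpus = [
    ([1.0, 0.0], {0: 1.0}),
    ([0.9, 0.1], {1: 2.0}),
    ([0.0, 1.0], {1: 5.0}),
]
narrow = two_stage_search([1.0, 0.0], {1: 1.0}, corpus, k=1, n_candidates=2)
wide = two_stage_search([1.0, 0.0], {1: 1.0}, corpus, k=1, n_candidates=3)
```

With a shortlist of two, the item with the best hybrid score is never re-scored because its dense score is low, which shows the accuracy cost such a shortcut trades for speed.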

arXiv:2410.09685 [pdf, ps, other]

The small $p$-adic Simpson correspondence in the semi-stable reduction case

Authors: Mao Sheng, Yupeng Wang

Abstract: We generalize several known results on small Simpson correspondence for smooth formal schemes over $\calO_C$ to the case for semi-stable formal schemes. More precisely, for a liftable semi-stable formal scheme $\frakX$ over $\calO_C$ with generic fiber $X$, we establish (1) an equivalence between the category of Hitchin-small integral $v$-bundles on $X_{v}$ and the category of Hitchin-small Higgs bundles on $\frakX_{\et}$, generalizing the previous work of Min--Wang, and (2) an equivalence between the moduli stack of $v$-bundles on $X_{v}$ and the moduli stack of rational Higgs bundles on $\frakX_{\et}$ (equivalently, moduli stack of Higgs bundles on $X_{\et}$), generalizing the previous work of Anschütz--Heuer--Le Bras.

Submitted 12 October, 2024; originally announced October 2024.

Comments: Comments are welcome!

arXiv:2410.07783 [pdf, other]

CLIP Multi-modal Hashing for Multimedia Retrieval

Authors: Jian Zhu, Mingkai Sheng, Zhangmin Huang, Jingfei Chang, Jinling Jiang, Jian Long, Cheng Luo, Lei Liu

Abstract: Multi-modal hashing methods are widely used in multimedia retrieval, which can fuse multi-source data to generate binary hash code. However, the individual backbone networks have limited feature expression capabilities and are not jointly pre-trained on large-scale unsupervised multi-modal data, resulting in low retrieval accuracy. To address this issue, we propose a novel CLIP Multi-modal Hashing (CLIPMH) method. Our method employs the CLIP framework to extract both text and vision features and then fuses them to generate hash code. By enhancing the features of each modality, our method greatly improves the retrieval performance of multi-modal hashing. Compared with state-of-the-art unsupervised and supervised multi-modal hashing methods, experiments reveal that the proposed CLIPMH can significantly improve performance (a maximum increase of 8.38% in mAP).

Submitted 10 October, 2024; originally announced October 2024.

Comments: Accepted by 31st International Conference on MultiMedia Modeling (MMM2025)

arXiv:2408.16237 [pdf, other]

MQRLD: A Multimodal Data Retrieval Platform with Query-aware Feature Representation and Learned Index Based on Data Lake

Authors: Ming Sheng, Shuliang Wang, Yong Zhang, Kaige Wang, Jingyi Wang, Yi Luo, Rui Hao

Abstract: Multimodal data has become a crucial element in the realm of big data analytics, driving advancements in data exploration, data mining, and empowering artificial intelligence applications. To support high-quality retrieval for these cutting-edge applications, a robust multimodal data retrieval platform should meet the challenges of transparent data storage, rich hybrid queries, effective feature representation, and high query efficiency. However, among the existing platforms, traditional schema-on-write systems, multi-model databases, vector databases, and data lakes, which are the primary options for multimodal data retrieval, make it difficult to fulfill these challenges simultaneously. Therefore, there is an urgent need to develop a more versatile multimodal data retrieval platform to address these issues. In this paper, we introduce a Multimodal Data Retrieval Platform with Query-aware Feature Representation and Learned Index based on Data Lake (MQRLD). It leverages the transparent storage capabilities of data lakes, integrates the multimodal open API to provide a unified interface that supports rich hybrid queries, introduces a query-aware multimodal data feature representation strategy to obtain effective features, and offers high-dimensional learned indexes to optimize data query. We conduct a comparative analysis of the query performance of MQRLD against other methods for rich hybrid queries. Our results underscore the superior efficiency of MQRLD in handling multimodal data retrieval tasks, demonstrating its potential to significantly improve retrieval performance in complex environments. We also clarify some potential concerns in the discussion.

Submitted 8 February, 2025; v1 submitted 28 August, 2024; originally announced August 2024.

Comments: 34 pages, 28 figures

arXiv:2408.14789 [pdf, other]

Revisiting Surgical Instrument Segmentation Without Human Intervention: A Graph Partitioning View

Authors: Mingyu Sheng, Jianan Fan, Dongnan Liu, Ron Kikinis, Weidong Cai

Abstract: Surgical instrument segmentation (SIS) on endoscopic images stands as a long-standing and essential task in the context of computer-assisted interventions for boosting minimally invasive surgery. Given the recent surge of deep learning methodologies and their data-hungry nature, training a neural predictive model based on massive expert-curated annotations has been dominating and served as an off-the-shelf approach in the field, which could, however, impose a prohibitive burden on clinicians for preparing fine-grained pixel-wise labels corresponding to the collected surgical video frames. In this work, we propose an unsupervised method by reframing the video frame segmentation as a graph partitioning problem and regarding image pixels as graph nodes, which is significantly different from the previous efforts. A self-supervised pre-trained model is first leveraged as a feature extractor to capture high-level semantic features. Then, Laplacian matrices are computed from the features and are eigendecomposed for graph partitioning. On the "deep" eigenvectors, a surgical video frame is meaningfully segmented into different modules such as tools and tissues, providing distinguishable semantic information like locations, classes, and relations. The segmentation problem can then be naturally tackled by applying clustering or thresholding on the eigenvectors. Extensive experiments are conducted on various datasets (e.g., EndoVis2017, EndoVis2018, UCL, etc.) for different clinical endpoints. Across all the challenging scenarios, our method demonstrates outstanding performance and robustness higher than unsupervised state-of-the-art (SOTA) methods. The code is released at https://github.com/MingyuShengSMY/GraphClusteringSIS.git.

Submitted 6 November, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

Comments: Accepted by The 32nd ACM International Conference on Multimedia (ACM MM 2024) Workshop on Multimedia Computing for Health and Medicine (MCHM)
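
As a rough illustration of the graph-partitioning step described in the abstract above (features, then a Laplacian, then thresholding an eigenvector), the following minimal sketch splits six toy "pixels" into two groups. Everything in it is an assumption made for illustration: the 2-D feature vectors, the Gaussian affinity, the unnormalized Laplacian, and the shifted power iteration used in place of a full eigendecomposition. It is not the authors' released implementation.

```python
import math

def affinity(features):
    """Gaussian affinity W[i][j] = exp(-||f_i - f_j||^2), zero diagonal (assumed form)."""
    n = len(features)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                d2 = sum((a - b) ** 2 for a, b in zip(features[i], features[j]))
                w[i][j] = math.exp(-d2)
    return w

def laplacian(w):
    """Unnormalized graph Laplacian L = D - W."""
    n = len(w)
    lap = [[-w[i][j] for j in range(n)] for i in range(n)]
    for i in range(n):
        lap[i][i] = sum(w[i])  # node degree on the diagonal
    return lap

def fiedler_vector(lap, iters=500):
    """Approximate the eigenvector of the second-smallest eigenvalue of L.

    Runs power iteration on (shift*I - L) while projecting out the constant
    vector, which is the eigenvector for eigenvalue 0.
    """
    n = len(lap)
    shift = 2.0 * max(lap[i][i] for i in range(n))  # Gershgorin bound on eigenvalues of L
    v = [float(i) - (n - 1) / 2.0 for i in range(n)]  # deterministic zero-mean start
    for _ in range(iters):
        w = [shift * v[i] - sum(lap[i][j] * v[j] for j in range(n)) for i in range(n)]
        mean = sum(w) / n
        w = [x - mean for x in w]  # stay orthogonal to the constant vector
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v

# Hypothetical features: three "tool" pixels near (0, 0), three "tissue" pixels near (3, 3).
features = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3), (3.0, 3.0), (3.1, 2.8), (2.9, 3.2)]
vec = fiedler_vector(laplacian(affinity(features)))
labels = [1 if x > 0 else 0 for x in vec]  # threshold the eigenvector at zero
print(labels)
```

Thresholding at zero is the simplest choice; clustering several eigenvectors would be the natural extension when more than two segments are needed.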

arXiv:2407.03178 [pdf, other]

Relating CNN-Transformer Fusion Network for Change Detection

Authors: Yuhao Gao, Gensheng Pei, Mengmeng Sheng, Zeren Sun, Tao Chen, Yazhou Yao

Abstract: While deep learning, particularly convolutional neural networks (CNNs), has revolutionized remote sensing (RS) change detection (CD), existing approaches often miss crucial features due to neglecting global context and incomplete change learning. Additionally, transformer networks struggle with low-level details. RCTNet addresses these limitations by introducing (1) an early fusion backbone to exploit both spatial and temporal features early on, (2) a Cross-Stage Aggregation (CSA) module for enhanced temporal representation, (3) a Multi-Scale Feature Fusion (MSF) module for enriched feature extraction in the decoder, and (4) an Efficient Self-deciphering Attention (ESA) module utilizing transformers to capture global information and fine-grained details for accurate change detection. Extensive experiments demonstrate RCTNet's clear superiority over traditional RS image CD methods, showing significant improvement and an optimal balance between accuracy and computational cost.

Submitted 3 July, 2024; originally announced July 2024.

Comments: accepted by IEEE Conference on Multimedia Expo

arXiv:2407.02778 [pdf, other]

Foster Adaptivity and Balance in Learning with Noisy Labels

Authors: Mengmeng Sheng, Zeren Sun, Tao Chen, Shuchao Pang, Yucheng Wang, Yazhou Yao

Abstract: Label noise is ubiquitous in real-world scenarios, posing a practical challenge to supervised models due to its effect in hurting the generalization performance of deep neural networks. Existing methods primarily employ the sample selection paradigm and usually rely on dataset-dependent prior knowledge (e.g., a pre-defined threshold) to cope with label noise, inevitably degrading the adaptivity. Moreover, existing methods tend to neglect the class balance in selecting samples, leading to biased model performance. To this end, we propose a simple yet effective approach named SED to deal with label noise in a Self-adaptivE and class-balanceD manner. Specifically, we first design a novel sample selection strategy to empower self-adaptivity and class balance when identifying clean and noisy data. A mean-teacher model is then employed to correct labels of noisy samples. Subsequently, we propose a self-adaptive and class-balanced sample re-weighting mechanism to assign different weights to detected noisy samples. Finally, we additionally employ consistency regularization on selected clean samples to improve model generalization performance. Extensive experimental results on synthetic and real-world datasets demonstrate the effectiveness and superiority of our proposed method. The source code has been made available at https://github.com/NUST-Machine-Intelligence-Laboratory/SED.

Submitted 2 July, 2024; originally announced July 2024.

Comments: accepted by the European Conference on Computer Vision (ECCV), 2024

arXiv:2405.14081 [pdf, other]

Laboratory-scale Perpendicular Collisionless Shock Generation and Ion Acceleration in Magnetized Head-on Colliding Plasmas

Authors: P. Liu, D. Wu, D. W. Yuan, G. Zhao, Z. M. Sheng, X. T. He, J. Zhang

Abstract: Magnetized collisionless shocks drive particle acceleration broadly in space and astrophysics. We perform the first large-scale particle-in-cell simulations with realistic laboratory parameters (density, temperature, and velocity) to investigate the magnetized shock in head-on colliding plasmas with an applied magnetic field of tens of Tesla. It is shown that a perpendicular collisionless shock is formed with about a fourfold density jump when two pre-magnetized flows collide. This shock is also characterized by a rapid increase of neutron yield, triggered by the beam-beam nuclear reactions between injected deuterons and ones reflected by the shock. Distinct from the shocks arising from the interaction of injected flows with a magnetized background, the self-generated magnetic field in these colliding plasmas experiences a significant amplification due to the increasing diamagnetic current, reaching approximately 30 times the upstream magnetic field. Moreover, we find that ions, regardless of whether they pass through or are reflected by the shock, can gain energy by the shock surfing acceleration, generating a power-law energy spectrum. In addition, we also demonstrate that the shock mediated only by filamentation instability cannot be generated under the prevailing unmagnetized experimental parameters. These results provide a direct connection of astrophysical field amplification to the magnetized shock formation and nonthermal ion generation.

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.13260 [pdf, other]

Assessing Proton-Boron Fusion Feasibility under non-Thermal Equilibrium Conditions: Rider's Inhibition Revisited

Authors: S. J. Liu, D. Wu, B. Liu, Y. -K. M. Peng, J. Q. Dong, T. Y. Liang, Z. M. Sheng

Abstract: Compared to the D-T reaction, the neutron-free proton-boron (p-$^{11}$B) fusion has garnered increasing attention in recent years. However, significant Bremsstrahlung losses pose a formidable challenge in p-$^{11}$B plasmas in achieving $Q>1$ in thermal equilibrium. The primary aim of this study is to corroborate Todd H. Rider's seminal 1997 work in Physics of Plasmas, which investigated the feasibility of sustaining p-$^{11}$B fusion under non-thermal equilibrium conditions. Employing a series of simulations with a new fusion cross-section, we assessed the minimum recirculating power that must be recycled to maintain the system's non-thermal equilibrium and found that it is substantially greater than the fusion power output, aligning with Rider's conclusions: whether under a non-Maxwellian or a Maxwellian electron distribution, reactors reliant on non-equilibrium plasmas for p-$^{11}$B fusion are unlikely to achieve net power production without the aid of highly efficient external heat engines. However, maintaining the ion temperature at 300 keV and the Coulomb logarithm at 15, while increasing the electron temperature beyond the 23.33 keV set by Rider, leads to diminished electron-ion energy transfer and heightened Bremsstrahlung radiation. When the electron temperature approaches approximately 140 keV, this progression ultimately leads to a scenario where the power of Bremsstrahlung loss equals the power of electron-ion interactions, yet remains inferior to the fusion power. Consequently, this results in a net gain in energy production.

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2405.09947 [pdf, ps, other]

A Nonabelian Hodge Correspondence for Principal Bundles in Positive Characteristic

Authors: Mao Sheng, Hao Sun, Jianping Wang

Abstract: In this paper, we prove a nonabelian Hodge correspondence for principal bundles on a smooth variety $X$ in positive characteristic, which generalizes the Ogus-Vologodsky correspondence for vector bundles. Then we extend the correspondence to logahoric torsors over a log pair $(X,D)$, where $D$ is a reduced normal crossing divisor in $X$. As an intermediate step, we prove a correspondence between principal bundles on root stacks $\mathscr{X}$ and parahoric torsors on $(X,D)$, which generalizes the correspondence on curves given by Balaji--Seshadri to the higher-dimensional case.

Submitted 2 October, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: 31 pages

MSC Class: 14C30; 14L15; 20G15

arXiv:2405.04784 [pdf, other]

doi 10.1103/PhysRevD.110.023512

Lagrangian space remapping and the angular momentum reconstruction from cosmic structures

Authors: Sijia Li, Ming-Jie Sheng, Haikun Li, Hao-Ran Yu

Abstract: Large scale structures provide valuable information of the primordial perturbations that encode the secrets of the origin of the Universe. It is an essential step to map between observables and their initial coordinates, called Lagrangian space, from which primordial perturbations transfer their information to structures via linear theory. By using numerical simulations and state-of-the-art reconstruction techniques, we report the accuracy of estimating the Lagrangian coordinates of galaxies and galaxy clusters, represented by dark matter halos in various ranges of mass, and study the accuracy of this remapping on the angular momentum (spin) reconstruction. Our work shows that galaxy groups and clusters, represented by halos with mass $\gtrsim 10^{13}M_\odot$, can be accurately remapped to Lagrangian space, and their spin reconstruction errors are dominated by the reconstructed initial gravitational potential. For all mass ranges, the errors of Lagrangian remapping, as well as redshift space distortions, play subdominant roles in estimating their angular momenta. This study explains the low correlation level between observed galaxy spins and reconstructed cosmic initial conditions and illustrates the potential of using angular momenta of cosmic structures to improve the reconstruction of primordial perturbations.

Submitted 8 July, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

Comments: 7 pages, 4 figures, 1 table. Matches the accepted version in Physical Review D

Journal ref: Physical Review D, 110, 023512, 2024

arXiv:2404.10969 [pdf, other]

Integrated Communication, Navigation, and Remote Sensing in LEO Networks with Vehicular Applications

Authors: Min Sheng, Chongtao Guo, Lei Huang

Abstract: Traditionally, communication, navigation, and remote sensing (CNR) satellites are operated separately, leading to resource waste, information isolation, and independent optimization for each functionality. Taking future automated driving as an example, it faces great challenges in providing highly reliable and low-latency lane-level positioning, decimeter-level transportation observation, and huge traffic sensing information downloading. To this end, this article proposes an integrated CNR (ICNR) framework based on low Earth orbit (LEO) satellite mega-constellations. After introducing the main working principles of the CNR functionalities to serve as the technological basis, we characterize the potentials of the integration gain in vehicular use cases. Then, we investigate the ICNR framework in different integration levels, which sheds strong light on qualitative performance improvement by sophisticatedly sharing orbit constellation, wireless resource, and data information towards meeting the requirements of vehicular applications. We also instantiate a fundamental numerical case study to demonstrate the integration gain and highlight possible future research directions in managing the ICNR networks.

Submitted 20 September, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

Comments: This article has been accepted by IEEE Wireless Communications Magazine

arXiv:2404.10432 [pdf, other]

doi 10.1051/0004-6361/202450397

The evolutionary pathways of disk galaxies with different sizes

Authors: Hong-Chuan Ma, Min Du, Luis C. Ho, Ming-jie Sheng, Shihong Liao

Abstract: From the IllustrisTNG-50 simulation, a sample of 836 central disk galaxies with tiny stellar halos is chosen to study the inherent evolution of galaxies driven by nature. These galaxies are classified as compact, normal, or extended by referencing their locations on the mass-size ($M_\star-R_{\rm 1/2}$) diagram. This research demonstrates the distinctive evolutionary pathways of galaxies with different sizes in IllustrisTNG simulations, primarily driven by nature. It is confirmed that disk galaxies inherit the angular momentum of their parent dark matter halos. More compact galaxies form earlier within halos possessing lower specific angular momentum through heightened star formation during the early phase at redshifts above 2. During the later phase, the size of extended galaxies experiences more pronounced growth by accreting gas with high angular momentum. Additionally, we reveal that many key characteristics of galaxies are linked to their mass and size: (1) compact galaxies tend to exhibit higher metal content, proportional to the potential well $\frac{M_\star}{R_{\rm 1/2}}$, (2) compact galaxies host more massive bulges and black holes, and higher central concentration. Furthermore, our analysis indicates that galaxies of all types continue to actively engage in star formation, with no evident signs of quenching attributed to their varying sizes and angular momenta.

Submitted 2 July, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

Comments: 11 pages, 11 figures. Accepted for publication in A&A

Journal ref: A&A 689, A293 (2024)

arXiv:2404.08216 [pdf, other]

Role of nonlocal heat transport on the laser ablative Rayleigh-Taylor instability

Authors: Z. H. Chen, X. H. Yang, G. B. Zhang, Y. Y. Ma, R. Yan, H. Xu, Z. M. Sheng, F. Q. Shao, J. Zhang

Abstract: Ablative Rayleigh-Taylor instability (ARTI) and nonlocal heat transport are critical problems in laser-driven inertial confinement fusion, while their coupling with each other is not yet completely understood. Here the ARTI in the presence of nonlocal heat transport is studied self-consistently for the first time, both theoretically and by using radiation hydrodynamic simulations. It is found that the nonlocal heat flux generated by the hot electron transport tends to attenuate the growth of instability, especially for short wavelength perturbations. A linear theory of the ARTI coupled with the nonlocal heat flux is developed, and a prominent stabilization of the ablation front via the nonlocal heat flux is found, in good agreement with numerical simulations. This effect becomes more significant as the laser intensity increases. Our results should provide an important reference for target design in inertial confinement fusion.

Submitted 11 April, 2024; originally announced April 2024.

Comments: 8 pages, 5 figures

arXiv:2403.07699 [pdf, other]

Ion Kinetics and Neutron Generation Associated with Electromagnetic Turbulence in Laboratory-scale Counter-streaming Plasmas

Authors: P. Liu, D. Wu, T. X. Hu, D. W. Yuan, G. Zhao, Z. M. Sheng, X. T. He, J. Zhang

Abstract: Electromagnetic turbulence and ion kinetics in counter-streaming plasmas hold great significance in laboratory astrophysics, such as turbulence field amplification and particle energization. Here, we quantitatively demonstrate for the first time how electromagnetic turbulence affects ion kinetics under achievable laboratory conditions (millimeter-scale interpenetrating plasmas with initial velocity of $2000\ \mathrm{km/s}$, density of $4 \times 10^{19}\ \mathrm{cm}^{-3}$, and temperature of $100\ \mathrm{eV}$) utilizing a recently developed high-order implicit particle-in-cell code without scaling transformation. It is found that the electromagnetic turbulence is driven by ion two-stream and filamentation instabilities. For the magnetized scenarios where an applied magnetic field of tens of Tesla is perpendicular to plasma flows, the growth rates of instabilities increase with the strengthening of the applied magnetic field, which therefore leads to a significant enhancement of turbulence fields. Under the competition between the stochastic acceleration due to electromagnetic turbulence and collisional thermalization, the ion distribution function shows a distinct super-Gaussian shape, and the ion kinetics are manifested in neutron yields and spectra. Our results explain the recent unmagnetized experimental observations well, and the findings for the magnetized scenario can be verified by current astrophysical experiments.

Submitted 12 March, 2024; originally announced March 2024.

Comments: Accepted by Phys. Rev. Lett. on 12 Mar

arXiv:2402.11242 [pdf, other]

Learning with Imbalanced Noisy Data by Preventing Bias in Sample Selection

Authors: Huafeng Liu, Mengmeng Sheng, Zeren Sun, Yazhou Yao, Xian-Sheng Hua, Heng-Tao Shen

Abstract: Learning with noisy labels has gained increasing attention because the inevitable imperfect labels in real-world scenarios can substantially hurt the deep model performance. Recent studies tend to regard low-loss samples as clean ones and discard high-loss ones to alleviate the negative impact of noisy labels. However, real-world datasets contain not only noisy labels but also class imbalance. The imbalance issue is prone to causing failure in the loss-based sample selection since the under-learning of tail classes also tends to produce high losses. To this end, we propose a simple yet effective method to address noisy labels in imbalanced datasets. Specifically, we propose Class-Balance-based sample Selection (CBS) to prevent the tail class samples from being neglected during training. We propose Confidence-based Sample Augmentation (CSA) for the chosen clean samples to enhance their reliability in the training process. To exploit selected noisy samples, we resort to prediction history to rectify labels of noisy samples. Moreover, we introduce the Average Confidence Margin (ACM) metric to measure the quality of corrected labels by leveraging the model's evolving training dynamics, thereby ensuring that low-quality corrected noisy samples are appropriately masked out. Lastly, consistency regularization is imposed on filtered label-corrected noisy samples to boost model performance. Comprehensive experimental results on synthetic and real-world datasets demonstrate the effectiveness and superiority of our proposed method, especially in imbalanced scenarios.

Submitted 17 February, 2024; originally announced February 2024.

Comments: accepted by IEEE Transactions on Multimedia

arXiv:2401.11894 [pdf, other]

Exact Normal Modes of Quantum Plasmas

Authors: Tian-Xing Hu, Dong Wu, Z. M. Sheng, J. Zhang

Abstract: The normal modes, i.e., the eigen solutions to the dispersion relation equation, are the most fundamental properties of a plasma, and are also of key importance to many nonlinear effects such as parametric and two-plasmon decay, and Raman scattering. The real part indicates the intrinsic oscillation frequency while the imaginary part indicates the Landau damping rate. In most of the literature, the normal modes of quantum plasmas are obtained by means of the small damping approximation (SDA), which is invalid for high-$k$ modes. In this paper, we solve the exact dispersion relations via the analytical continuation (AC) scheme, and, due to the multi-value nature of the Fermi-Dirac distribution, reformation of the complex Riemann surface is required. It is found that the change of the topological shape of the root locus in quantum plasmas is quite different from classical plasmas, in which both real and imaginary frequencies of high-$k$ modes increase with $k$ in a steeper way than the typical linear behaviour as appears in classical plasmas. As a result, the temporal evolution of a high-$k$ perturbation in quantum plasmas is dominated by the ballistic modes.

Submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.11891 [pdf, other]

Validation of Classical Transport Cross Section for Ion-Ion Interactions Under Repulsive Yukawa Potential

Authors: Tian-Xing Hu, Dong Wu, C. L. Lin, Z. M. Sheng, B. He, J. Zhang

Abstract: The value of the cross section is a fundamental parameter to depict the transport of charged particles in matter. Due to masses of orders of magnitude higher than electrons and convenience of realistic calculation, the cross section of elastic nuclei-nuclei collision is usually treated via classical mechanics. The famous Bohr criterion was first proposed to judge whether the treatment via classical mechanics is reliable or not. Later, Lindhard generalized the results of Coulomb to screening potentials. Considering the increasing importance of detailed ion-ion interactions under modern simulation codes in inertial confinement fusion (ICF) research, the validation of the classical transport cross section for ion-ion interactions in a big range of parameter space is certainly required. In this work, the transport cross sections via classical mechanics under repulsive Yukawa potential are compared with those via quantum mechanics. Differences of differential cross sections are found with respect to scattering angles and velocities. Our results generally indicate that the classical picture fails at the cases of both low and high velocities, which represents a significant extension of the famous Bohr criterion and its generalized variations. Furthermore, the precise validation zones of the classical picture are also analysed in this work. This work is of significant importance for benchmarking the modern ion-kinetic simulation codes in ICF research, concerning the stopping power of $α$ particles in DT fuels, ion-ion friction and viscous effects in the formation of kinetic shocks.

Submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.09956 [pdf, ps, other]

On the Existence of Gr-semistable Filtrations of Orthogonal/Symplectic $λ$-connections

Authors: Mao Sheng, Hao Sun, Jianping Wang

Abstract: In this paper, we study the existence of gr-semistable filtrations of orthogonal/symplectic $λ$-connections. It is known that gr-semistable filtrations always exist for flat bundles in arbitrary characteristic. However, we found a counterexample of orthogonal flat bundles of rank 5 in positive characteristic. The central new idea in this example is the notion of quasi gr-semistability for orthogonal/symplectic $λ$-connections. We establish the equivalence between gr-semistability and quasi gr-semistability for an orthogonal/symplectic $λ$-connection. This provides a way to determine whether an orthogonal/symplectic $λ$-connection is gr-semistable. As an application, we obtain a characterization of gr-semistable orthogonal $λ$-connections of rank $\leq 6$.

Submitted 15 February, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

Comments: 35 pages

MSC Class: 14D07; 14J60

arXiv:2401.09757 [pdf, other]

Cooperative Tri-Point Model-Based Ground-to-Air Coverage Extension in Beyond 5G Networks

Authors: Ziwei Cai, Min Sheng, Junju Liu, Chenxi Zhao, Jiandong Li

Abstract: The utilization of existing terrestrial infrastructures to provide coverage for aerial users is a potentially low-cost solution. However, the already deployed terrestrial base stations (TBSs) result in weak ground-to-air (G2A) coverage due to the down-tilted antennas. Furthermore, achieving optimal coverage across the entire airspace through antenna adjustment is challenging due to the complex signal coverage requirements in three-dimensional space, especially in the vertical direction. In this paper, we propose a cooperative tri-point (CoTP) model-based method that utilizes cooperative beams to enhance the G2A coverage extension. To utilize existing TBSs for establishing effective cooperation, we prove that the cooperation among three TBSs can ensure G2A coverage with a minimum coverage overlap, and design the CoTP model to analyze the G2A coverage extension. Using the model, a cooperative coverage structure based on Delaunay triangulation is designed to divide triangular prism-shaped subspaces and corresponding TBS cooperation sets. To enable TBSs in the cooperation set to cover different height subspaces while maintaining ground coverage, we design a cooperative beam generation algorithm to maximize the coverage in the triangular prism-shaped airspace. The simulation results and field trials demonstrate that the proposed method can efficiently enhance the G2A coverage extension while guaranteeing ground coverage.

Submitted 18 January, 2024; originally announced January 2024.

arXiv:2401.00445 [pdf, ps, other]

Energy-Efficient Power Control for Multiple-Task Split Inference in UAVs: A Tiny Learning-Based Approach

Authors: Chenxi Zhao, Min Sheng, Junyu Liu, Tianshu Chu, Jiandong Li

Abstract: The limited energy and computing resources of unmanned aerial vehicles (UAVs) hinder the application of aerial artificial intelligence. The utilization of split inference in UAVs garners significant attention due to its effectiveness in mitigating computing and energy requirements. However, achieving energy-efficient split inference in UAVs remains complex given various crucial parameters such as energy level and delay constraints, especially when multiple tasks are involved. In this paper, we present a two-timescale approach for energy minimization in split inference, where discrete and continuous variables are segregated into two timescales to reduce the size of the action space and the computational complexity. This segregation enables the utilization of tiny reinforcement learning (TRL) for selecting discrete transmission modes for sequential tasks. Moreover, optimization programming (OP) is embedded between TRL's output and reward function to optimize the continuous transmit power. Specifically, we replace the optimization of transmit power with that of transmission time to decrease the computational complexity of OP since we reveal that energy consumption monotonically decreases with increasing transmission time. The replacement significantly reduces the feasible region and enables a fast solution according to the closed-form expression for optimal transmit power. Simulation results show that the proposed algorithm can achieve a higher probability of successful task completion with lower energy consumption.

Submitted 31 December, 2023; originally announced January 2024.

arXiv:2312.17516 [pdf, other]

Robust TOA-based Localization with Inaccurate Anchors for MANET

Authors: Xinkai Yu, Yang Zheng, Min Sheng, Yan Shi, Jiandong Li

Abstract: Accurate node localization is vital for mobile ad hoc networks (MANETs). Current methods like Time of Arrival (TOA) can estimate node positions using imprecise baseplates and achieve the Cramér-Rao lower bound (CRLB) accuracy. In multi-hop MANETs, some nodes lack direct links to base anchors, depending on neighbor nodes as dynamic anchors for chain localization. However, the dynamic nature of MANETs challenges TOA's robustness due to the availability and accuracy of base anchors, coupled with ranging errors. To address the issue of cascading positioning error divergence, we first derive the CRLB for any primary node in MANETs as a metric to tackle localization error in cascading scenarios. Second, we propose an advanced two-step TOA method based on CRLB which is able to approximate target node's CRLB with only local neighbor information. Finally, simulation results confirm the robustness of our algorithm, achieving CRLB-level accuracy for small ranging errors and maintaining precision for larger errors compared to existing TOA methods.

Submitted 29 December, 2023; originally announced December 2023.

arXiv:2312.16971 [pdf, other]

High Throughput Inter-Layer Connecting Strategy for Multi-Layer Ultra-Dense Satellite Networks

Authors: Qi Hao, Di Zhou, Min Sheng, Yan Shi, Jiandong Li

Abstract: Multi-layer ultra-dense satellite networks (MLUDSNs) have soared meteorically to provide vast throughput for globally diverse services. Differing from traditional monolayer constellations, MLUDSNs emphasize the spatial integration among layers, and their throughput may not be simply the sum of the throughput of each layer. The hop-count of cross-layer communication paths can be reduced by deploying inter-layer connections (ILCs), augmenting MLUDSN's throughput. Therefore, it remains an open issue how to deploy ILCs to optimize the dynamic MLUDSN topology to dramatically raise throughput gains under multi-layer collaboration. This paper designs an ILC deployment strategy to enhance throughput by revealing the impacts of ILC distribution on reducing hop-count. Since deploying ILCs burdens the satellite with extra communication resource consumption, we model the ILC deployment problem as minimizing the average hop with limited ILCs, to maximize throughput. The proposed problem is a typical integer linear programming (ILP) problem, whose computational complexity grows exponentially as the satellite scale expands and time evolves. Based on the symmetrical topology of each layer, we propose a two-phase deployment scheme to halve the problem scale and prioritize stable ILCs to reduce handover-count, which decreases the exponential complexity to a polynomial one, with 1% estimation error. Simulation results based on realistic megaconstellation information confirm that the optimal number of ILCs is less than P.S/2, where P and S are orbits and satellites per orbit. Besides, these ILCs deploy uniformly in each layer, which yields over 1.55x the throughput of isolated layers.

Submitted 28 December, 2023; originally announced December 2023.

arXiv:2312.09621 [pdf, other]

Inter-domain Resource Collaboration in Satellite Networks: An Intelligent Scheduling Approach Towards Hybrid Missions

Authors: Chenxi Bao, Di Zhou, Min Sheng, Yan Shi, Jiandong Li

Abstract: Since the next-generation satellite network consisting of various service function domains, such as communication, observation, navigation, etc., is moving towards large-scale, it is difficult to provide satisfactory and timely service guarantees for the rapidly increasing mission demands of each domain using single-domain resources. Breaking the barriers of independence of resources in each domain, and realizing the cross-domain transmission of missions to efficiently collaborate inter-domain resources is a promising solution. However, the hybrid scheduling of different missions and the continuous increase in the number of service domains have strengthened the differences and dynamics of mission demands, making efficient cross-domain mission scheduling (CMS) challenging. To this end, this paper first accurately characterizes the inter-satellite communication resource state in real time by exploiting the sparse resource representation scheme, and systematically characterizes the differentiation of mission demands by constructing the mission priority model. Based on the information of resources and missions, we construct the top- and bottom-layer mission scheduling models of reward association exploiting the correlation of intra- and inter-domain mission scheduling and formulate the Markov decision process-based hierarchical CMS problem. Further, to achieve higher adaptability and autonomy of CMS and efficiently mitigate the impact of network scale, a hierarchical intelligent CMS algorithm is developed to dynamically adjust and efficiently match the CMS policy according to different mission demands. Simulation results demonstrate that the proposed algorithm has significant performance gain compared with independent domains and the existing CMS algorithms, and can still guarantee high service performance under different network scales.

Submitted 15 December, 2023; originally announced December 2023.

arXiv:2312.09505 [pdf, other]

Adaptive Integration of Partial Label Learning and Negative Learning for Enhanced Noisy Label Learning

Authors: Mengmeng Sheng, Zeren Sun, Zhenhuang Cai, Tao Chen, Yichao Zhou, Yazhou Yao

Abstract: There has been significant attention devoted to the effectiveness of various domains, such as semi-supervised learning, contrastive learning, and meta-learning, in enhancing the performance of methods for noisy label learning (NLL) tasks. However, most existing methods still depend on prior assumptions regarding clean samples amidst different sources of noise (e.g., a pre-defined drop rate or a small subset of clean samples). In this paper, we propose a simple yet powerful idea called NPN, which revolutionizes Noisy label learning by integrating Partial label learning (PLL) and Negative learning (NL). Toward this goal, we initially decompose the given label space adaptively into the candidate and complementary labels, thereby establishing the conditions for PLL and NL. We propose two adaptive data-driven paradigms of label disambiguation for PLL: hard disambiguation and soft disambiguation. Furthermore, we generate reliable complementary labels using all non-candidate labels for NL to enhance model robustness through indirect supervision. To maintain label reliability during the later stage of model training, we introduce a consistency regularization term that encourages agreement between the outputs of multiple augmentations. Experiments conducted on both synthetically corrupted and real-world noisy datasets demonstrate the superiority of NPN compared to other state-of-the-art (SOTA) methods. The source code has been made available at https://github.com/NUST-Machine-Intelligence-Laboratory/NPN.

Submitted 14 December, 2023; originally announced December 2023.

Comments: accepted by AAAI 2024

arXiv:2312.07936 [pdf, other]

Coordinated Intra- and Inter-system Interference Management in Integrated Satellite Terrestrial Networks

Authors: Ziyue Zhang, Min Sheng, Junyu Liu, Jiandong Li

Abstract: Leveraging the advantage of satellite and terrestrial networks, the integrated satellite terrestrial networks (ISTNs) can help to achieve seamless global access and eliminate the digital divide. However, the dense deployment and frequent handover of satellites aggravate intra- and inter-system interference, resulting in a decrease in downlink sum rate. To address this issue, we propose a coordinated intra- and inter-system interference management algorithm for ISTN. This algorithm coordinates multidimensional interference through a joint design of inter-satellite handover and resource allocation method. On the one hand, we take inter-system interference between low earth orbit (LEO) and geostationary orbit (GEO) satellites as a constraint, and reduce interference to GEO satellite ground stations (GEO-GS) while ensuring system capacity through inter-satellite handover. On the other hand, satellite and terrestrial resource allocation schemes are designed based on the matching idea, and channel gain and interference to other channels are considered during the matching process to coordinate co-channel interference. In order to avoid too many unnecessary handovers, we consider handover scenarios related to service capabilities and service time to determine the optimal handover target satellite. Numerical results show that the gap between the results on the system sum rate obtained by the proposed method and the upper bound is reduced as the user density increases, and the handover frequency can be significantly reduced.

Submitted 13 December, 2023; originally announced December 2023.

arXiv:2311.14943 [pdf, ps, other]

doi 10.1017/hpl.2024.7

Generation of polarized electron beams through self-injection in the interaction of a laser with a pre-polarized plasma

Authors: L. R. Yin, X. F. Li, Y. J. Gu, N. Cao, Q. Kong, M. Buescher, S. M. Weng, M. Chen, Z. M. Sheng

Abstract: Polarized electron beam production via laser wakefield acceleration in pre-polarized plasma is investigated by particle-in-cell simulations. The evolution of the electron beam polarization is studied based on the Thomas-Bargmann-Michel-Telegdi equation for the transverse and longitudinal self-injection, and the depolarization process is found to be influenced by the injection schemes. In the case of transverse self-injection as found typically in the bubble regime, the spin precession of the accelerated electrons is mainly influenced by the wakefield. However, in the case of longitudinal injection in the quasi-one-dimensional regime (for example, F. Y. Li et al., Phys. Rev. Lett. 110, 135002 (2013)), the direction of electron spin oscillates in the laser field. Since the electrons move around the laser axis, the net influence of the laser field is nearly zero and the contribution of the wakefield can be ignored. Finally, an ultra-short electron beam with polarization of $99\%$ can be obtained using longitudinal self-injection.

Submitted 25 November, 2023; originally announced November 2023.

Comments: 7 pages, 4 figures

Journal ref: High Pow Laser Sci Eng 12 (2024) e28

arXiv:2311.07969 [pdf, other]

doi 10.1103/PhysRevD.109.123548

Spin speed correlations and the evolution of galaxy-halo systems

Authors: Ming-Jie Sheng, Lin Zhu, Hao-Ran Yu, Hongchuan Ma, Haikun Li, Peng Wang, Xi Kang

Abstract: Galaxy angular momenta (spins) contain valuable cosmological information, complementing their positions and velocities. The baryonic spin direction of galaxies has been probed as a reliable tracer of their host halos and the primordial spin modes. Here we use the TNG100 simulation of the IllustrisTNG project to study the spin magnitude correlations between dark matter, gas, and stellar components of galaxy-halo systems and their evolutions across cosmic history. We find that these components generate similar initial spin magnitudes from the same tidal torque in Lagrangian space. At low redshifts, the gas component still traces the spin magnitude of the dark matter halo and the primordial spin magnitude. However, the traceability of the stellar component depends on the $ex$ $situ$ stellar mass fraction, $f_{\rm acc}$. Our results suggest that the galaxy baryonic spin magnitude can also serve as a tracer of their host halo and the initial perturbations, and the galaxy-halo correlations are affected by the similarity of their evolution histories.

Submitted 2 July, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

Comments: 10 pages, 10 figures. Matches the accepted version in Physical Review D

Journal ref: Physical Review D, 109, 123548, 2024

arXiv:2308.11797 [pdf, other]

CLIP Multi-modal Hashing: A new baseline CLIPMH

Authors: Jian Zhu, Mingkai Sheng, Mingda Ke, Zhangmin Huang, Jingfei Chang

Abstract: The multi-modal hashing method is widely used in multimedia retrieval. It can fuse multi-source data to generate binary hash code. However, the current multi-modal methods have the problem of low retrieval accuracy. The reason is that the individual backbone networks have limited feature expression capabilities and are not jointly pre-trained on large-scale unsupervised multi-modal data. To solve this problem, we propose a new baseline CLIP Multi-modal Hashing (CLIPMH) method. It uses the CLIP model to extract text and image features and then fuses them to generate the hash code. CLIP improves the expressiveness of each modal feature. In this way, it can greatly improve the retrieval performance of multi-modal hashing methods. In comparison to state-of-the-art unsupervised and supervised multi-modal hashing methods, experiments reveal that the proposed CLIPMH can significantly enhance performance (maximum increase of 8.38%). CLIP also has great advantages over the text and visual backbone networks commonly used before.

Submitted 22 August, 2023; originally announced August 2023.

Comments: submit to ICASSP2024

arXiv:2307.11168 [pdf, other]

doi 10.1016/j.enggeo.2019.105306

On the hydraulic fracturing in naturally-layered porous media using the phase field method

Authors: Xiaoying Zhuang, Shuwei Zhou, Mao Sheng, Gensheng Li

Abstract: In the hydraulic fracturing of natural rocks, understanding and predicting crack penetrations into the neighboring layers is crucial and relevant in terms of cost-efficiency in engineering and environmental protection. This study constitutes a phase field framework to examine hydraulic fracture propagation in naturally-layered porous media. Biot's poroelasticity theory is used to couple the displacement and flow field, while a phase field method helps characterize fracture growth behavior. Additional fracture criteria are not required and fracture propagation is governed by the equation of phase field evolution. Thus, penetration criteria are not required when hydraulic fractures reach the material interfaces. The phase field method is implemented within a staggered scheme that sequentially solves the displacement, phase field, and fluid pressure. We consider the soft-to-stiff and the stiff-to-soft configurations, where the layer interface exhibits different inclination angles $θ$. Penetration, singly-deflected, and doubly-deflected fracture scenarios can be predicted by our simulations. In the soft-to-stiff configuration, $θ=0^\circ$ exhibits penetration or symmetrical doubly-deflected scenarios, and $θ=15^\circ$ exhibits singly-deflected or asymmetric doubly-deflected scenarios. Only the singly-deflected scenario is obtained for $θ=30^\circ$. In the stiff-to-soft configuration, only the penetration scenario is obtained with widening fractures when hydraulic fractures penetrate into the soft layer.

Submitted 11 July, 2023; originally announced July 2023.

Journal ref: Engineering Geology, 2020, 266: 105306

arXiv:2304.08357 [pdf, other]

A high-efficiency proton-boron fusion scheme taking into account the effects of quantum degeneracy

Authors: S. J. Liu, D. Wu, T. X. Hu, T. Y. Liang, X. C. Ning, J. H. Liang, Y. C. Liu, P. Liu, X. Liu, Z. M. Sheng, Y. T. Zhao, D. H. H. Hoffmann, X. T. He, J. Zhang

Abstract: The proton-boron (p-$^{11}$B) reaction is regarded as the holy grail of advanced fusion fuels, since the primary reaction produces three $α$ particles with few neutrons and induced radio-activities from second order reactions. Compared to the Deuterium-Tritium reaction a much higher reaction temperature is required. Moreover, bremsstrahlung energy losses due to the high nuclear charge of boron deem it seemingly apparent that a fusion reactor based on Deuterium-Tritium plasma in equilibrium is to say the least very difficult. It is becoming more appealing to collide intense laser beams or accelerated proton beams with a boron target to produce p-$^{11}$B reactions. The fusion yield of p-$^{11}$B reactions is closely related to proton beam parameters and boron target conditions such as density, temperature, and ingredients. Quantum degeneracy will increase fusion yields by reducing the stopping power of injected protons. In this work, we suggest a high-efficiency scheme for beam-target p-$^{11}$B fusions via injecting a MeV proton beam into a highly compressed quantum degenerated boron target. Such a boron target can be achieved via quasi-isentropic compression of solid boron by using precisely shaped laser pulses. Our results indicate that for densities ranging from $10^3$ to $10^4ρ_s$, where $ρ_s$ is the density of solid boron, contributions of bound and free electrons to the stopping of protons can be completely disregarded and dramatically reduced respectively. The result is an increase in fusion yield by orders of magnitude. Furthermore, in order to achieve multiplication factor $F$ greater than one, with $F$ defined as the ratio of output fusion energy to the energy of injected protons, it is found there exists a minimum possible density of boron target, which is $2.15 \times 10^4 ρ_s$ when the kinetic energy of injected protons is $0.8$ MeV.

Submitted 17 April, 2023; originally announced April 2023.

arXiv:2303.10584 [pdf, other]

doi 10.1103/PhysRevB.107.125126

Magnetic phase diagrams and large magnetocaloric effects of the two-dimensional antiferromagnetic triangular lattice of Gd$^{3+}$ ions in KBaGd(BO$_3$)$_2$

Authors: Z. M. Song, N. Zhao, H. Ge, T. T. Li, J. Yang, L. Wang, Y. Fu, Y. Z. Zhang, S. M. Wang, J. W. Mei, H. He, S. Guo, L. S. Wu, J. M. Sheng

Abstract: We report a detailed study of the magnetic properties of KBaGd(BO$_3$)$_2$, in which magnetic Gd$^{3+}$ ($S=7/2$) ions form into two-dimensional triangular layers. Magnetization, specific heat and magnetocaloric effect (MCE) measurements have been performed on KBaGd(BO$_3$)$_2$ single crystals. The results show that a long-range antiferromagnetic state is established below $T_{\rm N}=0.24$ K. In zero fields, only about half of the full entropy is released at $T_{\rm N}$, indicating that not all the magnetic moments are frozen below the ordering temperature, as expected from the geometrical frustration of the triangular spin lattice. Further studies under external fields were performed down to 50 mK, and the magnetic phase diagrams are established with magnetic fields applied both within and perpendicular to the triangular plane. KBaGd(BO$_3$)$_2$ serves as an example of a two-dimensional triangular lattice with large spin values ($S=7/2$) and can be directly compared with the iso-structure KBaR(BO$_3$)$_2$ (R = Dy-Yb) family of doublet ground states, which exhibit effective spins of $S=1/2$.

Submitted 19 March, 2023; originally announced March 2023.

Comments: 8 pages, 5 figures

Journal ref: Physical Review B 107, 125126 (2023)

arXiv:2303.10405 [pdf, other]

doi 10.1103/PhysRevB.105.014441

Antiferromagnetism and Ising Ground States in the Rare-earth Garnet Nd$_3$Ga$_5$O$_{12}$

Authors: N. Zhao, H. Ge, L. Zhou, Z. M. Song, J. Yang, T. T. Li, L. Wang, Y. Fu, Y. F. Zhang, J. B. Xu, S. M. Wang, J. W. Mei, X. Tong, L. S. Wu, J. M. Sheng

Abstract: In this paper, we investigate the low temperature magnetic properties of the rare-earth garnet compound Nd$_3$Ga$_5$O$_{12}$ in detail by means of magnetization, specific heat and magnetocaloric effect measurements. The magnetic thermal properties along with the crystal field calculations reveal that the Nd$^{3+}$ ions, which form a frustrated hyper-kagome lattice of connected triangles, have an Ising-like ground state with the easy axis along the local [100], [010] and [001] directions. Instead of a quantum spin liquid ground state, an antiferromagnetically ordered state is found below $T_{\mathrm{N}}=0.52~\rm K$. With a field applied in the [111] direction, the antiferromagnetic order is suppressed at the critical field of $B_{\mathrm{c}}=0.75~\rm T$, and enhancement of the critical fluctuations with linear crossover behaviors is observed near the critical point.

Submitted 18 March, 2023; originally announced March 2023.

Comments: 7 pages, 6 figures

Journal ref: Physical Review B 105, 014441 (2022)

arXiv:2303.08673 [pdf, other]

doi 10.1103/PhysRevMaterials.6.085001

Successive magnetic orderings in the Ising spin chain magnet DyNi$_5$Ge$_3$

Authors: H. Ge, L. Zhang, N. Zhao, J. Yang, L. Wang, L. Zhou, Y. Fu, T. T. Li, Z. M. Song, F. Ding, J. B. Xu, Y. F. Zhang, S. M. Wang, J. W. Mei, X. Tong, P. Miao, H. He, Q. Zhanghang, L. S. Wu, J. M. Sheng

Abstract: In this report, we investigated a new rare earth based one-dimensional Ising spin chain magnet DyNi$_5$Ge$_3$ by means of magnetization, specific heat and powder neutron diffraction measurements. Due to the crystalline electrical field splitting, the magnetic Dy ions share an Ising-like ground doublet state. Owing to the local point symmetry, these Ising moments form into two canted magnetic sublattices, which were further confirmed by the angle-dependent magnetization measurement. In zero fields, two successive antiferromagnetic phase transitions were found at temperatures $T_{\mathrm{N1}}=6~\rm K$ and $T_{\mathrm{N2}}=5~\rm K$, respectively. Only part of the moments are statically ordered in this intermediate state between $T_{\mathrm{N1}}$ and $T_{\mathrm{N2}}$. Powder neutron diffraction experiments at different temperatures were performed as well. An incommensurate magnetic propagation vector of $\mathbf{k_{\rm m}}=(0.5,0.4,0.5)$ was identified. The refined spin configurations through the irreducible representation analysis confirmed that these Ising spins are canted in the crystal $ab$ plane.

Submitted 15 March, 2023; originally announced March 2023.

Comments: 9 pages, 7 figures

Journal ref: Physical Review MATERIALS 6, 085001 (2022)

arXiv:2302.09231 [pdf, ps, other]

An explicit infinite homotopy in nonabelian Hodge theory in positive characteristic

Authors: Mao Sheng, Zebao Zhang

Abstract: This short note is extracted from Section 3 and the Appendix of the paper entitled "Intersection de Rham complexes in positive characteristic" by the same authors, where an explicit infinite homotopy from a Higgs complex to the Frobenius pushforward of the corresponding de Rham complex in positive characteristic has been provided. The verification details, which are omitted therein, are provided here.

Submitted 17 February, 2023; originally announced February 2023.

Comments: 13 pages. Comments are very welcome! arXiv admin note: text overlap with arXiv:1904.06651

MSC Class: 14C30; 14D07; 14F43; 14G17

arXiv:2212.02038 [pdf, ps, other]

A torsion property of the zero of Kodaira-Spencer over $\mathbb{P}^1$ removing four points

Authors: Xiaojin Lin, Mao Sheng, Jianping Wang

Abstract: We establish a torsion theorem to the effect that the unique zero of the Kodaira-Spencer map attached to a certain quasi-semistable family of complex projective varieties over the complex projective line is the image of a torsion point of an elliptic curve under the natural projection. The proof is a mod $p$ argument and requires a density one set of primes. There are three essential ingredients in the proof: a solution to the conjecture of Sun-Yang-Zuo, which constitutes the principal part of the paper, Pink's theorem, and Higgs periodicity theorem.

Submitted 10 May, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

Comments: 3rd version. Acknowledgement to the initial discovery of the torsion phenomenon due to Jingbang Yang and Kang Zuo as proved in the paper added

MSC Class: 14D05; 14D07

arXiv:2210.04203 [pdf, other]

doi 10.3847/1538-4357/acae92

Baryonic Effects on Lagrangian Clustering and Angular Momentum Reconstruction

Authors: Ming-Jie Sheng, Hao-Ran Yu, Sijia Li, Shihong Liao, Min Du, Yunchong Wang, Peng Wang, Kun Xu, Shy Genel, Dimitrios Irodotou

Abstract: Recent studies illustrate the correlation between the angular momenta of cosmic structures and their Lagrangian properties. However, only baryons are observable and it is unclear whether they reliably trace the cosmic angular momenta. We study the Lagrangian mass distribution, spin correlation, and predictability of dark matter, gas, and stellar components of galaxy-halo systems using IllustrisTNG, and show that the primordial segregations between components are typically small. Their protoshapes are also similar in terms of the statistics of moment of inertia tensors. Under the common gravitational potential they are expected to exert the same tidal torque and the strong spin correlations are not destroyed by the nonlinear evolution and complicated baryonic effects, as confirmed by the high-resolution hydrodynamic simulations. We further show that their late-time angular momenta traced by total gas, stars, or the central galaxies, can be reliably reconstructed by the initial perturbations. These results suggest that baryonic angular momenta can potentially be used in reconstructing the parameters and models related to the initial perturbations.

Submitted 4 February, 2023; v1 submitted 9 October, 2022; originally announced October 2022.

Comments: 8 pages, 5 figures, 1 table. Matches the accepted version in ApJ

Journal ref: The Astrophysical Journal, Volume 943, Number 2, 2023

arXiv:2209.13645 [pdf, other]

PearNet: A Pearson Correlation-based Graph Attention Network for Sleep Stage Recognition

Authors: Jianchao Lu, Yuzhe Tian, Shuang Wang, Michael Sheng, Xi Zheng

Abstract: Sleep stage recognition is crucial for assessing sleep and diagnosing chronic diseases. Deep learning models, such as Convolutional Neural Networks and Recurrent Neural Networks, are trained using grid data as input, making them not capable of learning relationships in non-Euclidean spaces. Graph-based deep models have been developed to address this issue when investigating the external relationship of electrode signals across different brain regions. However, the models cannot solve problems related to the internal relationships between segments of electrode signals within a specific brain region. In this study, we propose a Pearson correlation-based graph attention network, called PearNet, as a solution to this problem. Graph nodes are generated based on the spatial-temporal features extracted by a hierarchical feature extraction method, and then the graph structure is learned adaptively to build node connections. Based on our experiments on the Sleep-EDF-20 and Sleep-EDF-78 datasets, PearNet performs better than the state-of-the-art baselines.

Submitted 16 October, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

Showing 1–50 of 141 results for author: Sheng, M