-
Accurate and Efficient Fetal Birth Weight Estimation from 3D Ultrasound
Authors:
Jian Wang,
Qiongying Ni,
Hongkui Yu,
Ruixuan Yao,
Jinqiao Ying,
Bin Zhang,
Xingyi Yang,
Jin Peng,
Jiongquan Chen,
Junxuan Yu,
Wenlong Shi,
Chaoyu Chen,
Zhongnuo Yan,
Mingyuan Luo,
Gaocheng Cai,
Dong Ni,
Jing Lu,
Xin Yang
Abstract:
Accurate fetal birth weight (FBW) estimation is essential for optimizing delivery decisions and reducing perinatal mortality. However, clinical methods for FBW estimation are inefficient, operator-dependent, and challenging to apply in cases of complex fetal anatomy. Existing deep learning methods are based on 2D standard ultrasound (US) images or videos that lack spatial information, limiting the…
▽ More
Accurate fetal birth weight (FBW) estimation is essential for optimizing delivery decisions and reducing perinatal mortality. However, clinical methods for FBW estimation are inefficient, operator-dependent, and challenging to apply in cases of complex fetal anatomy. Existing deep learning methods are based on 2D standard ultrasound (US) images or videos that lack spatial information, limiting their prediction accuracy. In this study, we propose the first method for directly estimating FBW from 3D fetal US volumes. Our approach integrates a multi-scale feature fusion network (MFFN) and a synthetic sample-based learning framework (SSLF). The MFFN effectively extracts and fuses multi-scale features under sparse supervision by incorporating channel attention, spatial attention, and a ranking-based loss function. SSLF generates synthetic samples by simply combining fetal head and abdomen data from different fetuses, utilizing semi-supervised learning to improve prediction performance. Experimental results demonstrate that our method achieves superior performance, with a mean absolute error of $166.4\pm155.9$ $g$ and a mean absolute percentage error of $5.1\pm4.6$%, outperforming existing methods and approaching the accuracy of a senior doctor. Code is available at: https://github.com/Qioy-i/EFW.
△ Less
Submitted 30 June, 2025;
originally announced July 2025.
-
UltraTwin: Towards Cardiac Anatomical Twin Generation from Multi-view 2D Ultrasound
Authors:
Junxuan Yu,
Yaofei Duan,
Yuhao Huang,
Yu Wang,
Rongbo Ling,
Weihao Luo,
Ang Zhang,
Jingxian Xu,
Qiongying Ni,
Yongsong Zhou,
Binghan Li,
Haoran Dou,
Liping Liu,
Yanfen Chu,
Feng Geng,
Zhe Sheng,
Zhifeng Ding,
Dingxin Zhang,
Rui Huang,
Yuhang Zhang,
Xiaowei Xu,
Tao Tan,
Dong Ni,
Zhongshan Gou,
Xin Yang
Abstract:
Echocardiography is routine for cardiac examination. However, 2D ultrasound (US) struggles with accurate metric calculation and direct observation of 3D cardiac structures. Moreover, 3D US is limited by low resolution, small field of view and scarce availability in practice. Constructing the cardiac anatomical twin from 2D images is promising to provide precise treatment planning and clinical quan…
▽ More
Echocardiography is routine for cardiac examination. However, 2D ultrasound (US) struggles with accurate metric calculation and direct observation of 3D cardiac structures. Moreover, 3D US is limited by low resolution, small field of view and scarce availability in practice. Constructing the cardiac anatomical twin from 2D images is promising to provide precise treatment planning and clinical quantification. However, it remains challenging due to the rare paired data, complex structures, and US noises. In this study, we introduce a novel generative framework UltraTwin, to obtain cardiac anatomical twin from sparse multi-view 2D US. Our contribution is three-fold. First, pioneered the construction of a real-world and high-quality dataset containing strictly paired multi-view 2D US and CT, and pseudo-paired data. Second, we propose a coarse-to-fine scheme to achieve hierarchical reconstruction optimization. Last, we introduce an implicit autoencoder for topology-aware constraints. Extensive experiments show that UltraTwin reconstructs high-quality anatomical twins versus strong competitors. We believe it advances anatomical twin modeling for potential applications in personalized cardiac care.
△ Less
Submitted 29 June, 2025;
originally announced June 2025.
-
Semantic Communication Meets Heterogeneous Network: Emerging Trends, Opportunities, and Challenges
Authors:
Guhan Zheng,
Qiang Ni,
Aryan Kaushik,
Lixia Yang
Abstract:
Recent developments in machine learning (ML) techniques enable users to extract, transmit, and reproduce information semantics via ML-based semantic communication (SemCom). This significantly increases network spectral efficiency and transmission robustness. In the network, the semantic encoders and decoders among various users, based on ML, however, require collaborative updating according to new…
▽ More
Recent developments in machine learning (ML) techniques enable users to extract, transmit, and reproduce information semantics via ML-based semantic communication (SemCom). This significantly increases network spectral efficiency and transmission robustness. In the network, the semantic encoders and decoders among various users, based on ML, however, require collaborative updating according to new transmission tasks. The various heterogeneous characteristics of most networks in turn introduce emerging but unique challenges for semantic codec updating that are different from other general ML model updating. In this article, we first overview the key components of the SemCom system. We then discuss the unique challenges associated with semantic codec updates in heterogeneous networks. Accordingly, we point out a potential framework and discuss the pros and cons thereof. Finally, several future research directions are also discussed.
△ Less
Submitted 13 February, 2025;
originally announced February 2025.
-
STAR-RIS Enabled ISAC Systems: Joint Rate Splitting and Beamforming Optimization
Authors:
Yuan Liu,
Ruichen Zhang,
Ruihong Jiang,
Yongdong Zhu,
Huimin Hu,
Qiang Ni,
Zesong Fei,
Dusit Niyato
Abstract:
This paper delves into an integrated sensing and communication (ISAC) system bolstered by a simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS). Within this system, a base station (BS) is equipped with communication and radar capabilities, enabling it to communicate with ground terminals (GTs) and concurrently probe for echo signals from a target of interest. M…
▽ More
This paper delves into an integrated sensing and communication (ISAC) system bolstered by a simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS). Within this system, a base station (BS) is equipped with communication and radar capabilities, enabling it to communicate with ground terminals (GTs) and concurrently probe for echo signals from a target of interest. Moreover, to manage interference and improve communication quality, the rate splitting multiple access (RSMA) scheme is incorporated into the system. The signal-to-interference-plus-noise ratio (SINR) of the received sensing echo signals is a measure of sensing performance. We formulate a joint optimization problem of common rates, transmit beamforming at the BS, and passive beamforming vectors of the STAR-RIS. The objective is to maximize sensing SINR while guaranteeing the communication rate requirements for each GT. We present an iterative algorithm to address the non-convex problem by invoking Dinkelbach's transform, semidefinite relaxation (SDR), majorization-minimization, and sequential rank-one constraint relaxation (SROCR) theories. Simulation results manifest that the performance of the studied ISAC network enhanced by the STAR-RIS and RSMA surpasses other benchmarks considerably. The results evidently indicate the superior performance improvement of the ISAC system with the proposed RSMA-based transmission strategy design and the dynamic optimization of both transmission and reflection beamforming at STAR-RIS.
△ Less
Submitted 13 November, 2024;
originally announced November 2024.
-
End-to-end multi-channel speaker extraction and binaural speech synthesis
Authors:
Cheng Chi,
Xiaoyu Li,
Yuxuan Ke,
Qunping Ni,
Yao Ge,
Xiaodong Li,
Chengshi Zheng
Abstract:
Speech clarity and spatial audio immersion are the two most critical factors in enhancing remote conferencing experiences. Existing methods are often limited: either due to the lack of spatial information when using only one microphone, or because their performance is highly dependent on the accuracy of direction-of-arrival estimation when using microphone array. To overcome this issue, we introdu…
▽ More
Speech clarity and spatial audio immersion are the two most critical factors in enhancing remote conferencing experiences. Existing methods are often limited: either due to the lack of spatial information when using only one microphone, or because their performance is highly dependent on the accuracy of direction-of-arrival estimation when using microphone array. To overcome this issue, we introduce an end-to-end deep learning framework that has the capacity of mapping multi-channel noisy and reverberant signals to clean and spatialized binaural speech directly. This framework unifies source extraction, noise suppression, and binaural rendering into one network. In this framework, a novel magnitude-weighted interaural level difference loss function is proposed that aims to improve the accuracy of spatial rendering. Extensive evaluations show that our method outperforms established baselines in terms of both speech quality and spatial fidelity.
△ Less
Submitted 11 July, 2025; v1 submitted 8 October, 2024;
originally announced October 2024.
-
Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs
Authors:
Xianfu Chen,
Celimuge Wu,
Yi Shen,
Yusheng Ji,
Tsutomu Yoshinaga,
Qiang Ni,
Charilaos C. Zarakovitis,
Honggang Zhang
Abstract:
This article investigates a control system within the context of six-generation wireless networks. The control performance optimization confronts the technical challenges that arise from the intricate interactions between communication and control sub-systems, asking for a co-design. Accounting for the system dynamics, we formulate the sequential co-design decision-makings of communication and con…
▽ More
This article investigates a control system within the context of six-generation wireless networks. The control performance optimization confronts the technical challenges that arise from the intricate interactions between communication and control sub-systems, asking for a co-design. Accounting for the system dynamics, we formulate the sequential co-design decision-makings of communication and control over the discrete time horizon as a Markov decision process, for which a practical offline learning framework is proposed. Our proposed framework integrates large language models into the elements of reinforcement learning. We present a case study on the age of semantics-aware communication and control co-design to showcase the potentials from our proposed learning framework. Furthermore, we discuss the open issues remaining to make our proposed offline learning framework feasible for real-world implementations, and highlight the research directions for future explorations.
△ Less
Submitted 9 September, 2024; v1 submitted 6 July, 2024;
originally announced July 2024.
-
Design and Performance Analysis of Multi-scale NOMA for 5G Positioning
Authors:
Lu Yin,
Jiameng Cao,
Zhongliang Deng,
Qiang Ni,
Song Li,
Xinyu Zheng,
Hanhua Wang
Abstract:
This paper presents a feasibility study for a novel positioning-communication integrated signal called Multi-Scale Non-Orthogonal Multiple Access (MS-NOMA) for 5G positioning. One of the main differences between the MS-NOMA and the traditional positioning signal is MS-NOMA supports configurable powers for different positioning users (P-Users) to obtain better ranging accuracy and signal coverage.…
▽ More
This paper presents a feasibility study for a novel positioning-communication integrated signal called Multi-Scale Non-Orthogonal Multiple Access (MS-NOMA) for 5G positioning. One of the main differences between the MS-NOMA and the traditional positioning signal is MS-NOMA supports configurable powers for different positioning users (P-Users) to obtain better ranging accuracy and signal coverage. Our major contributions are: Firstly, we present the MS-NOMA signal and analyze the Bit Error Rate (BER) and ranging accuracy by deriving their simple expressions. The results show the interaction between the communication and positioning signals is rather limited, and it is feasible to use the MS-NOMA signal to achieve high positioning accuracy. Secondly, for an optimal positioning accuracy and signal coverage, we model the power allocation problem for MS-NOMA signal as a convex optimization problem by satisfying the QoS (Quality of Services) requirement and other constraints. Then, we propose a novel Positioning-Communication Joint Power Allocation (PCJPA) algorithm which allocates the powers of all P-Users iteratively. The theoretical and numerical results show our proposed MS-NOMA signal has great improvements of ranging/positioning accuracy than traditional PRS (Positioning Reference Signal) in 5G, and improves the coverage dramatically which means more P-Users could locate their positions without suffering the near-far effect.
△ Less
Submitted 9 October, 2019;
originally announced October 2019.