Skip to main content

Showing 1–32 of 32 results for author: Ge, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2503.06686  [pdf, other

    eess.IV cs.CV

    ImplicitCell: Resolution Cell Modeling of Joint Implicit Volume Reconstruction and Pose Refinement in Freehand 3D Ultrasound

    Authors: Sheng Song, Yiting Chen, Duo Xu, Songhan Ge, Yunqian Huang, Junni Shi, Man Chen, Hongbo Chen, Rui Zheng

    Abstract: Freehand 3D ultrasound enables volumetric imaging by tracking a conventional ultrasound probe during freehand scanning, offering enriched spatial information that improves clinical diagnosis. However, the quality of reconstructed volumes is often compromised by tracking system noise and irregular probe movements, leading to artifacts in the final reconstruction. To address these challenges, we pro… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

  2. Composite Nonlinear Trajectory Tracking Control of Co-Driving Vehicles Using Self-Triggered Adaptive Dynamic Programming

    Authors: Chuan Hu, Sicheng Ge, Yingkui Shi, Weinan Gao, Wenfeng Guo, Xi Zhang

    Abstract: This article presents a composite nonlinear feedback (CNF) control method using self-triggered (ST) adaptive dynamic programming (ADP) algorithm in a human-machine shared steering framework. For the overall system dynamics, a two-degrees-of-freedom (2-DOF) vehicle model is established and a two-point preview driver model is adopted. A dynamic authority allocation strategy based on cooperation leve… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: Accepted by IEEE Transactions on Consumer Electronics (12 pages)

  3. arXiv:2502.04837  [pdf

    cs.RO eess.SY

    Online Robot Motion Planning Methodology Guided by Group Social Proxemics Feature

    Authors: Xuan Mu, Xiaorui Liu, Shuai Guo, Wenzheng Chi, Wei Wang, Shuzhi Sam Ge

    Abstract: Nowadays robot is supposed to demonstrate human-like perception, reasoning and behavior pattern in social or service application. However, most of the existing motion planning methods are incompatible with above requirement. A potential reason is that the existing navigation algorithms usually intend to treat people as another kind of obstacle, and hardly take the social principle or awareness int… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: 14 pages,14 figures

  4. arXiv:2501.00064  [pdf, other

    cs.SD cs.LG eess.AS

    Lungmix: A Mixup-Based Strategy for Generalization in Respiratory Sound Classification

    Authors: Shijia Ge, Weixiang Zhang, Shuzhao Xie, Baixu Yan, Zhi Wang

    Abstract: Respiratory sound classification plays a pivotal role in diagnosing respiratory diseases. While deep learning models have shown success with various respiratory sound datasets, our experiments indicate that models trained on one dataset often fail to generalize effectively to others, mainly due to data collection and annotation \emph{inconsistencies}. To address this limitation, we introduce \emph… ▽ More

    Submitted 29 December, 2024; originally announced January 2025.

    Comments: 4pages, 3 figures, conference paper

  5. arXiv:2405.07478  [pdf, other

    eess.SY

    Coded Event-triggered Control for Nonlinear Systems

    Authors: Ruihang Ji, Shuzhi Sam Ge, Kai Zhao

    Abstract: This paper studies a Coded Event-triggered Control (CEC) for a class of nonlinear systems under any initial condition. To reduce communication burden, the CEC is designed from the encoding-decoding viewpoint by which only $m$-length string is transmitted for each communication between CEC and actuator. If a more general Entry Capture Problem is encountered, such control design will be rather compl… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2312.12066  [pdf, other

    eess.IV

    Automatic bony structure segmentation and curvature estimation on ultrasound cervical spine images -- a feasibility study

    Authors: Songhan Ge, Haoyuan Tian, Wei Zhang, Rui Zheng

    Abstract: The loss of cervical lordosis is a common degenerative disorder known to be associated with abnormal spinal alignment. In recent years, ultrasound (US) imaging has been widely applied in the assessment of spine deformity and has shown promising results. The objectives of this study are to automatically segment bony structures from the 3D US cervical spine image volume and to assess the cervical lo… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  7. arXiv:2308.05005  [pdf, other

    eess.SP cs.CV

    Deep Learning Model Transfer in Forest Mapping using Multi-source Satellite SAR and Optical Images

    Authors: Shaojia Ge, Oleg Antropov, Tuomas Häme, Ronald E. McRoberts, Jukka Miettinen

    Abstract: Deep learning (DL) models are gaining popularity in forest variable prediction using Earth Observation images. However, in practical forest inventories, reference datasets are often represented by plot- or stand-level measurements, while high-quality representative wall-to-wall reference data for end-to-end training of DL models are rarely available. Transfer learning facilitates expansion of the… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  8. arXiv:2308.03137  [pdf, other

    eess.SP

    Digital Self-Interference Cancellation With Robust Multi-layered Total Least Mean Squares Adaptive Filters

    Authors: Shiyu Song, Yanqun Tang, Xizhang Wei, Yu Zhou, Xianjie Lu, Zhengpeng Wang, Songhu Ge

    Abstract: In simultaneous transmit and receive (STAR) wireless communications, digital self-interference (SI) cancellation is required before estimating the remote transmission (RT) channel. Considering the inherent connection between SI channel reconstruction and RT channel estimation, we propose a multi-layered M-estimate total least mean squares (m-MTLS) joint estimator to estimate both channels. In each… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

  9. arXiv:2303.17210  [pdf, other

    cs.CR cs.NI eess.SY

    DecentRAN: Decentralized Radio Access Network for 5.5G and beyond

    Authors: Hao Xu, Xun Liu, Qinghai Zeng, Qiang Li, Shibin Ge, Guohua Zhou, Raymond Forbes

    Abstract: Radio Access Network faces challenges from privacy and flexible wide area and local area network access. RAN is limited from providing local service directly due to centralized design of cellular network and concerns of user privacy and data security. DecentRAN or Decentralized Radio Access Network offers an alternative perspective to cope with the emerging demands of 5G Non-public Network and the… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

  10. arXiv:2303.02456  [pdf, other

    cs.RO eess.SY

    Fixed-time Adaptive Neural Control for Physical Human-Robot Collaboration with Time-Varying Workspace Constraints

    Authors: Yuzhu Sun, Mien Van, Stephen McIlvanna, Nguyen Minh Nhat, Sean McLoone, Dariusz Ceglarek, Shuzhi Sam Ge

    Abstract: Physical human-robot collaboration (pHRC) requires both compliance and safety guarantees since robots coordinate with human actions in a shared workspace. This paper presents a novel fixed-time adaptive neural control methodology for handling time-varying workspace constraints that occur in physical human-robot collaboration while also guaranteeing compliance during intended force interactions. Th… ▽ More

    Submitted 26 April, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

  11. arXiv:2302.01537  [pdf, other

    math.OC eess.SY

    Gradient and Variable Tracking with Multiple Local SGD for Decentralized Non-Convex Learning

    Authors: Songyang Ge, Tsung-Hui Chang

    Abstract: Stochastic distributed optimization methods that solve an optimization problem over a multi-agent network have played an important role in a variety of large-scale signal processing and machine leaning applications. Among the existing methods, the gradient tracking (GT) method is found robust against the variance between agents' local data distribution, in contrast to the distributed stochastic gr… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 46 pages, 6 figures

  12. arXiv:2212.14747  [pdf, other

    eess.IV cs.CV

    VertMatch: A Semi-supervised Framework for Vertebral Structure Detection in 3D Ultrasound Volume

    Authors: Hongye Zeng, kang Zhou, Songhan Ge, Yuchong Gao, Jianhao Zhao, Shenghua Gao, Rui Zheng

    Abstract: Three-dimensional (3D) ultrasound imaging technique has been applied for scoliosis assessment, but current assessment method only uses coronal projection image and cannot illustrate the 3D deformity and vertebra rotation. The vertebra detection is essential to reveal 3D spine information, but the detection task is challenging due to complex data and limited annotations. We propose VertMatch, a two… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: 15 pages, 8 figures

  13. A Novel Semisupervised Contrastive Regression Framework for Forest Inventory Mapping with Multisensor Satellite Data

    Authors: Shaojia Ge, Hong Gu, Weimin Su, Anne Lönnqvist, Oleg Antropov

    Abstract: Accurate mapping of forests is critical for forest management and carbon stocks monitoring. Deep learning is becoming more popular in Earth Observation (EO), however, the availability of reference data limits its potential in wide-area forest mapping. To overcome those limitations, here we introduce contrastive regression into EO based forest mapping and develop a novel semisupervised regression f… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  14. arXiv:2211.13229  [pdf, other

    eess.IV cs.CL cs.CV cs.LG

    DeltaNet:Conditional Medical Report Generation for COVID-19 Diagnosis

    Authors: Xian Wu, Shuxin Yang, Zhaopeng Qiu, Shen Ge, Yangtian Yan, Xingwang Wu, Yefeng Zheng, S. Kevin Zhou, Li Xiao

    Abstract: Fast screening and diagnosis are critical in COVID-19 patient treatment. In addition to the gold standard RT-PCR, radiological imaging like X-ray and CT also works as an important means in patient screening and follow-up. However, due to the excessive number of patients, writing reports becomes a heavy burden for radiologists. To reduce the workload of radiologists, we propose DeltaNet to generate… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

  15. arXiv:2208.08607   

    math.OC eess.SY

    Event-triggered Finite-time Control Using Inverse-optimal Implicit Lyapunov Function

    Authors: Peng Wang, Shuzhi Sam Ge, Xiaobing Zhang

    Abstract: This work deals with the event-triggered finite-time control for high-order systems based on an implicit Lyapunov function (ILF). With the construction of an inverse optimal problem, a novel expression of ILF is obtained. By designing the event-triggering mechanism elaborately, it is guaranteed that the trivial solution of the closed-loop system is globally finite-time stable and there exists no Z… ▽ More

    Submitted 9 November, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: To be revised and corrected

  16. arXiv:2205.04674  [pdf, other

    eess.SY

    Balanced control between performance and saturation for constrained nonlinear systems

    Authors: Peng Wang, Haibin Wang, Shuzhi Sam Ge, Xiaobing Zhang

    Abstract: This paper addresses the balanced control between performance and saturation for a class of constrained nonlinear systems, including the branches: balanced command filtered backstepping (BCFB) and balanced performance control (BPC). To balance the interconnection and conflict between performance and saturation constraints, define a performance safety evaluation (PSE) function, which evaluates the… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: 9 pages, 7 figures

  17. arXiv:2204.14272  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    End-to-end Spoken Conversational Question Answering: Task, Dataset and Model

    Authors: Chenyu You, Nuo Chen, Fenglin Liu, Shen Ge, Xian Wu, Yuexian Zou

    Abstract: In spoken question answering, the systems are designed to answer questions from contiguous text spans within the related speech transcripts. However, the most natural way that human seek or test their knowledge is via human conversations. Therefore, we propose a new Spoken Conversational Question Answering task (SCQA), aiming at enabling the systems to model complex dialogue flows given the speech… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: In Findings of NAACL 2022. arXiv admin note: substantial text overlap with arXiv:2010.08923

  18. arXiv:2204.10513  [pdf

    eess.IV cs.CV

    MIPR:Automatic Annotation of Medical Images with Pixel Rearrangement

    Authors: Pingping Dai, Haiming Zhu, Shuang Ge, Ruihan Zhang, Xiang Qian, Xi Li, Kehong Yuan

    Abstract: Most of the state-of-the-art semantic segmentation reported in recent years is based on fully supervised deep learning in the medical domain. How?ever, the high-quality annotated datasets require intense labor and domain knowledge, consuming enormous time and cost. Previous works that adopt semi?supervised and unsupervised learning are proposed to address the lack of anno?tated data through assist… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

  19. arXiv:2203.10095  [pdf, other

    eess.IV cs.CV

    AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation

    Authors: Di You, Fenglin Liu, Shen Ge, Xiaoxia Xie, Jing Zhang, Xian Wu

    Abstract: Recently, medical report generation, which aims to automatically generate a long and coherent descriptive paragraph of a given medical image, has received growing research interests. Different from the general image captioning tasks, medical report generation is more challenging for data-driven neural models. This is mainly due to 1) the serious data bias: the normal visual regions dominate the da… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted by MICCAI 2021 (the 24th International Conference on Medical Image Computing and Computer Assisted Intervention)

  20. arXiv:2112.15011  [pdf, other

    eess.IV cs.CL cs.CV

    Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment

    Authors: Shuxin Yang, Xian Wu, Shen Ge, S. Kevin Zhou, Li Xiao

    Abstract: In clinics, a radiology report is crucial for guiding a patient's treatment. However, writing radiology reports is a heavy burden for radiologists. To this end, we present an automatic, multi-modal approach for report generation from a chest x-ray. Our approach, motivated by the observation that the descriptions in radiology reports are highly correlated with specific information of the x-ray imag… ▽ More

    Submitted 1 June, 2022; v1 submitted 30 December, 2021; originally announced December 2021.

  21. arXiv:2112.15009  [pdf, ps, other

    eess.IV cs.CL cs.CV

    Knowledge Matters: Radiology Report Generation with General and Specific Knowledge

    Authors: Shuxin Yang, Xian Wu, Shen Ge, Shaohua Kevin Zhou, Li Xiao

    Abstract: Automatic radiology report generation is critical in clinics which can relieve experienced radiologists from the heavy workload and remind inexperienced radiologists of misdiagnosis or missed diagnose. Existing approaches mainly formulate radiology report generation as an image captioning task and adopt the encoder-decoder framework. However, in the medical domain, such pure data-driven approaches… ▽ More

    Submitted 6 November, 2022; v1 submitted 30 December, 2021; originally announced December 2021.

    Comments: Medical Image Analysis

  22. arXiv:2107.13431  [pdf

    eess.IV cs.CV

    AI assisted method for efficiently generating breast ultrasound screening reports

    Authors: Shuang Ge, Qiongyu Ye, Wenquan Xie, Desheng Sun, Huabin Zhang, Xiaobo Zhou, Kehong Yuan

    Abstract: Background: Ultrasound is one of the preferred choices for early screening of dense breast cancer. Clinically, doctors have to manually write the screening report which is time-consuming and laborious, and it is easy to miss and miswrite. Aim: We proposed a new pipeline to automatically generate AI breast ultrasound screening reports based on ultrasound images, aiming to assist doctors in improvin… ▽ More

    Submitted 22 May, 2022; v1 submitted 28 July, 2021; originally announced July 2021.

  23. arXiv:2105.03847  [pdf

    eess.IV cs.CV

    Automatic segmentation of vertebral features on ultrasound spine images using Stacked Hourglass Network

    Authors: Hong-Ye Zeng, Song-Han Ge, Yu-Chong Gao, De-Sen Zhou, Kang Zhou, Xu-Ming He, Edmond Lou, Rui Zheng

    Abstract: Objective: The spinous process angle (SPA) is one of the essential parameters to denote three-dimensional (3-D) deformity of spine. We propose an automatic segmentation method based on Stacked Hourglass Network (SHN) to detect the spinous processes (SP) on ultrasound (US) spine images and to measure the SPAs of clinical scoliotic subjects. Methods: The network was trained to detect vertebral SP an… ▽ More

    Submitted 23 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

    Comments: 9 pages,5 figures

  24. arXiv:2103.05378  [pdf, other

    math.OC eess.SY

    Decentralized Non-Convex Learning with Linearly Coupled Constraints

    Authors: Jiawei Zhang, Songyang Ge, Tsung-Hui Chang, Zhi-Quan Luo

    Abstract: Motivated by the need for decentralized learning, this paper aims at designing a distributed algorithm for solving nonconvex problems with general linear constraints over a multi-agent network. In the considered problem, each agent owns some local information and a local variable for jointly minimizing a cost function, but local variables are coupled by linear constraints. Most of the existing met… ▽ More

    Submitted 22 June, 2022; v1 submitted 9 March, 2021; originally announced March 2021.

  25. arXiv:2012.15432  [pdf

    cs.CV cs.LG eess.IV

    SharpGAN: Receptive Field Block Net for Dynamic Scene Deblurring

    Authors: Hui Feng, Jundong Guo, Sam Shuzhi Ge

    Abstract: When sailing at sea, the smart ship will inevitably produce swaying motion due to the action of wind, wave and current, which makes the image collected by the visual sensor appear motion blur. This will have an adverse effect on the object detection algorithm based on the vision sensor, thereby affect the navigation safety of the smart ship. In order to remove the motion blur in the images during… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

    Comments: 15 pages, 6 figures

    ACM Class: I.2.10

  26. arXiv:2006.07907  [pdf, other

    eess.SY math.OC

    Trajectory Generation by Chance Constrained Nonlinear MPC with Probabilistic Prediction

    Authors: Xiaoxue Zhang, Jun Ma, Zilong Cheng, Sunan Huang, Shuzhi Sam Ge, Tong Heng Lee

    Abstract: Continued great efforts have been dedicated towards high-quality trajectory generation based on optimization methods, however, most of them do not suitably and effectively consider the situation with moving obstacles; and more particularly, the future position of these moving obstacles in the presence of uncertainty within some possible prescribed prediction horizon. To cater to this rather major… ▽ More

    Submitted 4 August, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: 13 pages, 13 figures

  27. Adaptive Feedforward Neural Network Control with an Optimized Hidden Node Distribution

    Authors: Qiong Liu, Dongyu Li, Shuzhi Sam Ge, Zhong Ouyang

    Abstract: Composite adaptive radial basis function neural network (RBFNN) control with a lattice distribution of hidden nodes has three inherent demerits: 1) the approximation domain of adaptive RBFNNs is difficult to be determined a priori; 2) only a partial persistence of excitation (PE) condition can be guaranteed; and 3) in general, the required number of hidden nodes of RBFNNs is enormous. This paper p… ▽ More

    Submitted 22 April, 2021; v1 submitted 23 May, 2020; originally announced May 2020.

    Comments: 12 pages, 7 figures This paper is submitted to "IEEE Transactions on Artificial Intelligence"

  28. arXiv:2005.05083  [pdf, other

    cs.LG cs.DC eess.SP

    A Federated Learning Framework for Healthcare IoT devices

    Authors: Binhang Yuan, Song Ge, Wenhui Xing

    Abstract: The Internet of Things (IoT) revolution has shown potential to give rise to many medical applications with access to large volumes of healthcare data collected by IoT devices. However, the increasing demand for healthcare data privacy and security makes each IoT device an isolated island of data. Further, the limited computation and communication capacity of wearable healthcare devices restrict th… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

  29. arXiv:1912.11221  [pdf, ps, other

    cs.IT eess.SP

    FDD Massive MIMO Uplink and Downlink Channel Reciprocity Properties: Full or Partial Reciprocity?

    Authors: Zhimeng Zhong, Li Fan, Shibin Ge

    Abstract: One challenge for FDD massive MIMO communication system is how to obtain the downlink channel state information (CSI) at the base station. Except for traditional codebook feedback through uplink pilot transmission, some channel reciprocity properties can be utilized through uplink channel estimation and channel parameter estimation algorithms. In this paper, the uplink and downlink channel recipro… ▽ More

    Submitted 30 December, 2019; v1 submitted 24 December, 2019; originally announced December 2019.

  30. arXiv:1909.13265  [pdf, ps, other

    cs.NE eess.SY

    Adaptive Control for Marine Vessels Against Harsh Environmental Variation

    Authors: Fangwen Tu, Shuzhi Sam Ge, Yoo Sang Choo, Chang Chieh Hang

    Abstract: In this paper, robust control with sea state observer and dynamic thrust allocation is proposed for the Dynamic Positioning (DP) of an accommodation vessel in the presence of unknown hydrodynamic force variation and the input time delay. In order to overcome the huge force variation due to the adjoining Floating Production Storage and Offloading (FPSO) and accommodation vessel, a novel sea state o… ▽ More

    Submitted 29 September, 2019; originally announced September 2019.

  31. arXiv:1908.07590  [pdf, other

    cs.IR cs.CL cs.SD eess.AS

    From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories

    Authors: Songwei Ge, Curtis Xuan, Ruihua Song, Chao Zou, Wei Liu, Jin Zhou

    Abstract: Sound effects play an essential role in producing high-quality radio stories but require enormous labor cost to add. In this paper, we address the problem of automatically adding sound effects to radio stories with a retrieval-based model. However, directly implementing a tag-based retrieval model leads to high false positives due to the ambiguity of story contents. To solve this problem, we intro… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: In the Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2019)

  32. A Scalable Framework for Multilevel Streaming Data Analytics using Deep Learning

    Authors: Shihao Ge, Haruna Isah, Farhana Zulkernine, Shahzad Khan

    Abstract: The rapid growth of data in velocity, volume, value, variety, and veracity has enabled exciting new opportunities and presented big challenges for businesses of all types. Recently, there has been considerable interest in developing systems for processing continuous data streams with the increasing need for real-time analytics for decision support in the business, healthcare, manufacturing, and se… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.