Showing 1–43 of 43 results for author: Bai, X

Searching in archive eess.
  1. arXiv:2507.01161  [pdf, ps, other]

    eess.SY cs.RO

    Imitation Learning for Satellite Attitude Control under Unknown Perturbations

    Authors: Zhizhuo Zhang, Hao Peng, Xiaoli Bai

    Abstract: This paper presents a novel satellite attitude control framework that integrates Soft Actor-Critic (SAC) reinforcement learning with Generative Adversarial Imitation Learning (GAIL) to achieve robust performance under various unknown perturbations. Traditional control techniques often rely on precise system models and are sensitive to parameter uncertainties and external perturbations. To overcome…

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: 2025 AAS/AIAA Astrodynamics Specialist Conference

  2. arXiv:2505.11366  [pdf, ps, other]

    cs.RO cs.HC cs.LG eess.SY

    Learning Multimodal AI Algorithms for Amplifying Limited User Input into High-dimensional Control Space

    Authors: Ali Rabiee, Sima Ghafoori, MH Farhadi, Robert Beyer, Xiangyu Bai, David J Lin, Sarah Ostadabbas, Reza Abiri

    Abstract: Current invasive assistive technologies are designed to infer high-dimensional motor control signals from severely paralyzed patients. However, they face significant challenges, including public acceptance, limited longevity, and barriers to commercialization. Meanwhile, noninvasive alternatives often rely on artifact-prone signals, require lengthy user training, and struggle to deliver robust hig…

    Submitted 16 May, 2025; originally announced May 2025.

  3. arXiv:2505.04380  [pdf, other]

    eess.IV cs.CV cs.IR

    Tetrahedron-Net for Medical Image Registration

    Authors: Jinhai Xiang, Shuai Guo, Qianru Han, Dantong Shi, Xinwei He, Xiang Bai

    Abstract: Medical image registration plays a vital role in medical image processing. Extracting expressive representations for medical images is crucial for improving the registration quality. One common practice for this end is constructing a convolutional backbone to enable interactions with skip connections among feature extraction layers. The de facto structure, U-Net-like networks, has attempted to des…

    Submitted 7 May, 2025; originally announced May 2025.

  4. arXiv:2503.10211  [pdf, other]

    cs.CL cs.SD eess.AS

    Adaptive Inner Speech-Text Alignment for LLM-based Speech Translation

    Authors: Henglyu Liu, Andong Chen, Kehai Chen, Xuefeng Bai, Meizhi Zhong, Yuan Qiu, Min Zhang

    Abstract: Recent advancement of large language models (LLMs) has led to significant breakthroughs across various tasks, laying the foundation for the development of LLM-based speech translation systems. Existing methods primarily focus on aligning inputs and outputs across modalities while overlooking deeper semantic alignment within model representations. To address this limitation, we propose an Adaptive…

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 12 pages, 7 figures

  5. arXiv:2501.15588  [pdf, other]

    eess.IV cs.CV

    Tumor Detection, Segmentation and Classification Challenge on Automated 3D Breast Ultrasound: The TDSC-ABUS Challenge

    Authors: Gongning Luo, Mingwang Xu, Hongyu Chen, Xinjie Liang, Xing Tao, Dong Ni, Hyunsu Jeong, Chulhong Kim, Raphael Stock, Michael Baumgartner, Yannick Kirchhoff, Maximilian Rokuss, Klaus Maier-Hein, Zhikai Yang, Tianyu Fan, Nicolas Boutry, Dmitry Tereshchenko, Arthur Moine, Maximilien Charmetant, Jan Sauer, Hao Du, Xiang-Hui Bai, Vipul Pai Raikar, Ricardo Montoya-del-Angel, Robert Marti, et al. (12 additional authors not shown)

    Abstract: Breast cancer is one of the most common causes of death among women worldwide. Early detection helps in reducing the number of deaths. Automated 3D Breast Ultrasound (ABUS) is a newer approach for breast screening, which has many advantages over handheld mammography such as safety, speed, and higher detection rate of breast cancer. Tumor detection, segmentation, and classification are key componen…

    Submitted 26 January, 2025; originally announced January 2025.

  6. arXiv:2412.17943  [pdf, other]

    eess.IV

    Optimizing Prompt Strategies for SAM: Advancing lesion Segmentation Across Diverse Medical Imaging Modalities

    Authors: Yuli Wang, Victoria Shi, Wen-Chi Hsu, Yuwei Dai, Sophie Yao, Zhusi Zhong, Zishu Zhang, Jing Wu, Aaron Maxwell, Scott Collins, Zhicheng Jiao, Harrison X. Bai

    Abstract: Purpose: To evaluate various Segment Anything Model (SAM) prompt strategies across four lesion datasets and to subsequently develop a reinforcement learning (RL) agent to optimize SAM prompt placement. Materials and Methods: This retrospective study included patients with four independent ovarian, lung, renal, and breast tumor datasets. Manual segmentation and SAM-assisted segmentation were per…

    Submitted 28 December, 2024; v1 submitted 23 December, 2024; originally announced December 2024.

  7. arXiv:2412.04822  [pdf]

    eess.SP

    Space-Time-Modulated Wideband Radiation-Type Programmable Metasurface for Low Sidelobe Beamforming

    Authors: Xudong Bai, Longpan Wang, Yuhua Chen, Xilong Lu, Fuli Zhang, Jingfeng Chen, Wen Chen, He-Xiu Xu

    Abstract: Programmable metasurfaces promise a great potential to construct low-cost phased array systems due to the capability of elaborate modulation over electromagnetic (EM) waves. However, they are in either reflective or transmissive mode, and usually possess a relatively high profile as a result of the external feed source. Besides, it is difficult to conduct multibit phase shift in metasurfaces, when…

    Submitted 6 December, 2024; originally announced December 2024.

  8. arXiv:2410.17576  [pdf, other]

    cs.RO cs.AI eess.SY

    Real-time Vehicle-to-Vehicle Communication Based Network Cooperative Control System through Distributed Database and Multimodal Perception: Demonstrated in Crossroads

    Authors: Xinwen Zhu, Zihao Li, Yuxuan Jiang, Jiazhen Xu, Jie Wang, Xuyang Bai

    Abstract: The autonomous driving industry is rapidly advancing, with Vehicle-to-Vehicle (V2V) communication systems highlighting as a key component of enhanced road safety and traffic efficiency. This paper introduces a novel Real-time Vehicle-to-Vehicle Communication Based Network Cooperative Control System (VVCCS), designed to revolutionize macro-scope traffic planning and collision avoidance in autonomou…

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: ICICT 2024, 18 pages

  9. arXiv:2409.18429  [pdf, other]

    cs.IT eess.SP

    Joint Optimization of Data- and Model-Driven Probing Beams and Beam Predictor

    Authors: Tianheng Lu, Fan Meng, Zhilei Zhang, Yongming Huang, Cheng Zhang, Xiaoyu Bai

    Abstract: Hierarchical search in millimeter-wave (mmWave) communications incurs significant beam training overhead and delay, especially in a dynamic environment. Deep learning-enabled beam prediction is promising to significantly mitigate the overhead and delay, efficiently utilizing the site-specific channel prior. In this work, we propose to jointly optimize a data- and model-driven probe beam module and…

    Submitted 26 September, 2024; originally announced September 2024.

  10. arXiv:2409.12470  [pdf, other]

    cs.CV eess.IV

    HSIGene: A Foundation Model For Hyperspectral Image Generation

    Authors: Li Pang, Xiangyong Cao, Datao Tang, Shuang Xu, Xueru Bai, Feng Zhou, Deyu Meng

    Abstract: Hyperspectral image (HSI) plays a vital role in various fields such as agriculture and environmental monitoring. However, due to the expensive acquisition cost, the number of hyperspectral images is limited, degenerating the performance of downstream tasks. Although some recent studies have attempted to employ diffusion models to synthesize HSIs, they still struggle with the scarcity of HSIs, affe…

    Submitted 1 November, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

  11. arXiv:2409.07226  [pdf, other]

    cs.SD eess.AS

    Muskits-ESPnet: A Comprehensive Toolkit for Singing Voice Synthesis in New Paradigm

    Authors: Yuning Wu, Jiatong Shi, Yifeng Yu, Yuxun Tang, Tao Qian, Yueqian Lin, Jionghao Han, Xinyi Bai, Shinji Watanabe, Qin Jin

    Abstract: This research presents Muskits-ESPnet, a versatile toolkit that introduces new paradigms to Singing Voice Synthesis (SVS) through the application of pretrained audio models in both continuous and discrete approaches. Specifically, we explore discrete representations derived from SSL models and audio codecs and offer significant advantages in versatility and intelligence, supporting multi-format in…

    Submitted 10 October, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

    Comments: Accepted by ACMMM 2024 demo track

  12. arXiv:2408.13483  [pdf, other]

    eess.SP cs.IT

    Transmissive RIS Enabled Transceiver Systems: Architecture, Design Issues and Opportunities

    Authors: Zhendong Li, Wen Chen, Qingqing Wu, Ziwei Liu, Chong He, Xudong Bai, Jun Li

    Abstract: Reconfigurable intelligent surface (RIS) is anticipated to augment the performance of beyond fifth-generation (B5G) and sixth-generation (6G) networks by intelligently manipulating the state of its components. Rather than employing reflective RIS for aided communications, this paper proposes an innovative transmissive RIS-enabled transceiver (TRTC) architecture that can accomplish the functions of…

    Submitted 24 August, 2024; originally announced August 2024.

    Journal ref: IEEE VTM, 2024

  13. arXiv:2408.08101  [pdf]

    eess.SY

    Stochastic Real-Time Economic Dispatch for Integrated Electric and Gas Systems Considering Uncertainty Propagation and Pipeline Leakage

    Authors: Peiyao Zhao, Zhengshuo Li, Jiahui Zhang, Xiang Bai, Jia Su

    Abstract: Gas-fired units (GFUs) with rapid regulation capabilities are considered an effective tool to mitigate fluctuations in the generation of renewable energy sources and have coupled electricity power systems (EPSs) and natural gas systems (NGSs) more tightly. However, this tight coupling leads to uncertainty propagation, a challenge for the real-time dispatch of such integrated electric and gas syste…

    Submitted 15 August, 2024; originally announced August 2024.

  14. arXiv:2407.16921  [pdf, other]

    cs.CV eess.IV

    SAR to Optical Image Translation with Color Supervised Diffusion Model

    Authors: Xinyu Bai, Feng Xu

    Abstract: Synthetic Aperture Radar (SAR) offers all-weather, high-resolution imaging capabilities, but its complex imaging mechanism often poses challenges for interpretation. In response to these limitations, this paper introduces an innovative generative model designed to transform SAR images into more intelligible optical images, thereby enhancing the interpretability of SAR images. Specifically, our mod…

    Submitted 23 July, 2024; originally announced July 2024.

  15. arXiv:2407.06095  [pdf, other]

    cs.CV eess.IV

    Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation

    Authors: Xinyu Bai, Feng Xu

    Abstract: Synthetic Aperture Radar (SAR) provides all-weather, high-resolution imaging capabilities, but its unique imaging mechanism often requires expert interpretation, limiting its widespread applicability. Translating SAR images into more easily recognizable optical images using diffusion models helps address this challenge. However, diffusion models suffer from high latency due to numerous iterative i…

    Submitted 8 July, 2024; originally announced July 2024.

  16. arXiv:2407.04888  [pdf, other]

    eess.IV cs.CV

    Unraveling Radiomics Complexity: Strategies for Optimal Simplicity in Predictive Modeling

    Authors: Mahdi Ait Lhaj Loutfi, Teodora Boblea Podasca, Alex Zwanenburg, Taman Upadhaya, Jorge Barrios, David R. Raleigh, William C. Chen, Dante P. I. Capaldi, Hong Zheng, Olivier Gevaert, Jing Wu, Alvin C. Silva, Paul J. Zhang, Harrison X. Bai, Jan Seuntjens, Steffen Löck, Patrick O. Richard, Olivier Morin, Caroline Reinhold, Martin Lepage, Martin Vallières

    Abstract: Background: The high dimensionality of radiomic feature sets, the variability in radiomic feature types and potentially high computational requirements all underscore the need for an effective method to identify the smallest set of predictive features for a given clinical problem. Purpose: Develop a methodology and tools to identify and explain the smallest set of predictive radiomic features. Mat…

    Submitted 5 July, 2024; originally announced July 2024.

  17. arXiv:2404.15584  [pdf]

    eess.SY

    Research on OPF control of three-phase four-wire low-voltage distribution network considering uncertainty

    Authors: Rui Wang, Xiaoqing Bai, Shengquan Huang, Shoupu Wei

    Abstract: As power systems become more complex and uncertain, low-voltage distribution networks face numerous challenges, including three-phase imbalances caused by asymmetrical loads and distributed energy resources. We propose a robust stochastic optimization (RSO) based optimal power flow (OPF) control method for three-phase, four-wire low-voltage distribution networks that consider uncertainty to addres…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: systems optimization, robust optimization, local control

  18. arXiv:2401.17619  [pdf, ps, other]

    cs.SD eess.AS

    Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and ACE-KiSing

    Authors: Jiatong Shi, Yueqian Lin, Xinyi Bai, Keyi Zhang, Yuning Wu, Yuxun Tang, Yifeng Yu, Qin Jin, Shinji Watanabe

    Abstract: In singing voice synthesis (SVS), generating singing voices from musical scores faces challenges due to limited data availability. This study proposes a unique strategy to address the data scarcity in SVS. We employ an existing singing voice synthesizer for data augmentation, complemented by detailed manual tuning, an approach not previously explored in data curation, to reduce instances of unnatu…

    Submitted 12 June, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: Accepted by Interspeech2024

  19. arXiv:2401.01609  [pdf, other]

    cs.IT eess.SP

    Entropy-based Probing Beam Selection and Beam Prediction via Deep Learning

    Authors: Fan Meng, Cheng Zhang, Yongming Huang, Zhilei Zhang, Xiaoyu Bai, Zhaohua Lu

    Abstract: Hierarchical beam search in mmWave communications incurs substantial training overhead, necessitating deep learning-enabled beam predictions to effectively leverage channel priors and mitigate this overhead. In this study, we introduce a comprehensive probabilistic model of power distribution in beamspace, and formulate the joint optimization problem of probing beam selection and probabilistic bea…

    Submitted 3 January, 2024; originally announced January 2024.

  20. arXiv:2311.08190  [pdf, other]

    eess.IV cs.CV cs.LG

    SAMIHS: Adaptation of Segment Anything Model for Intracranial Hemorrhage Segmentation

    Authors: Yinuo Wang, Kai Chen, Weimin Yuan, Cai Meng, XiangZhi Bai

    Abstract: Segment Anything Model (SAM), a vision foundation model trained on large-scale annotations, has recently continued raising awareness within medical image segmentation. Despite the impressive capabilities of SAM on natural scenes, it struggles with performance decline when confronted with medical images, especially those involving blurry boundaries and highly irregular regions of low contrast. In t…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures, 2 tables

  21. arXiv:2310.12407  [pdf, other]

    cs.LG cs.AI eess.SY

    Classification-Aided Robust Multiple Target Tracking Using Neural Enhanced Message Passing

    Authors: Xianglong Bai, Zengfu Wang, Quan Pan, Tao Yun, Hua Lan

    Abstract: We address the challenge of tracking an unknown number of targets in strong clutter environments using measurements from a radar sensor. Leveraging the range-Doppler spectra information, we identify the measurement classes, which serve as additional information to enhance clutter rejection and data association, thus bolstering the robustness of target tracking. We first introduce a novel neural en…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 15 pages

  22. arXiv:2309.13404  [pdf, other]

    eess.IV cs.CV

    Weakly Supervised YOLO Network for Surgical Instrument Localization in Endoscopic Videos

    Authors: Rongfeng Wei, Jinlin Wu, Xuexue Bai, Ming Feng, Zhen Lei, Hongbin Liu, Zhen Chen

    Abstract: In minimally invasive surgery, surgical instrument localization is a crucial task for endoscopic videos, which enables various applications for improving surgical outcomes. However, annotating the instrument localization in endoscopic videos is tedious and labor-intensive. In contrast, obtaining the category information is easy and efficient in real-world applications. To fully utilize the categor…

    Submitted 20 June, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted by ICRA 2024 Workshop on C4 Surgical Robotic Systems in the Embodied AI Era; Surgical Tool Localization in Endoscopic Videos Challenge of MICCAI2023

  23. arXiv:2308.08465  [pdf, other]

    eess.IV cs.CV cs.LG

    Hierarchical Uncertainty Estimation for Medical Image Segmentation Networks

    Authors: Xinyu Bai, Wenjia Bai

    Abstract: Learning a medical image segmentation model is an inherently ambiguous task, as uncertainties exist in both images (noise) and manual annotations (human errors and bias) used for model training. To build a trustworthy image segmentation model, it is important to not just evaluate its performance but also estimate the uncertainty of the model prediction. Most state-of-the-art image segmentation net…

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 8 pages, 3 figures

  24. arXiv:2307.08268  [pdf, other]

    eess.IV cs.CV

    Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient Network

    Authors: Ke Yan, Xiaoli Yin, Yingda Xia, Fakai Wang, Shu Wang, Yuan Gao, Jiawen Yao, Chunli Li, Xiaoyu Bai, Jingren Zhou, Ling Zhang, Le Lu, Yu Shi

    Abstract: Liver tumor segmentation and classification are important tasks in computer aided diagnosis. We aim to address three problems: liver tumor screening and preliminary diagnosis in non-contrast computed tomography (CT), and differential diagnosis in dynamic contrast-enhanced CT. A novel framework named Pixel-Lesion-pAtient Network (PLAN) is proposed. It uses a mask transformer to jointly segment and…

    Submitted 21 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: MICCAI 2023, code: https://github.com/alibaba-damo-academy/pixel-lesion-patient-network

  25. arXiv:2306.09710  [pdf, other]

    eess.SY

    Combinatorial-restless-bandit-based Transmitter-Receiver Online Selection for Distributed MIMO Radars With Non-Stationary Channels

    Authors: Yuhang Hao, Zengfu Wang, Jing Fu, Xianglong Bai, Can Li, Quan Pan

    Abstract: We track moving targets with a distributed multiple-input multiple-output (MIMO) radar, for which the transmitters and receivers are appropriately paired and selected with a limited number of radar stations. We aim to maximize the sum of the signal-to-interference-plus-noise ratios (SINRs) of all the targets by sensibly selecting the transmitter-receiver pairs during the tracking period. A key is…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 13 pages

  26. SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model

    Authors: Dingyuan Zhang, Dingkang Liang, Hongcheng Yang, Zhikang Zou, Xiaoqing Ye, Zhe Liu, Xiang Bai

    Abstract: With the development of large language models, many remarkable linguistic systems like ChatGPT have thrived and achieved astonishing success on many tasks, showing the incredible power of foundation models. In the spirit of unleashing the capability of foundation models on vision tasks, the Segment Anything Model (SAM), a vision foundation model for image segmentation, has been proposed recently a…

    Submitted 29 January, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: Accepted by Science China Information Sciences (SCIS)

  27. arXiv:2305.18708  [pdf, other]

    cs.CV eess.IV

    Infrared Image Deturbulence Restoration Using Degradation Parameter-Assisted Wide & Deep Learning

    Authors: Yi Lu, Yadong Wang, Xingbo Jiang, Xiangzhi Bai

    Abstract: Infrared images captured under turbulent conditions are degraded by complex geometric distortions and blur. We address infrared deturbulence as an image restoration task, proposing DparNet, a parameter-assisted multi-frame network with a wide & deep architecture. DparNet learns a degradation prior (key parameter matrix) directly from degraded images without external knowledge. Its wide & deep arch…

    Submitted 6 May, 2025; v1 submitted 29 May, 2023; originally announced May 2023.

  28. arXiv:2212.07182  [pdf, other]

    eess.SY

    Robust Multitarget Tracking in Interference Environments: A Message-Passing Approach

    Authors: Xianglong Bai, Hua Lan, Zengfu Wang, Quan Pan, Yuhang Hao, Can Li

    Abstract: Multitarget tracking in the interference environments suffers from the nonuniform, unknown and time-varying clutter, resulting in dramatic performance deterioration. We address this challenge by proposing a robust multitarget tracking algorithm, which estimates the states of clutter and targets simultaneously by the message-passing (MP) approach. We define the non-homogeneous clutter with a finite…

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 21 pages, 21 figures

  29. arXiv:2211.02852  [pdf]

    eess.SY

    Fast Quasi-Optimal Power Flow of Flexible DC Traction Power Systems

    Authors: Zhanhe Li, Xiaoqian Li, Yingdong Wei, Chao Lu, Xuelian Bai

    Abstract: This paper proposes a quasi-optimal power flow (OPF) algorithm for flexible DC traction power systems (TPSs). Near-optimal solutions can be solved with high computational efficiency by the proposed quasi-OPF. Unlike conventional OPF utilizing mathematical optimization algorithms, the proposed quasi-OPF adopts analytical mapping from load information to near-optimal solutions, hence considerably ac…

    Submitted 5 November, 2022; originally announced November 2022.

  30. arXiv:2206.04948  [pdf, other]

    eess.SY

    A Holistic Robust Motion Controller Framework for Autonomous Platooning

    Authors: Hong Wang, Li-Ming Peng, Zi-Chun Wei, Kai Yang, Xian-Xu Bai, Luo Jiang, Ehsan Hashemi

    Abstract: Safety is the foremost concern for autonomous platooning. The vehicle-to-vehicle (V2V) communication delay and the sudden appearance of obstacles will trigger the safety of the intended functionality (SOTIF) issues for autonomous platooning. This research proposes a holistic robust motion controller framework (MCF) for an intelligent and connected vehicle platoon system. The MCF utilizes a hierarc…

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 13 pages, 20 figures

  31. arXiv:2109.05462  [pdf, other]

    eess.SP

    Multi-Antenna Systems by Transmissive Reconfigurable Meta-Surface

    Authors: Zhendong Li, Wen Chen, Chong He, Xudong Bai, Jianmin Lu

    Abstract: Reconfigurable meta-surface (RMS) is proposed as a very promising and novel technology, which is composed of a large number of low-cost passive elements, and can achieve passive beamforming by controlling the amplitude and phase of incident electromagnetic (EM) waves. Therefore, in order to solve the challenges of high power consumption and high cost of existing base stations (BSs), we propose a l…

    Submitted 20 February, 2022; v1 submitted 12 September, 2021; originally announced September 2021.

  32. arXiv:2109.03389  [pdf, other]

    eess.SY cs.DC

    An Optimal Resource Allocator of Elastic Training for Deep Learning Jobs on Cloud

    Authors: Liang Hu, Jiangcheng Zhu, Zirui Zhou, Ruiqing Cheng, Xiaolong Bai, Yong Zhang

    Abstract: Cloud training platforms, such as Amazon Web Services and Huawei Cloud provide users with computational resources to train their deep learning jobs. Elastic training is a service embedded in cloud training platforms that dynamically scales up or down the resources allocated to a job. The core technique of an elastic training system is to best allocate limited resources among heterogeneous jobs in…

    Submitted 7 September, 2021; originally announced September 2021.

  33. arXiv:2109.00683  [pdf]

    eess.SP cs.RO

    Time-correlated Window Carrier-phase Aided GNSS Positioning Using Factor Graph Optimization for Urban Positioning

    Authors: Xiwei Bai, Weisong Wen, Li-Ta Hsu

    Abstract: This paper proposes an improved global navigation satellite system (GNSS) positioning method that explores the time correlation between consecutive epochs of the code and carrier phase measurements which significantly increases the robustness against outlier measurements. Instead of relying on the time difference carrier phase (TDCP) which only considers two neighboring epochs using an extended Ka…

    Submitted 1 September, 2021; originally announced September 2021.

  34. arXiv:2012.07616  [pdf, other]

    cs.CV eess.IV

    WDNet: Watermark-Decomposition Network for Visible Watermark Removal

    Authors: Yang Liu, Zhen Zhu, Xiang Bai

    Abstract: Visible watermarks are widely-used in images to protect copyright ownership. Analyzing watermark removal helps to reinforce the anti-attack techniques in an adversarial way. Current removal methods normally leverage image-to-image translation techniques. Nevertheless, the uncertainty of the size, shape, color and transparency of the watermarks set a huge barrier for these methods. To combat this,…

    Submitted 14 December, 2020; v1 submitted 14 December, 2020; originally announced December 2020.

    Comments: To appear in WACV 2021

  35. arXiv:2011.01447  [pdf, other]

    cs.SD cs.AI cs.LG cs.NE eess.AS

    A Two-Stage Approach to Device-Robust Acoustic Scene Classification

    Authors: Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee

    Abstract: To improve device robustness, a highly desirable key feature of a competitive data-driven acoustic scene classification (ASC) system, a novel two-stage system based on fully convolutional neural networks (CNNs) is proposed. Our two-stage system leverages on an ad-hoc score combination based on two CNN classifiers: (i) the first CNN classifies acoustic inputs into one of three broad classes, and (i…

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: Submitted to ICASSP 2021. Code available: https://github.com/MihawkHu/DCASE2020_task1

    Report number: 845--849

    Journal ref: ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  36. arXiv:2010.13624  [pdf]

    eess.SY

    Wind Power Transmission System Integration -- a Case Study of China Wind Power Base

    Authors: Jianxue Wang, Shutang You, Xingzhong Bai, Mingqiao Peng

    Abstract: Due to a series of supporting policies in recent years, China wind power has developed rapidly through a large-scale and centralized mode. This paper analyzes the two major concerns faced by wind power development in China: wind generation reliability and wind energy balancing. More specifically, wind farm tripping-off-grid incidents and wind power curtailment issues, which caused huge economical…

    Submitted 10 April, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 21 pages, 6 figures

  37. arXiv:2008.00107  [pdf, other]

    eess.AS cs.CL cs.SD

    An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances

    Authors: Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Xue Bai, Jun Du, Chin-Hui Lee

    Abstract: In this paper, we propose a sub-utterance unit selection framework to remove acoustic segments in audio recordings that carry little information for acoustic scene classification (ASC). Our approach is built upon a universal set of acoustic segment units covering the overall acoustic scene space. First, those units are modeled with acoustic segment models (ASMs) used to tokenize acoustic scene utt…

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: Accepted by Interspeech 2020

  38. arXiv:2007.08389  [pdf, other]

    eess.AS cs.LG cs.SD

    Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation

    Authors: Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee

    Abstract: In this technical report, we present a joint effort of four groups, namely GT, USTC, Tencent, and UKE, to tackle Task 1 - Acoustic Scene Classification (ASC) in the DCASE 2020 Challenge. Task 1 comprises two different sub-tasks: (i) Task 1a focuses on ASC of audio signals recorded with multiple (real and simulated) devices into ten different fine-grained classes, and (ii) Task 1b concerns with cla…

    Submitted 26 August, 2020; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Revised Technical Report. Proposed systems attain 2nds in both Task-1a and Task-1b in the official DCASE challenge 2020

  39. arXiv:2003.09984  [pdf, other]

    eess.SP eess.SY

    Measurement-Level Fusion for OTHR Network Using Message Passing

    Authors: Hua Lan, Zengfu Wang, Xianglong Bai, Quan Pan, Kun Lu

    Abstract: Tracking an unknown number of targets based on multipath measurements provided by an over-the-horizon radar (OTHR) network with a statistical ionospheric model is complicated, which requires solving four subproblems: target detection, target tracking, multipath data association and ionospheric height identification. A joint solution is desired since the four subproblems are highly correlated, but…

    Submitted 3 April, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: 40 pages, 23 figures

  40. Deep Prototypical Networks Based Domain Adaptation for Fault Diagnosis

    Authors: Huanjie Wang, Jie Tan, Xiwei Bai, Jiechao Yang

    Abstract: Due to the existence of dataset shifts, the distributions of data acquired from different working conditions show significant differences in real-world industrial applications, which leads to performance degradation of traditional machine learning methods. This work provides a framework that combines supervised domain adaptation with prototype learning for fault diagnosis. The main idea of domain…

    Submitted 11 December, 2019; v1 submitted 8 December, 2019; originally announced December 2019.

  41. arXiv:1901.07951  [pdf, other]

    eess.SY

    Modeling and Simulation of UAV Carrier Landings

    Authors: Gaurav Misra, Tianyu Gao, Xiaoli Bai

    Abstract: With UAVs promising capabilities to increase operation flexibility and reduce mission cost, we are exploiting the automated carrier-landing performance advancement that can be achieved by fixed-wing UAVs. To demonstrate such potentials, in this paper, we investigate two key metrics, namely, flight path control performance, and reduced approach speeds for UAVs based on the F/A-18 High Angle of Atta…

    Submitted 23 January, 2019; originally announced January 2019.

  42. arXiv:1901.05284  [pdf]

    eess.SP

    A balanced energy consumption clustering algorithm for heterogeneous energy wireless sensor networks

    Authors: Xiaofu Ma, Yu Fang, Xingzhen Bai

    Abstract: In this paper, a balanced energy consumption clustering algorithm (BECC) is proposed. This new scheme is a cluster-based algorithm designed for heterogeneous energy wireless sensor networks. A polarized energy factor is introduced to adjust the probability with which each node may become a cluster head in the election of the new clustering scheme. Under the condition that the expected number of cl…

    Submitted 4 December, 2018; originally announced January 2019.

    Comments: 2010 IEEE International Conference on Wireless Communications, Networking and Information Security

  43. A multifeature fusion approach for power system transient stability assessment using PMU data

    Authors: Yang Li, Guoqing Li, Zhenhao Wang, Zijiao Han, Xue Bai

    Abstract: Taking full advantage of synchrophasors provided by GPS-based wide-area measurement system (WAMS), a novel VBpMKL-based transient stability assessment (TSA) method through multifeature fusion is proposed in this paper. First, a group of classification features reflecting the transient stability characteristics of power systems are extracted from synchrophasors, and according to the different stage…

    Submitted 8 September, 2018; originally announced September 2018.

    Comments: Accepted by Mathematical Problems in Engineering

    Journal ref: Mathematical Problems in Engineering 2015 (2015) 1-11