Skip to main content

Showing 1–50 of 75 results for author: Xiong, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.08293  [pdf, ps, other

    eess.SP

    Ambiguity Function Analysis of AFDM Signals for Integrated Sensing and Communications

    Authors: Haoran Yin, Yanqun Tang, Yuanhan Ni, Zulin Wang, Gaojie Chen, Jun Xiong, Kai Yang, Marios Kountouris, Yong Liang Guan, Yong Zeng

    Abstract: Affine frequency division multiplexing (AFDM) is a promising chirp-based waveform with high flexibility and resilience, making it well-suited for next-generation wireless networks, particularly in high-mobility scenarios. In this paper, we investigate the ambiguity functions (AFs) of AFDM signals, which fundamentally characterize their range and velocity estimation capabilities in both monostatic… ▽ More

    Submitted 10 July, 2025; originally announced July 2025.

    Comments: 14 pages, 14 figures. Under revision in an IEEE Journal

  2. arXiv:2506.12537  [pdf, ps, other

    cs.CL cs.AI eess.AS

    Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction

    Authors: Xiaoran Fan, Zhichao Sun, Yangfan Gao, Jingfei Xiong, Hang Yan, Yifei Cao, Jiajun Sun, Shuo Li, Zhihao Zhang, Zhiheng Xi, Yuhao Zhou, Senjie Jin, Changhao Jiang, Junjie Ye, Ming Zhang, Rui Zheng, Zhenhua Han, Yunke Zhang, Demei Yan, Shaokang Dong, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: Speech-language models (SLMs) offer a promising path toward unifying speech and text understanding and generation. However, challenges remain in achieving effective cross-modal alignment and high-quality speech generation. In this work, we systematically investigate the impact of key components (i.e., speech tokenizers, speech heads, and speaker modeling) on the performance of LLM-centric SLMs. We… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  3. arXiv:2505.22855  [pdf, ps, other

    eess.IV cs.CV

    IRS: Incremental Relationship-guided Segmentation for Digital Pathology

    Authors: Ruining Deng, Junchao Zhu, Juming Xiong, Can Cui, Tianyuan Yao, Junlin Guo, Siqi Lu, Marilyn Lionts, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Yihe Yang, Paul Dennis Simonson, Mert R. Sabuncu, Haichun Yang, Yuankai Huo

    Abstract: Continual learning is rapidly emerging as a key focus in computer vision, aiming to develop AI systems capable of continuous improvement, thereby enhancing their value and practicality in diverse real-world applications. In healthcare, continual learning holds great promise for continuously acquired digital pathology data, which is collected in hospitals on a daily basis. However, panoramic segmen… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  4. arXiv:2505.19709  [pdf, ps, other

    cs.IT eess.SP

    Capacity-Optimized Pre-Equalizer Design for Visible Light Communication Systems

    Authors: Runxin Zhang, Yulin Shao, Jian Xiong, Lu Lu, Murat Uysal

    Abstract: Since commercial LEDs are primarily designed for illumination rather than data transmission, their modulation bandwidth is inherently limited to a few MHz. This becomes a major bottleneck in the implementation of visible light communication (VLC) systems necessiating the design of pre-equalizers. While state-of-the-art equalizer designs primarily focus on the data rate increasing through bandwidth… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  5. arXiv:2502.04199  [pdf, other

    eess.IV cs.CV

    Expanding Training Data for Endoscopic Phenotyping of Eosinophilic Esophagitis

    Authors: Juming Xiong, Hou Xiong, Quan Liu, Ruining Deng, Regina N Tyree, Girish Hiremath, Yuankai Huo

    Abstract: Eosinophilic esophagitis (EoE) is a chronic esophageal disorder marked by eosinophil-dominated inflammation. Diagnosing EoE usually involves endoscopic inspection of the esophageal mucosa and obtaining esophageal biopsies for histologic confirmation. Recent advances have seen AI-assisted endoscopic imaging, guided by the EREFS system, emerge as a potential alternative to reduce reliance on invasiv… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  6. arXiv:2411.15942  [pdf, other

    eess.IV cs.CV

    Cross-organ Deployment of EOS Detection AI without Retraining: Feasibility and Limitation

    Authors: Yifei Wu, Juming Xiong, Tianyuan Yao, Ruining Deng, Junlin Guo, Jialin Yue, Naweed Chowdhury, Yuankai Huo

    Abstract: Chronic rhinosinusitis (CRS) is characterized by persistent inflammation in the paranasal sinuses, leading to typical symptoms of nasal congestion, facial pressure, olfactory dysfunction, and discolored nasal drainage, which can significantly impact quality-of-life. Eosinophils (Eos), a crucial component in the mucosal immune response, have been linked to disease severity in CRS. The diagnosis of… ▽ More

    Submitted 24 November, 2024; originally announced November 2024.

    Comments: 8 pages, 5 figures. Accepted by SPIE Medical Imaging 2025 on October 28, 2024

  7. arXiv:2411.13766  [pdf, ps, other

    cs.SD cs.AI eess.AS

    Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge

    Authors: Ruiyang Qin, Dancheng Liu, Gelei Xu, Zheyu Yan, Chenhui Xu, Yuting Hu, X. Sharon Hu, Jinjun Xiong, Yiyu Shi

    Abstract: The combination of Large Language Models (LLM) and Automatic Speech Recognition (ASR), when deployed on edge devices (called edge ASR-LLM), can serve as a powerful personalized assistant to enable audio-based interaction for users. Compared to text-based interaction, edge ASR-LLM allows accessible and natural audio interactions. Unfortunately, existing ASR-LLM models are mainly trained in high-per… ▽ More

    Submitted 9 July, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: Accepted by ICCAD'25

  8. arXiv:2411.00078  [pdf, other

    cs.CV cs.AI eess.IV

    How Good Are We? Evaluating Cell AI Foundation Models in Kidney Pathology with Human-in-the-Loop Enrichment

    Authors: Junlin Guo, Siqi Lu, Can Cui, Ruining Deng, Tianyuan Yao, Zhewen Tao, Yizhe Lin, Marilyn Lionts, Quan Liu, Juming Xiong, Yu Wang, Shilin Zhao, Catie Chang, Mitchell Wilkes, Mengmeng Yin, Haichun Yang, Yuankai Huo

    Abstract: Training AI foundation models has emerged as a promising large-scale learning approach for addressing real-world healthcare challenges, including digital pathology. While many of these models have been developed for tasks like disease diagnosis and tissue quantification using extensive and diverse training datasets, their readiness for deployment on some arguably simplest tasks, such as nuclei seg… ▽ More

    Submitted 31 October, 2024; originally announced November 2024.

  9. arXiv:2410.11865  [pdf, other

    eess.AS cs.CL q-bio.QM

    Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges

    Authors: Dancheng Liu, Jason Yang, Ishan Albrecht-Buehler, Helen Qin, Sophie Li, Yuting Hu, Amir Nassereldine, Jinjun Xiong

    Abstract: Speech is a fundamental aspect of human life, crucial not only for communication but also for cognitive, social, and academic development. Children with speech disorders (SD) face significant challenges that, if unaddressed, can result in lasting negative impacts. Traditionally, speech and language assessments (SLA) have been conducted by skilled speech-language pathologists (SLPs), but there is a… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: AAAI-FSS 24

  10. arXiv:2409.16277  [pdf, other

    eess.IV cs.CV

    Compressed Depth Map Super-Resolution and Restoration: AIM 2024 Challenge Results

    Authors: Marcos V. Conde, Florin-Alexandru Vasluianu, Jinhui Xiong, Wei Ye, Rakesh Ranjan, Radu Timofte

    Abstract: The increasing demand for augmented reality (AR) and virtual reality (VR) applications highlights the need for efficient depth information processing. Depth maps, essential for rendering realistic scenes and supporting advanced functionalities, are typically large and challenging to stream efficiently due to their size. This challenge introduces a focus on developing innovative depth upsampling te… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: ECCV 2024 - Advances in Image Manipulation (AIM)

  11. arXiv:2409.13117  [pdf, other

    eess.IV

    Breaking the Barriers of One-to-One Usage of Implicit Neural Representation in Image Compression: A Linear Combination Approach with Performance Guarantees

    Authors: Sai Sanjeet, Seyyedali Hosseinalipour, Jinjun Xiong, Masahiro Fujita, Bibhu Datta Sahoo

    Abstract: In an era where the exponential growth of image data driven by the Internet of Things (IoT) is outpacing traditional storage solutions, this work explores and advances the potential of Implicit Neural Representation (INR) as a transformative approach to image compression. INR leverages the function approximation capabilities of neural networks to represent various types of data. While previous res… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: 10 pages, 13 figures

  12. arXiv:2408.06381  [pdf, other

    eess.IV cs.AI cs.CV

    Assessment of Cell Nuclei AI Foundation Models in Kidney Pathology

    Authors: Junlin Guo, Siqi Lu, Can Cui, Ruining Deng, Tianyuan Yao, Zhewen Tao, Yizhe Lin, Marilyn Lionts, Quan Liu, Juming Xiong, Yu Wang, Shilin Zhao, Catie Chang, Mitchell Wilkes, Mengmeng Yin, Haichun Yang, Yuankai Huo

    Abstract: Cell nuclei instance segmentation is a crucial task in digital kidney pathology. Traditional automatic segmentation methods often lack generalizability when applied to unseen datasets. Recently, the success of foundation models (FMs) has provided a more generalizable solution, potentially enabling the segmentation of any cell type. In this study, we perform a large-scale evaluation of three widely… ▽ More

    Submitted 6 February, 2025; v1 submitted 9 August, 2024; originally announced August 2024.

  13. arXiv:2407.06662  [pdf, other

    eess.SP

    Experimental Demonstration of 16D Voronoi Constellation with Two-Level Coding over 50km Four-Core Fiber

    Authors: Can Zhao, Bin Chen, Jiaqi Cai, Zhiwei Liang, Yi Lei, Junjie Xiong, Lin Ma, Daohui Hu, Lin Sun, Gangxiang Shen

    Abstract: A 16-dimensional Voronoi constellation concatenated with multilevel coding is experimentally demonstrated over a 50km four-core fiber transmission system. The proposed scheme reduces the required launch power by 6dB and provides a 17dB larger operating range than 16QAM with BICM at the outer HD-FEC BER threshold.

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 4 pages, 4 figures, accepted by 2024 European Conference on Optical Communication (ECOC)

  14. arXiv:2407.00596  [pdf, other

    eess.IV cs.CV

    HATs: Hierarchical Adaptive Taxonomy Segmentation for Panoramic Pathology Image Analysis

    Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Juming Xiong, Shunxing Bao, Hao Li, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Haichun Yang, Yuankai Huo

    Abstract: Panoramic image segmentation in computational pathology presents a remarkable challenge due to the morphologically complex and variably scaled anatomy. For instance, the intricate organization in kidney pathology spans multiple layers, from regions like the cortex and medulla to functional units such as glomeruli, tubules, and vessels, down to various cell types. In this paper, we propose a novel… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.19286

  15. arXiv:2406.17926  [pdf, other

    cs.CL cs.SD eess.AS

    FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data

    Authors: Dancheng Liu, Jinjun Xiong

    Abstract: Automatic Speech Recognition (ASR) for adults' speeches has made significant progress by employing deep neural network (DNN) models recently, but improvement in children's speech is still unsatisfactory due to children's speech's distinct characteristics. DNN models pre-trained on adult data often struggle in generalizing children's speeches with fine tuning because of the lack of high-quality ali… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 4 pages, 1 figure

  16. arXiv:2406.15668  [pdf, other

    cs.CL cs.SD eess.AS

    PI-Whisper: Designing an Adaptive and Incremental Automatic Speech Recognition System for Edge Devices

    Authors: Amir Nassereldine, Dancheng Liu, Chenhui Xu, Ruiyang Qin, Yiyu Shi, Jinjun Xiong

    Abstract: Edge-based automatic speech recognition (ASR) technologies are increasingly prevalent in the development of intelligent and personalized assistants. However, resource-constrained ASR models face significant challenges in adaptivity, incrementality, and inclusivity when faced with a diverse population. To tackle those challenges, we propose PI-Whisper, a novel ASR system that adaptively enhances re… ▽ More

    Submitted 23 December, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: in submission

  17. arXiv:2403.04945  [pdf, ps, other

    cs.CL cs.LG eess.SP

    MEIT: Multimodal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation

    Authors: Zhongwei Wan, Che Liu, Xin Wang, Chaofan Tao, Hui Shen, Jing Xiong, Rossella Arcucci, Huaxiu Yao, Mi Zhang

    Abstract: Electrocardiogram (ECG) is the primary non-invasive diagnostic tool for monitoring cardiac conditions and is crucial in assisting clinicians. Recent studies have concentrated on classifying cardiac conditions using ECG data but have overlooked ECG report generation, which is time-consuming and requires clinical expertise. To automate ECG report generation and ensure its versatility, we propose the… ▽ More

    Submitted 7 July, 2025; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: ACL 2025

  18. arXiv:2402.19286  [pdf, other

    eess.IV cs.CV

    PrPSeg: Universal Proposition Learning for Panoramic Renal Pathology Segmentation

    Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Jialin Yue, Juming Xiong, Lining Yu, Yifei Wu, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Haichun Yang, Yuankai Huo

    Abstract: Understanding the anatomy of renal pathology is crucial for advancing disease diagnostics, treatment evaluation, and clinical research. The complex kidney system comprises various components across multiple levels, including regions (cortex, medulla), functional units (glomeruli, tubules), and cells (podocytes, mesangial cells in glomerulus). Prior studies have predominantly overlooked the intrica… ▽ More

    Submitted 20 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: IEEE / CVF Computer Vision and Pattern Recognition Conference 2024

  19. arXiv:2311.08880  [pdf, other

    cs.RO eess.SY

    Motion Control of Two Mobile Robots under Allowable Collisions

    Authors: Li Tan, Wei Ren, Xi-Ming Sun, Junlin Xiong

    Abstract: This letter investigates the motion control problem of two mobile robots under allowable collisions. Here, the allowable collisions mean that the collisions do not damage the mobile robots. The occurrence of the collisions is discussed and the effects of the collisions on the mobile robots are analyzed to develop a hybrid model of each mobile robot under allowable collisions. Based on the effects… ▽ More

    Submitted 26 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages, 5 figures

  20. arXiv:2308.08974  [pdf, other

    eess.IV cs.CV

    Eosinophils Instance Object Segmentation on Whole Slide Imaging Using Multi-label Circle Representation

    Authors: Yilin Liu, Ruining Deng, Juming Xiong, Regina N Tyree, Hernan Correa, Girish Hiremath, Yaohong Wang, Yuankai Huo

    Abstract: Eosinophilic esophagitis (EoE) is a chronic and relapsing disease characterized by esophageal inflammation. Symptoms of EoE include difficulty swallowing, food impaction, and chest pain which significantly impact the quality of life, resulting in nutritional impairments, social limitations, and psychological distress. The diagnosis of EoE is typically performed with a threshold (15 to 20) of eosin… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  21. arXiv:2308.06333  [pdf, other

    eess.IV cs.CV

    Deep Learning-Based Open Source Toolkit for Eosinophil Detection in Pediatric Eosinophilic Esophagitis

    Authors: Juming Xiong, Yilin Liu, Ruining Deng, Regina N Tyree, Hernan Correa, Girish Hiremath, Yaohong Wang, Yuankai Huo

    Abstract: Eosinophilic Esophagitis (EoE) is a chronic, immune/antigen-mediated esophageal disease, characterized by symptoms related to esophageal dysfunction and histological evidence of eosinophil-dominant inflammation. Owing to the intricate microscopic representation of EoE in imaging, current methodologies which depend on manual identification are not only labor-intensive but also prone to inaccuracies… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

  22. arXiv:2307.14778  [pdf, other

    cs.LG eess.SP

    MATNilm: Multi-appliance-task Non-intrusive Load Monitoring with Limited Labeled Data

    Authors: Jing Xiong, Tianqi Hong, Dongbo Zhao, Yu Zhang

    Abstract: Non-intrusive load monitoring (NILM) identifies the status and power consumption of various household appliances by disaggregating the total power usage signal of an entire house. Efficient and accurate load monitoring facilitates user profile establishment, intelligent household energy management, and peak load shifting. This is beneficial for both the end-users and utilities by improving the ove… ▽ More

    Submitted 29 July, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  23. arXiv:2307.14076  [pdf, other

    eess.SP

    A Phase-Coded Time-Domain Interleaved OTFS Waveform with Improved Ambiguity Function

    Authors: Jiajun Zhu, Yanqun Tang, Chao Yang, Chi Zhang, Haoran Yin, Jiaojiao Xiong, Yuhua Chen

    Abstract: Integrated sensing and communication (ISAC) is a significant application scenario in future wireless communication networks, and sensing capability of a waveform is always evaluated by the ambiguity function. To enhance the sensing performance of the orthogonal time frequency space (OTFS) waveform, we propose a novel time-domain interleaved cyclic-shifted P4-coded OTFS (TICP4-OTFS) with improved a… ▽ More

    Submitted 23 September, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted by 2023 IEEE Globecom Workshops (GC Wkshps): Workshop on Integrated Sensing and Communications for Internet of Things

  24. arXiv:2307.09279  [pdf, other

    cs.CV eess.IV

    Regression-free Blind Image Quality Assessment with Content-Distortion Consistency

    Authors: Xiaoqi Wang, Jian Xiong, Hao Gao, Weisi Lin

    Abstract: The optimization objective of regression-based blind image quality assessment (IQA) models is to minimize the mean prediction error across the training dataset, which can lead to biased parameter estimation due to potential training data biases. To mitigate this issue, we propose a regression-free framework for image quality evaluation, which is based upon retrieving locally similar instances by i… ▽ More

    Submitted 21 October, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

  25. arXiv:2306.02306  [pdf, other

    cs.CV cs.LG eess.IV

    Cross-CBAM: A Lightweight network for Scene Segmentation

    Authors: Zhengbin Zhang, Zhenhao Xu, Xingsheng Gu, Juan Xiong

    Abstract: Scene parsing is a great challenge for real-time semantic segmentation. Although traditional semantic segmentation networks have made remarkable leap-forwards in semantic accuracy, the performance of inference speed is unsatisfactory. Meanwhile, this progress is achieved with fairly large networks and powerful computational resources. However, it is difficult to run extremely large models on edge… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

  26. arXiv:2305.08465  [pdf, other

    eess.SP

    An Overview of Resource Allocation in Integrated Sensing and Communication

    Authors: Jinming Du, Yanqun Tang, Xizhang Wei, Jiaojiao Xiong, Jiajun Zhu, Haoran Yin, Chi Zhang, Haibo Chen

    Abstract: Integrated sensing and communication (ISAC) is considered as a promising solution for improving spectrum efficiency and relieving wireless spectrum congestion. This paper systematically introduces the evolutionary path of ISAC technologies, then sorts out and summarizes the current research status of ISAC resource allocation. From the perspective of different integrated levels of ISAC, we introduc… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 6 pages,4 figures,conference

  27. A Unifying Framework of Attention-based Neural Load Forecasting

    Authors: Jing Xiong, Yu Zhang

    Abstract: Accurate load forecasting is critical for reliable and efficient planning and operation of electric power grids. In this paper, we propose a unifying deep learning framework for load forecasting, which includes time-varying feature weighting, hierarchical temporal attention, and feature-reinforced error correction. Our framework adopts a modular design with good generalization capability. First, t… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  28. arXiv:2302.14224  [pdf, other

    eess.SP cs.NI

    Overview and Performance Analysis of Various Waveforms in High Mobility Scenarios

    Authors: Yu Zhou, Haoran Yin, Jiaojiao Xiong, Shiyu Song, Jiajun Zhu, Jinming Du, Haibo Chen, Yanqun Tang

    Abstract: In the high-mobility scenarios of next-generation wireless communication systems (beyond 5G/6G), the performance of orthogonal frequency division multiplexing (OFDM) deteriorates drastically due to the loss of orthogonality between the subcarriers caused by large Doppler frequency shifts. Various emerging waveforms have been proposed for fast time-varying channels with excellent results. In this p… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  29. arXiv:2302.11179   

    eess.SP

    Cyclic Delay-Doppler Shift: A Simple Transmit Diversity Technique for Delay-Doppler Waveforms in Doubly Selective Channels

    Authors: Haoran Yin, Jiaojiao Xiong, Yu Zhou, Chi Zhang, Di Zhang, Xizhang Wei, Yanqun Tang

    Abstract: Delay-Doppler waveform design has been considered as a promising solution to achieve reliable communication under high-mobility channels for the space-air-ground-integrated networks (SAGIN). In this paper, we introduce the cyclic delay-Doppler shift (CDDS) technique for delay-Doppler waveforms to extract transmit diversity in doubly selective channels. Two simple CDDS schemes, named time-domain CD… ▽ More

    Submitted 14 January, 2025; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: We are requesting the withdrawal of this paper due to critical issues identified in the document. Specifically, in Section III of the paper, the expression in Equation (7) is ambiguous and leads to inconsistencies in the subsequent derivations and conclusions. As a result, this could potentially confuse readers and misguide further research. Significant changes are made to the documents

  30. arXiv:2211.08658  [pdf, other

    eess.IV cs.CV

    Consistent Direct Time-of-Flight Video Depth Super-Resolution

    Authors: Zhanghao Sun, Wei Ye, Jinhui Xiong, Gyeongmin Choe, Jialiang Wang, Shuochen Su, Rakesh Ranjan

    Abstract: Direct time-of-flight (dToF) sensors are promising for next-generation on-device 3D sensing. However, limited by manufacturing capabilities in a compact module, the dToF data has a low spatial resolution (e.g., $\sim 20\times30$ for iPhone dToF), and it requires a super-resolution step before being passed to downstream tasks. In this paper, we solve this super-resolution problem by fusing the low-… ▽ More

    Submitted 3 May, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  31. arXiv:2211.03577  [pdf

    physics.optics eess.SP physics.app-ph

    Regrowth-free AlGaInAs MQW polarization controller integrated with sidewall grating DFB laser

    Authors: Xiao Sun, Song Liang, Weiqing Cheng, Shengwei Ye, Yiming Sun, Yongguang Huang, Ruikang Zhang, Jichuan Xiong, Xuefeng Liu, John H. Marsh, Lianping Hou

    Abstract: We report an AlGaInAs multiple quantum well integrated source of polarization controlled light consisting of a polarization mode converter PMC, differential phase shifter(DPS), and a side wall grating distributed-feedback DFB laser. We demonstrate an asymmetrical stepped-height ridge waveguide PMC to realize TE to TM polarization conversion and a symmetrical straight waveguide DPS to enable polari… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2210.10519

  32. arXiv:2208.07655  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    A Hybrid Deep Feature-Based Deformable Image Registration Method for Pathology Images

    Authors: Chulong Zhang, Yuming Jiang, Na Li, Zhicheng Zhang, Md Tauhidul Islam, Jingjing Dai, Lin Liu, Wenfeng He, Wenjian Qin, Jing Xiong, Yaoqin Xie, Xiaokun Liang

    Abstract: Pathologists need to combine information from differently stained pathology slices for accurate diagnosis. Deformable image registration is a necessary technique for fusing multi-modal pathology slices. This paper proposes a hybrid deep feature-based deformable image registration framework for stained pathology samples. We first extract dense feature points via the detector-based and detector-free… ▽ More

    Submitted 10 April, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: 22 pages, 12 figures. This work has been submitted to the IEEE for possible publication

  33. arXiv:2205.02939  [pdf

    eess.SY

    Modelling Pre-fatigue, Low-velocity Impact and Fatigue behaviours of Composite Helicopter Tail Structures under Multipoint Coordinated Loading Spectrum

    Authors: Zheng-Qiang Cheng, Wei Tan, Jun-Jiang Xiong

    Abstract: This paper aims to numerically study the pre-fatigue, low-velocity impact (LVI) and fatigue progressive damage behaviours of a full-scale composite helicopter tail structure under multipoint coordinated loading spectrum. First, a fatigue progressive damage model (PDM) incorporating multiaxial fatigue residual strength degradation rule, fatigue failure criteria based on fatigue residual strength co… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: 43 pages, 16 figures

  34. arXiv:2203.02655  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Audio-visual speech separation based on joint feature representation with cross-modal attention

    Authors: Junwen Xiong, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha, Yanning Zhang

    Abstract: Multi-modal based speech separation has exhibited a specific advantage on isolating the target character in multi-talker noisy environments. Unfortunately, most of current separation strategies prefer a straightforward fusion based on feature learning of each single modality, which is far from sufficient consideration of inter-relationships between modalites. Inspired by learning joint feature rep… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 5 pages, 3 figures

  35. arXiv:2203.02216  [pdf, other

    cs.SD cs.AI cs.MM eess.AS

    Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement

    Authors: Junwen Xiong, Yu Zhou, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha

    Abstract: Active speaker detection and speech enhancement have become two increasingly attractive topics in audio-visual scenario understanding. According to their respective characteristics, the scheme of independently designed architecture has been widely used in correspondence to each single task. This may lead to the representation learned by the model being task-specific, and inevitably result in the l… ▽ More

    Submitted 7 July, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: 13 pages, 8figures

  36. arXiv:2201.12862  [pdf, ps, other

    eess.SY

    Lyapunov Conditions for Input-to-State Stability of Hybrid Systems with Memory

    Authors: Wei Ren, Junlin Xiong

    Abstract: This paper studies input-to-state stability for hybrid systems with memory, which models hybrid dynamics affected by time delays. Using both Lyapunov-Razumikhin functions and Lyapunov-Krasovskii functionals, Lyapunov-based sufficient conditions are established for input-to-state stability. In addition, further extensions and relaxations are proposed for special cases, such as the stable flow/jump… ▽ More

    Submitted 30 January, 2022; originally announced January 2022.

    Comments: 8 pages, IEEE Transactions on Automatic Control

  37. arXiv:2201.08514  [pdf, other

    cs.LG eess.SP

    How does unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis

    Authors: Shuai Zhang, Meng Wang, Sijia Liu, Pin-Yu Chen, Jinjun Xiong

    Abstract: Self-training, a semi-supervised learning algorithm, leverages a large amount of unlabeled data to improve learning when the labeled data are limited. Despite empirical successes, its theoretical characterization remains elusive. To the best of our knowledge, this work establishes the first theoretical analysis for the known iterative self-training paradigm and proves the benefits of unlabeled dat… ▽ More

    Submitted 14 February, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: 36 pages

    Journal ref: Tenth International Conference on Learning Representations 2022

  38. RF-Based Human Activity Recognition Using Signal Adapted Convolutional Neural Network

    Authors: Zhe Chen, Chao Cai, Tianyue Zheng, Jun Luo, Jie Xiong, Xin Wang

    Abstract: Human Activity Recognition (HAR) plays a critical role in a wide range of real-world applications, and it is traditionally achieved via wearable sensing. Recently, to avoid the burden and discomfort caused by wearable devices, device-free approaches exploiting RF signals arise as a promising alternative for HAR. Most of the latest device-free approaches require training a large deep neural network… ▽ More

    Submitted 27 October, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: 13 pages

    Journal ref: IEEE Transactions on Mobile Computing, 19 April 2021

  39. arXiv:2110.09123  [pdf, other

    cs.IT eess.SP

    Joint Spatial Division and Coaxial Multiplexing for Downlink Multi-User OAM Wireless Backhaul

    Authors: Wen-Xuan Long, Rui Chen, Marco Moretti, Jian Xiong, Jiandong Li

    Abstract: Orbital angular momentum (OAM) at radio frequency (RF) provides a novel approach of multiplexing a set of orthogonal modes on the same frequency channel to achieve high spectral efficiencies (SEs). However, the existing research on OAM wireless communications is mainly focused on pointto-point transmission in the line-of-sight (LoS) scenario. In this paper, we propose an overall scheme of the down… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  40. arXiv:2110.07378  [pdf, other

    eess.SY math.OC

    Scheduler-Pointed False Data Injection Attack for Event-Based Remote State Estimation

    Authors: Qiulin Xu, Junlin Xiong

    Abstract: In this paper, an attack problem is investigated for event-based remote state estimation in cyber-physical systems. Our objective is to degrade the effect of the event-based scheduler while bypassing a $χ^2$ false data detector. A two-channel scheduler-pointed false data injection attack strategy is proposed by modifying the numerical characteristics of innovation signals. The attack strategy is p… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: 10 pages, 5 figures

  41. Attention-based Neural Load Forecasting: A Dynamic Feature Selection Approach

    Authors: Jing Xiong, Pengyang Zhou, Alan Chen, Yu Zhang

    Abstract: Encoder-decoder-based recurrent neural network (RNN) has made significant progress in sequence-to-sequence learning tasks such as machine translation and conversational models. Recent works have shown the advantage of this type of network in dealing with various time series forecasting tasks. The present paper focuses on the problem of multi-horizon short-term load forecasting, which plays a key r… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

  42. arXiv:2108.10122  [pdf, other

    physics.optics eess.IV

    Ghost Panorama

    Authors: Zhiyuan Ye, Hai-Bo Wang, Jun Xiong, Kaige Wang

    Abstract: Computational ghost imaging or single-pixel imaging enables the image formation of an unknown scene using a lens-free photodetector. In this Letter, we present a computational panoramic ghost imaging system that can achieve the full-color panorama using a single-pixel photodetector, where a convex mirror performs the optical transformation of the engineered Hadamard-based circular illumination pat… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: 5 pages, 4figures

  43. arXiv:2106.15283  [pdf, other

    cs.CV cs.LG eess.SP

    Similarity Embedding Networks for Robust Human Activity Recognition

    Authors: Chenglin Li, Carrie Lu Tong, Di Niu, Bei Jiang, Xiao Zuo, Lei Cheng, Jian Xiong, Jianming Yang

    Abstract: Deep learning models for human activity recognition (HAR) based on sensor data have been heavily studied recently. However, the generalization ability of deep models on complex real-world HAR data is limited by the availability of high-quality labeled activity data, which are hard to obtain. In this paper, we design a similarity embedding neural network that maps input sensor signals onto real vec… ▽ More

    Submitted 31 May, 2021; originally announced June 2021.

  44. arXiv:2106.08519  [pdf, other

    eess.AS cs.LG cs.SD

    Global Rhythm Style Transfer Without Text Transcriptions

    Authors: Kaizhi Qian, Yang Zhang, Shiyu Chang, Jinjun Xiong, Chuang Gan, David Cox, Mark Hasegawa-Johnson

    Abstract: Prosody plays an important role in characterizing the style of a speaker or an emotion, but most non-parallel voice or emotion style transfer algorithms do not convert any prosody information. Two major components of prosody are pitch and rhythm. Disentangling the prosody information, particularly the rhythm component, from the speech is challenging because it involves breaking the synchrony betwe… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

  45. Energy-Efficient Precoding in Electromagnetic Exposure-Constrained Uplink Multiuser MIMO

    Authors: Jiayuan Xiong, Li You, Derrick Wing Kwan Ng, Wenjin Wang, Xiqi Gao

    Abstract: User electromagnetic (EM) exposure is continuously being exacerbated by the evolution of multi-antenna portable devices. To mitigate the effects of EM radiation, portable devices must satisfy tight regulations on user exposure level, generally measured by specific absorption rate (SAR). To this end, we investigate the SAR-aware uplink precoder design for the energy efficiency (EE) maximization in… ▽ More

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: We investigate the SAR-aware uplink precoder design for the EE maximization in multiuser MIMO transmission exploiting statistical CSI

    Journal ref: IEEE Transactions on Vehicular Technology, vol. 70, no. 7, pp. 7226-7231, Jul. 2021

  46. arXiv:2105.10299  [pdf, other

    eess.SY eess.SP

    Optimal Estimator Design and Properties Analysis for Interconnected Systems with Asymmetric Information Structure

    Authors: Yan Wang, Junlin Xiong, Zaiyue Yang, Rong Su

    Abstract: This paper studies the optimal state estimation problem for interconnected systems. Each subsystem can obtain its own measurement in real time, while, the measurements transmitted between the subsystems suffer from random delay. The optimal estimator is analytically designed for minimizing the conditional error covariance. The boundedness of the expected error covariance (EEC) is analyzed. In part… ▽ More

    Submitted 2 May, 2023; v1 submitted 21 May, 2021; originally announced May 2021.

  47. arXiv:2104.05463  [pdf, other

    cs.LG eess.SP

    Scalable Power Control/Beamforming in Heterogeneous Wireless Networks with Graph Neural Networks

    Authors: Xiaochen Zhang, Haitao Zhao, Jun Xiong, Li Zhou, Jibo Wei

    Abstract: Machine learning (ML) has been widely used for efficient resource allocation (RA) in wireless networks. Although superb performance is achieved on small and simple networks, most existing ML-based approaches are confronted with difficulties when heterogeneity occurs and network size expands. In this paper, specifically focusing on power control/beamforming (PC/BF) in heterogeneous device-to-device… ▽ More

    Submitted 8 December, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: 6 pages, 6 figures, accepted by IEEE GLOBECOM 2021

  48. arXiv:2103.16051  [pdf, ps, other

    eess.SY

    Reduced Dynamics and Control for an Autonomous Bicycle

    Authors: Jiaming Xiong, Bo Li, Ruihan Yu, Daolin Ma, Wei Wang, Caishan Liu

    Abstract: In this paper, we propose the reduced model for the full dynamics of a bicycle and analyze its nonlinear behavior under a proportional control law for steering. Based on the Gibbs-Appell equations for the Whipple bicycle, we obtain a second-order nonlinear ordinary differential equation (ODE) that governs the bicycle's controlled motion. Two types of equilibrium points for the governing equation a… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Journal ref: ICRA 2021

  49. arXiv:2103.06549  [pdf, other

    eess.IV

    Advanced Geometry Surface Coding for Dynamic Point Cloud Compression

    Authors: Jian Xiong, Hao Gao, Miaohui Wang, Hongliang Li, King Ngi Ngan, Weisi Lin

    Abstract: In video-based dynamic point cloud compression (V-PCC), 3D point clouds are projected onto 2D images for compressing with the existing video codecs. However, the existing video codecs are originally designed for natural visual signals, and it fails to account for the characteristics of point clouds. Thus, there are still problems in the compression of geometry information generated from the point… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

  50. arXiv:2103.02894  [pdf, ps, other

    eess.SY

    Stability and $\mathcal{H}_{\infty}$ Performance Analysis of Stochastic Linear Networked and Quantized Control Systems

    Authors: Wei Ren, Junlin Xiong

    Abstract: This paper studies the stability and $\mathcal{H}_{\infty}$ performance analysis problem for linear networked and quantized control systems with both communication delays random packet losses. To deal with the network-induced uncertainties and random packet dropouts, a novel discrete-time stochastic system model is developed for continuous-time networked control systems, and further overapproximat… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: 8 pages, 2 figures, extended version of the ACC paper