Skip to main content

Showing 1–32 of 32 results for author: Gong, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2509.12714  [pdf, ps, other

    cs.RO eess.SP

    MoiréTac: A Dual-Mode Visuotactile Sensor for Multidimensional Perception Using Moiré Pattern Amplification

    Authors: Kit-Wa Sou, Junhao Gong, Shoujie Li, Chuqiao Lyu, Ziwu Song, Shilong Mu, Wenbo Ding

    Abstract: Visuotactile sensors typically employ sparse marker arrays that limit spatial resolution and lack clear analytical force-to-image relationships. To solve this problem, we present \textbf{MoiréTac}, a dual-mode sensor that generates dense interference patterns via overlapping micro-gratings within a transparent architecture. When two gratings overlap with misalignment, they create moiré patterns th… ▽ More

    Submitted 16 September, 2025; originally announced September 2025.

  2. arXiv:2507.20489  [pdf, ps, other

    eess.SP

    Energy-Efficient Secure Communications via Joint Optimization of UAV Trajectory and Movable-Antenna Array Beamforming

    Authors: Sanghyeok Kim, Jinu Gong, Joonhyuk Kang

    Abstract: This paper investigates the potential of unmanned aerial vehicles (UAVs) equipped with movable-antenna (MA) arrays to strengthen security in wireless communication systems. We propose a novel framework that jointly optimizes the UAV trajectory and the reconfigurable beamforming of the MA array to maximize secrecy energy efficiency, while ensuring reliable communication with legitimate users. By ex… ▽ More

    Submitted 27 July, 2025; originally announced July 2025.

    Comments: 5 pages, 2 figures

  3. arXiv:2507.19822  [pdf, ps, other

    cs.LG eess.IV eess.SP

    Debunking Optimization Myths in Federated Learning for Medical Image Classification

    Authors: Youngjoon Lee, Hyukjoon Lee, Jinu Gong, Yang Cao, Joonhyuk Kang

    Abstract: Federated Learning (FL) is a collaborative learning method that enables decentralized model training while preserving data privacy. Despite its promise in medical imaging, recent FL methods are often sensitive to local factors such as optimizers and learning rates, limiting their robustness in practical deployments. In this work, we revisit vanilla FL to clarify the impact of edge device configura… ▽ More

    Submitted 26 July, 2025; originally announced July 2025.

    Comments: Accepted to Efficient Medical AI Workshop - MICCAI 2025

  4. arXiv:2506.08967  [pdf, ps, other

    cs.SD cs.CL eess.AS

    Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

    Authors: Ailin Huang, Bingxin Li, Bruce Wang, Boyong Wu, Chao Yan, Chengli Feng, Heng Wang, Hongyu Zhou, Hongyuan Wang, Jingbei Li, Jianjian Sun, Joanna Wang, Mingrui Chen, Peng Liu, Ruihang Miao, Shilei Jiang, Tian Fei, Wang You, Xi Chen, Xuerui Yang, Yechang Huang, Yuxiang Zhang, Zheng Ge, Zheng Gong, Zhewei Huang , et al. (51 additional authors not shown)

    Abstract: Large Audio-Language Models (LALMs) have significantly advanced intelligent human-computer interaction, yet their reliance on text-based outputs limits their ability to generate natural speech responses directly, hindering seamless audio interactions. To address this, we introduce Step-Audio-AQAA, a fully end-to-end LALM designed for Audio Query-Audio Answer (AQAA) tasks. The model integrates a du… ▽ More

    Submitted 13 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: 12 pages, 3 figures

  5. arXiv:2506.00045  [pdf, other

    cs.SD eess.AS

    ACE-Step: A Step Towards Music Generation Foundation Model

    Authors: Junmin Gong, Sean Zhao, Sen Wang, Shengyuan Xu, Joe Guo

    Abstract: We introduce ACE-Step, a novel open-source foundation model for music generation that overcomes key limitations of existing approaches and achieves state-of-the-art performance through a holistic architectural design. Current methods face inherent trade-offs between generation speed, musical coherence, and controllability. For example, LLM-based models (e.g. Yue, SongGen) excel at lyric alignment… ▽ More

    Submitted 28 May, 2025; originally announced June 2025.

    Comments: 14 pages, 5 figures, ace-step's tech report

  6. arXiv:2505.05798  [pdf, ps, other

    cs.LG cs.CV eess.IV eess.SP

    Improving Generalizability of Kolmogorov-Arnold Networks via Error-Correcting Output Codes

    Authors: Youngjoon Lee, Jinu Gong, Joonhyuk Kang

    Abstract: Kolmogorov-Arnold Networks (KAN) offer universal function approximation using univariate spline compositions without nonlinear activations. In this work, we integrate Error-Correcting Output Codes (ECOC) into the KAN framework to transform multi-class classification into multiple binary tasks, improving robustness via Hamming distance decoding. Our proposed KAN with ECOC framework outperforms vani… ▽ More

    Submitted 17 September, 2025; v1 submitted 9 May, 2025; originally announced May 2025.

    Comments: Accepted to IEEE BioCAS 2025

  7. arXiv:2504.19639  [pdf, ps, other

    cs.LG eess.SP

    A Unified Benchmark of Federated Learning with Kolmogorov-Arnold Networks for Medical Imaging

    Authors: Youngjoon Lee, Jinu Gong, Joonhyuk Kang

    Abstract: Federated Learning (FL) enables model training across decentralized devices without sharing raw data, thereby preserving privacy in sensitive domains like healthcare. In this paper, we evaluate Kolmogorov-Arnold Networks (KAN) architectures against traditional MLP across six state-of-the-art FL algorithms on a blood cell classification dataset. Notably, our experiments demonstrate that KAN can eff… ▽ More

    Submitted 17 September, 2025; v1 submitted 28 April, 2025; originally announced April 2025.

    Comments: Accepted to AI/ML for Edge/Fog Networks Workshop - IEEE GLOBECOM 2025

  8. arXiv:2502.11946  [pdf, other

    cs.CL cs.AI cs.HC cs.SD eess.AS

    Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

    Authors: Ailin Huang, Boyong Wu, Bruce Wang, Chao Yan, Chen Hu, Chengli Feng, Fei Tian, Feiyu Shen, Jingbei Li, Mingrui Chen, Peng Liu, Ruihang Miao, Wang You, Xi Chen, Xuerui Yang, Yechang Huang, Yuxiang Zhang, Zheng Gong, Zixin Zhang, Hongyu Zhou, Jianjian Sun, Brian Li, Chengting Feng, Changyi Wan, Hanpeng Hu , et al. (120 additional authors not shown)

    Abstract: Real-time speech interaction, serving as a fundamental interface for human-machine collaboration, holds immense potential. However, current open-source models face limitations such as high costs in voice data collection, weakness in dynamic control, and limited intelligence. To address these challenges, this paper introduces Step-Audio, the first production-ready open-source solution. Key contribu… ▽ More

    Submitted 18 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  9. arXiv:2501.15206  [pdf, other

    physics.app-ph cond-mat.dis-nn eess.SY

    Engineering-Oriented Design of Drift-Resilient MTJ Random Number Generator via Hybrid Control Strategies

    Authors: Ran Zhang, Caihua Wan, Yingqian Xu, Xiaohan Li, Raik Hoffmann, Meike Hindenberg, Shiqiang Liu, Dehao Kong, Shilong Xiong, Shikun He, Alptekin Vardar, Qiang Dai, Junlu Gong, Yihui Sun, Zejie Zheng, Thomas Kämpfe, Guoqiang Yu, Xiufeng Han

    Abstract: Magnetic Tunnel Junctions (MTJs) have shown great promise as hardware sources for true random number generation (TRNG) due to their intrinsic stochastic switching behavior. However, practical deployment remains challenged by drift in switching probability caused by thermal fluctuations, device aging, and environmental instability. This work presents an engineering-oriented, drift-resilient MTJ-bas… ▽ More

    Submitted 19 April, 2025; v1 submitted 25 January, 2025; originally announced January 2025.

    Comments: 16 pages, 9 figures, data shared at https://doi.org/10.6084/m9.figshare.28680899.v1

  10. arXiv:2410.23824  [pdf, other

    cs.LG eess.SP

    Generative AI-Powered Plugin for Robust Federated Learning in Heterogeneous IoT Networks

    Authors: Youngjoon Lee, Jinu Gong, Joonhyuk Kang

    Abstract: Federated learning enables edge devices to collaboratively train a global model while maintaining data privacy by keeping data localized. However, the Non-IID nature of data distribution across devices often hinders model convergence and reduces performance. In this paper, we propose a novel plugin for federated optimization techniques that approximates Non-IID data distributions to IID through ge… ▽ More

    Submitted 25 April, 2025; v1 submitted 31 October, 2024; originally announced October 2024.

    Comments: 5 pages

  11. arXiv:2406.04985  [pdf, ps, other

    eess.SP cs.ET

    RSMA Assisted ISAC With Hybrid Beamforming

    Authors: Zhuohui Yao, Wenchi Cheng, Liping Liang, Tao Zhang, Jun Gong

    Abstract: The harsh environment and scarce resources post-disaster drive the equipment to be miniaturized and portable. Based on this, integrated sensing and communication (ISAC) systems play a significant role in providing emergency wireless networks. In order to reduce the hardware cost, a hybrid beamforming (HBF) assisted millimeter-wave (mmWave) ISAC system, which exploits the limited number of radio fr… ▽ More

    Submitted 1 September, 2025; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Conference

  12. arXiv:2403.01153  [pdf, other

    eess.SP

    Transfer Learning-Enhanced Instantaneous Multi-Person Indoor Localization by CSI

    Authors: Zhiyuan He, Ke Deng, Jiangchao Gong, Yi Zhou, Desheng Wang

    Abstract: Passive indoor localization, integral to smart buildings, emergency response, and indoor navigation, has traditionally been limited by a focus on single-target localization and reliance on multi-packet CSI. We introduce a novel Multi-target loss, notably enhancing multi-person localization. Utilizing this loss function, our instantaneous CSI-ResNet achieves an impressive 99.21% accuracy at 0.6m pr… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  13. arXiv:2402.09488  [pdf, other

    eess.SY cs.LG

    Intelligent Agricultural Greenhouse Control System Based on Internet of Things and Machine Learning

    Authors: Cangqing Wang, Jiangchuan Gong

    Abstract: This study endeavors to conceptualize and execute a sophisticated agricultural greenhouse control system grounded in the amalgamation of the Internet of Things (IoT) and machine learning. Through meticulous monitoring of intrinsic environmental parameters within the greenhouse and the integration of machine learning algorithms, the conditions within the greenhouse are aptly modulated. The envisage… ▽ More

    Submitted 20 March, 2025; v1 submitted 14 February, 2024; originally announced February 2024.

  14. arXiv:2312.16419  [pdf

    eess.SP

    Radar detection of wake vortex behind the aircraft: the detection range problem

    Authors: Jiangkun Gong, Jun Yan, Deyong Kong, Deren Li

    Abstract: In this study, we showcased the detection of the wake vortex produced by a medium aircraft at distances exceeding 10 km using an X-band pulse-Doppler radar. We analyzed radar signals within the range profiles behind a Boeing 737 aircraft on February 7, 2021, within the airspace of the Runway Protection Zone (RPZ) at Tianhe Airport, Wuhan, China. The findings revealed that the wake vortex extended… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  15. arXiv:2310.14769  [pdf

    eess.SP

    An introduction to radar Automatic Target Recognition (ATR) technology in ground-based radar systems

    Authors: Jiangkun Gong, Jun Yan, Deyong Kong, Deren Li

    Abstract: This paper presents a brief examination of Automatic Target Recognition (ATR) technology within ground-based radar systems. It offers a lucid comprehension of the ATR concept, delves into its historical milestones, and categorizes ATR methods according to different scattering regions. By incorporating ATR solutions into radar systems, this study demonstrates the expansion of radar detection ranges… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  16. arXiv:2309.15415  [pdf

    eess.SP

    Formation Wing-Beat Modulation (FWM): A Tool for Quantifying Bird Flocks Using Radar Micro-Doppler Signals

    Authors: Jiangkun Gong, Jun Yan, Deyong Kong, Ruizhi Chen, Deren Li

    Abstract: Radar echoes from bird flocks contain modulation signals, which we find are produced by the flapping gaits of birds in the flock, resulting in a group of spectral peaks with similar amplitudes spaced at a specific interval. We call this the formation wing-beat modulation (FWM) effect. FWM signals are micro-Doppler modulated by flapping wings and are related to the bird number, wing-beat frequency,… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  17. arXiv:2307.15874  [pdf, ps, other

    eess.SY math.OC

    Resilient Controller Synthesis Against DoS Attacks for Vehicular Platooning in Spatial Domain

    Authors: Jian Gong, Carlos Murguia, Anggera Bayuwindra, Jinde Cao

    Abstract: This paper proposes a vehicular platoon control approach under Denial-of-Service (DoS) attacks and external disturbances. DoS attacks increase the service time on the communication network and cause additional transmission delays, which consequently increase the risk of rear-end collisions of vehicles in the platoon. To counter DoS attacks, we propose a resilient control scheme that exploits polyt… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  18. arXiv:2307.10326  [pdf

    eess.SP eess.SY

    Introduction to Drone Detection Radar with Emphasis on Automatic Target Recognition (ATR) technology

    Authors: Jiangkun Gong, Jun Yan, Deyong Kong, Deren Li

    Abstract: This paper discusses the challenges of detecting and categorizing small drones with radar automatic target recognition (ATR) technology. The authors suggest integrating ATR capabilities into drone detection radar systems to improve performance and manage emerging threats. The study focuses primarily on drones in Group 1 and 2. The paper highlights the need to consider kinetic features and signal s… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: 17 pages, 14 figures, submitted to a journal and being under review

  19. arXiv:2307.02836  [pdf, other

    cs.CV eess.IV

    Noise-to-Norm Reconstruction for Industrial Anomaly Detection and Localization

    Authors: Shiqi Deng, Zhiyu Sun, Ruiyan Zhuang, Jun Gong

    Abstract: Anomaly detection has a wide range of applications and is especially important in industrial quality inspection. Currently, many top-performing anomaly-detection models rely on feature-embedding methods. However, these methods do not perform well on datasets with large variations in object locations. Reconstruction-based methods use reconstruction errors to detect anomalies without considering pos… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  20. arXiv:2304.07150  [pdf, other

    eess.SY physics.soc-ph

    FOCUS : A framework for energy system optimization from prosumer to district and city scale

    Authors: Jingyu Gong, Yi Nie, Jonas van Ouwerkerk, Felix Wege, Mauricio Celi Cortés, Christoph von Oy, Jonas Brucksch, Christian Bußar, Thomas Schreiber, Dirk Uwe Sauer, Dirk Müller, Antonello Monti

    Abstract: Decarbonizing the energy sector is one of the main challenges to combat the climate crisis. Cities play an important role to reach climate neutrality as more than 70% of global CO2 emissions originate from urban areas. Decarbonization of energy supply systems can be achieved through various means, including the use of renewable energy sources, improving the efficiency of technologies, the coupling… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  21. arXiv:2209.07267  [pdf, ps, other

    cs.LG eess.SP

    Compressed Particle-Based Federated Bayesian Learning and Unlearning

    Authors: Jinu Gong, Osvaldo Simeone, Joonhyuk Kang

    Abstract: Conventional frequentist FL schemes are known to yield overconfident decisions. Bayesian FL addresses this issue by allowing agents to process and exchange uncertainty information encoded in distributions over the model parameters. However, this comes at the cost of a larger per-iteration communication overhead. This letter investigates whether Bayesian FL can still provide advantages in terms of… ▽ More

    Submitted 19 September, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Submitted for publication

  22. arXiv:2111.12056  [pdf, ps, other

    cs.LG eess.SP

    Forget-SVGD: Particle-Based Bayesian Federated Unlearning

    Authors: Jinu Gong, Osvaldo Simeone, Rahif Kassab, Joonhyuk Kang

    Abstract: Variational particle-based Bayesian learning methods have the advantage of not being limited by the bias affecting more conventional parametric techniques. This paper proposes to leverage the flexibility of non-parametric Bayesian approximate inference to develop a novel Bayesian federated unlearning method, referred to as Forget-Stein Variational Gradient Descent (Forget-SVGD). Forget-SVGD builds… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: submitted for conference publication

  23. arXiv:2111.05194  [pdf

    eess.IV cs.CV

    Deep Learning Adapted Acceleration for Limited-view Photoacoustic Computed Tomography

    Authors: Hengrong Lan, Jiali Gong, Fei Gao

    Abstract: Photoacoustic imaging (PAI) is a non-invasive imaging modality that detects the ultrasound signal generated from tissue with light excitation. Photoacoustic computed tomography (PACT) uses unfocused large-area light to illuminate the target with ultrasound transducer array for PA signal detection. Limited-view issue could cause a low-quality image in PACT due to the limitation of geometric conditi… ▽ More

    Submitted 7 November, 2021; originally announced November 2021.

    Comments: submitted the journal version

  24. arXiv:2109.08880  [pdf, other

    cs.CV cs.AI eess.IV

    Computational Imaging and Artificial Intelligence: The Next Revolution of Mobile Vision

    Authors: Jinli Suo, Weihang Zhang, Jin Gong, Xin Yuan, David J. Brady, Qionghai Dai

    Abstract: Signal capture stands in the forefront to perceive and understand the environment and thus imaging plays the pivotal role in mobile vision. Recent explosive progresses in Artificial Intelligence (AI) have shown great potential to develop advanced mobile platforms with new imaging devices. Traditional imaging systems based on the "capturing images first and processing afterwards" mechanism cannot m… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

  25. arXiv:2109.07342  [pdf

    cs.RO eess.SY

    Sequential Point Cloud Prediction in Interactive Scenarios: A Survey

    Authors: Haowen Wang, Zirui Li, Jianwei Gong

    Abstract: Point cloud has been widely used in the field of autonomous driving since it can provide a more comprehensive three-dimensional representation of the environment than 2D images. Point-wise prediction based on point cloud sequence (PCS) is an essential part of environment understanding, which can assist in the decision-making and motion-planning of autonomous vehicles. However, PCS prediction has n… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

  26. arXiv:2109.07210  [pdf

    cs.RO eess.SY

    Life-Long Multi-Task Learning of Adaptive Path Tracking Policy for Autonomous Vehicle

    Authors: Cheng Gong, Jianwei Gong, Chao Lu, Zhe Liu, Zirui Li

    Abstract: This paper proposes a life-long adaptive path tracking policy learning method for autonomous vehicles that can self-evolve and self-adapt with multi-task knowledge. Firstly, the proposed method can learn a model-free control policy for path tracking directly from the historical driving experience, where the property of vehicle dynamics and corresponding control strategy can be learned simultaneous… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

  27. Decision-Making in Driver-Automation Shared Control: A Review and Perspectives

    Authors: Wenshuo Wang, Xiaoxiang Na, Dongpu Cao, Jianwei Gong, Junqiang Xi, Yang Xi, Fei-Yue Wang

    Abstract: Shared control schemes allow a human driver to work with an automated driving agent in driver-vehicle systems while retaining the driver's abilities to control. The human driver, as an essential agent in the driver-vehicle shared control systems, should be precisely modeled regarding their cognitive processes, control strategies, and decision-making processes. The interactive strategy design betwe… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: 17 pages, 8 figures, journal

    Journal ref: IEEE/CAA Journal of Automatica Sinica, Vol. 7, No. 5, pp. 1289 --1307, 2020

  28. arXiv:2005.14629  [pdf

    cs.RO eess.SP

    Stealth UAV through Coanda Effect

    Authors: Dongyoon Shin, Hyeji Kim, Jihyuk Gong, Uijeong Jeong, Yeeun Jo, Eric Matson

    Abstract: This paper uses Coanda Effect to reduce motors, the source of noise, and finds low noise materials with sufficient lift force so that it can achieve acoustical stealth UAVs.According to NASA research [1], the noise of UAVs is better heard to people. But there must be some moments when we need to operate the drones quietly, so how can we reduce the noise? In previous research, there have also been… ▽ More

    Submitted 29 April, 2020; originally announced May 2020.

    Comments: 8 pages, 18 Figures, Accepted in The Fourth IEEE International Conference on Robotics Computing

  29. arXiv:2003.10916  [pdf, other

    cs.NI eess.SP

    Age of Processing: Age-driven Status Sampling and Processing Offloading for Edge Computing-enabled Real-time IoT Applications

    Authors: Rui Li, Qian Ma, Jie Gong, Zhi Zhou, Xu Chen

    Abstract: The freshness of status information is of great importance for time-critical Internet of Things (IoT) applications. A metric measuring status freshness is the age-of-information (AoI), which captures the time elapsed from the status being generated at the source node (e.g., a sensor) to the latest status update.However, in intelligent IoT applications such as video surveillance, the status informa… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: submitted for review

  30. arXiv:2002.09719  [pdf, ps, other

    cs.IT eess.SP

    Joint Transmission and Computing Scheduling for Status Update with Mobile Edge Computing

    Authors: Jie Gong, Qiaobin Kuang, Xiang Chen

    Abstract: Age of Information (AoI), defined as the time elapsed since the generation of the latest received update, is a promising performance metric to measure data freshness for real-time status monitoring. In many applications, status information needs to be extracted through computing, which can be processed at an edge server enabled by mobile edge computing (MEC). In this paper, we aim to minimize the… ▽ More

    Submitted 22 February, 2020; originally announced February 2020.

    Comments: 6 pages, 6 figures, accepted by IEEE ICC'20

  31. arXiv:2002.06400  [pdf, other

    cs.IT eess.SP

    Analysis on Computation-Intensive Status Update in Mobile Edge Computing

    Authors: Qiaobin Kuang, Jie Gong, Xiang Chen, Xiao Ma

    Abstract: In status update scenarios, the freshness of information is measured in terms of age-of-information (AoI), which essentially reflects the timeliness for real-time applications to transmit status update messages to a remote controller. For some applications, computational expensive and time consuming data processing is inevitable for status information of messages to be displayed. Mobile edge serve… ▽ More

    Submitted 15 February, 2020; originally announced February 2020.

    Comments: 13 pages

    MSC Class: Primary ACM Class: F.2.2

  32. arXiv:1812.07493  [pdf, other

    eess.SP

    A Time Efficient Approach for Decision-Making Style Recognition in Lane-Change Behavior

    Authors: Sen Yang, Wenshuo Wang, Chao Lu, Jianwei Gong, Junqiang Xi

    Abstract: Fast recognizing driver's decision-making style of changing lanes plays a pivotal role in safety-oriented and personalized vehicle control system design. This paper presents a time-efficient recognition method by integrating k-means clustering (k-MC) with K-nearest neighbor (KNN), called kMC-KNN. The mathematical morphology is implemented to automatically label the decision-making data into three… ▽ More

    Submitted 8 November, 2018; originally announced December 2018.