Skip to main content

Showing 1–50 of 96 results for author: Tan, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2509.25656  [pdf, ps, other

    eess.SP cs.IT

    Rotatable Antenna-Enabled Spectrum Sharing in Cognitive Radio Systems

    Authors: Yanhua Tan, Beixiong Zheng, Yi Fang, Derrick Wing Kwan Ng, Rui Zhang, Jie Xu

    Abstract: Rotatable antenna (RA) technology has recently drawn significant attention in wireless systems owing to its unique ability to exploit additional spatial degrees-of-freedom (DoFs) by dynamically adjusting the three-dimensional (3D) boresight direction of each antenna. In this letter, we propose a new RA-assisted cognitive radio (CR) system designed to achieve efficient spectrum sharing while mitiga… ▽ More

    Submitted 29 September, 2025; originally announced September 2025.

    Comments: 5 pages, 4 figures. Submitted to an lEEE journal for possible publication on September 24, 2025

  2. arXiv:2509.17435  [pdf, ps, other

    cs.RO eess.SY

    GPS Denied IBVS-Based Navigation and Collision Avoidance of UAV Using a Low-Cost RGB Camera

    Authors: Xiaoyu Wang, Yan Rui Tan, William Leong, Sunan Huang, Rodney Teo, Cheng Xiang

    Abstract: This paper proposes an image-based visual servoing (IBVS) framework for UAV navigation and collision avoidance using only an RGB camera. While UAV navigation has been extensively studied, it remains challenging to apply IBVS in missions involving multiple visual targets and collision avoidance. The proposed method achieves navigation without explicit path planning, and collision avoidance is reali… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

  3. arXiv:2509.03475  [pdf, ps, other

    math.OC cs.LG eess.IV

    From Image Denoisers to Regularizing Imaging Inverse Problems: An Overview

    Authors: Hong Ye Tan, Subhadip Mukherjee, Junqi Tang

    Abstract: Inverse problems lie at the heart of modern imaging science, with broad applications in areas such as medical imaging, remote sensing, and microscopy. Recent years have witnessed a paradigm shift in solving imaging inverse problems, where data-driven regularizers are used increasingly, leading to remarkably high-fidelity reconstruction. A particularly notable approach for data-driven regularizatio… ▽ More

    Submitted 3 September, 2025; originally announced September 2025.

    MSC Class: 65K15; 49J52

  4. arXiv:2508.14922  [pdf

    q-bio.QM cs.AI cs.CV eess.IV

    Fusing Structural Phenotypes with Functional Data for Early Prediction of Primary Angle Closure Glaucoma Progression

    Authors: Swati Sharma, Thanadet Chuangsuwanich, Royston K. Y. Tan, Shimna C. Prasad, Tin A. Tun, Shamira A. Perera, Martin L. Buist, Tin Aung, Monisha E. Nongpiur, Michaël J. A. Girard

    Abstract: Purpose: To classify eyes as slow or fast glaucoma progressors in patients with primary angle closure glaucoma (PACG) using an integrated approach combining optic nerve head (ONH) structural features and sector-based visual field (VF) functional parameters. Methods: PACG patients with >5 reliable VF tests over >5 years were included. Progression was assessed in Zeiss Forum, with baseline VF within… ▽ More

    Submitted 19 August, 2025; originally announced August 2025.

    Comments: 23 pages, 5 figures, 3 tables

  5. arXiv:2507.10849  [pdf, ps, other

    cs.AR eess.SY

    OpenGCRAM: An Open-Source Gain Cell Compiler Enabling Design-Space Exploration for AI Workloads

    Authors: Xinxin Wang, Lixian Yan, Shuhan Liu, Luke Upton, Zhuoqi Cai, Yiming Tan, Shengman Li, Koustav Jana, Peijing Li, Jesse Cirimelli-Low, Thierry Tambe, Matthew Guthaus, H. -S. Philip Wong

    Abstract: Gain Cell memory (GCRAM) offers higher density and lower power than SRAM, making it a promising candidate for on-chip memory in domain-specific accelerators. To support workloads with varying traffic and lifetime metrics, GCRAM also offers high bandwidth, ultra low leakage power and a wide range of retention times, which can be adjusted through transistor design (like threshold voltage and channel… ▽ More

    Submitted 14 July, 2025; originally announced July 2025.

  6. arXiv:2507.02668  [pdf, ps, other

    eess.IV cs.CV

    MEGANet-W: A Wavelet-Driven Edge-Guided Attention Framework for Weak Boundary Polyp Detection

    Authors: Zhe Yee Tan, Ashwaq Qasem

    Abstract: Colorectal polyp segmentation is critical for early detection of colorectal cancer, yet weak and low contrast boundaries significantly limit automated accuracy. Existing deep models either blur fine edge details or rely on handcrafted filters that perform poorly under variable imaging conditions. We propose MEGANet-W, a Wavelet Driven Edge Guided Attention Network that injects directional, paramet… ▽ More

    Submitted 17 September, 2025; v1 submitted 3 July, 2025; originally announced July 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  7. arXiv:2507.01841  [pdf, ps, other

    cs.LG cs.IT eess.SP math.OC

    Automatic Rank Determination for Low-Rank Adaptation via Submodular Function Maximization

    Authors: Yihang Gao, Vincent Y. F. Tan

    Abstract: In this paper, we propose SubLoRA, a rank determination method for Low-Rank Adaptation (LoRA) based on submodular function maximization. In contrast to prior approaches, such as AdaLoRA, that rely on first-order (linearized) approximations of the loss function, SubLoRA utilizes second-order information to capture the potentially complex loss landscape by incorporating the Hessian matrix. We show t… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

  8. arXiv:2506.08534  [pdf, ps, other

    eess.IV cs.AI cs.CV

    DCD: A Semantic Segmentation Model for Fetal Ultrasound Four-Chamber View

    Authors: Donglian Li, Hui Guo, Minglang Chen, Huizhen Chen, Jialing Chen, Bocheng Liang, Pengchen Liang, Ying Tan

    Abstract: Accurate segmentation of anatomical structures in the apical four-chamber (A4C) view of fetal echocardiography is essential for early diagnosis and prenatal evaluation of congenital heart disease (CHD). However, precise segmentation remains challenging due to ultrasound artifacts, speckle noise, anatomical variability, and boundary ambiguity across different gestational stages. To reduce the workl… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  9. arXiv:2505.18174  [pdf, ps, other

    eess.SP cs.AI cs.LG

    NMCSE: Noise-Robust Multi-Modal Coupling Signal Estimation Method via Optimal Transport for Cardiovascular Disease Detection

    Authors: Peihong Zhang, Zhixin Li, Rui Sang, Yuxuan Liu, Yiqiang Cai, Yizhou Tan, Shengchen Li

    Abstract: Electrocardiogram (ECG) and Phonocardiogram (PCG) signals are linked by a latent coupling signal representing the electrical-to-mechanical cardiac transformation. While valuable for cardiovascular disease (CVD) detection, this coupling signal is traditionally estimated using deconvolution methods that amplify noise, limiting clinical utility. In this paper, we propose Noise-Robust Multi-Modal Coup… ▽ More

    Submitted 2 June, 2025; v1 submitted 14 May, 2025; originally announced May 2025.

  10. arXiv:2504.19362  [pdf, other

    eess.IV cs.AI cs.CV

    Low-Rank Adaptive Structural Priors for Generalizable Diabetic Retinopathy Grading

    Authors: Yunxuan Wang, Ray Yin, Yumei Tan, Hao Chen, Haiying Xia

    Abstract: Diabetic retinopathy (DR), a serious ocular complication of diabetes, is one of the primary causes of vision loss among retinal vascular diseases. Deep learning methods have been extensively applied in the grading of diabetic retinopathy (DR). However, their performance declines significantly when applied to data outside the training distribution due to domain shifts. Domain generalization (DG) ha… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: Accepted by IJCNN 2025

  11. arXiv:2503.22605  [pdf, ps, other

    cs.GR cs.CV cs.SD eess.AS

    Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis

    Authors: Shuai Shen, Wanhua Li, Yunpeng Zhang, Yap-Peng Tan, Jiwen Lu

    Abstract: Talking head synthesis has emerged as a prominent research topic in computer graphics and multimedia, yet most existing methods often struggle to strike a balance between generation quality and computational efficiency, particularly under real-time constraints. In this paper, we propose a novel framework that integrates Gaussian Splatting with a structured Audio Factorization Plane (Audio-Plane) t… ▽ More

    Submitted 26 June, 2025; v1 submitted 28 March, 2025; originally announced March 2025.

    Comments: Demo video at \url{https://sstzal.github.io/Audio-Plane/}

  12. arXiv:2503.21818  [pdf

    eess.IV cs.CV

    Deep Learning-Based Quantitative Assessment of Renal Chronicity Indices in Lupus Nephritis

    Authors: Tianqi Tu, Hui Wang, Jiangbo Pei, Xiaojuan Yu, Aidong Men, Suxia Wang, Qingchao Chen, Ying Tan, Feng Yu, Minghui Zhao

    Abstract: Background: Renal chronicity indices (CI) have been identified as strong predictors of long-term outcomes in lupus nephritis (LN) patients. However, assessment by pathologists is hindered by challenges such as substantial time requirements, high interobserver variation, and susceptibility to fatigue. This study aims to develop an effective deep learning (DL) pipeline that automates the assessment… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

  13. arXiv:2503.16862  [pdf, ps, other

    cs.SD cs.CV eess.AS

    Improving Acoustic Scene Classification with City Features

    Authors: Yiqiang Cai, Yizhou Tan, Shengchen Li, Xi Shao, Mark D. Plumbley

    Abstract: Acoustic scene recordings are often collected from a diverse range of cities. Most existing acoustic scene classification (ASC) approaches focus on identifying common acoustic scene patterns across cities to enhance generalization. However, the potential acoustic differences introduced by city-specific environmental and cultural factors are overlooked. In this paper, we hypothesize that the city-s… ▽ More

    Submitted 12 June, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

  14. arXiv:2502.20311  [pdf, other

    cs.LG cs.SD eess.AS

    Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications

    Authors: Marcus Yu Zhe Wee, Justin Juin Hng Wong, Lynus Lim, Joe Yu Wei Tan, Prannaya Gupta, Dillion Lim, En Hao Tew, Aloysius Keng Siew Han, Yong Zhi Lim

    Abstract: Effective communication in Air Traffic Control (ATC) is critical to maintaining aviation safety, yet the challenges posed by accented English remain largely unaddressed in Automatic Speech Recognition (ASR) systems. Existing models struggle with transcription accuracy for Southeast Asian-accented (SEA-accented) speech, particularly in noisy ATC environments. This study presents the development of… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  15. arXiv:2502.18768  [pdf, other

    math.OC eess.SY

    Stabilization of singularly perturbed networked control systems over a single channel

    Authors: Weixuan Wang, Alejandro I. Maass, Dragan Nešić, Ying Tan, Romain Postoyan, W. P. M. H. Heemels

    Abstract: This paper studies the emulation-based stabilization of nonlinear networked control systems with two time scales. We address the challenge of using a single communication channel for transmitting both fast and slow variables between the plant and the controller. A novel dual clock mechanism is proposed to schedule transmissions for this purpose. The system is modeled as a hybrid singularly perturb… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

    Comments: 16 pages, 2 figures, submitted to Automatica

  16. arXiv:2502.17097  [pdf, other

    eess.SY

    Rotatable Antenna Enabled Wireless Communication System with Visual Recognition: A Prototype Implementation

    Authors: Liang Dai, Beixiong Zheng, Yanhua Tan, Lipeng Zhu, Fangjiong Chen, Rui Zhang

    Abstract: Rotatable antenna (RA) is an emerging technology that has great potential to exploit additional spatial degrees of freedom (DoFs) by flexibly altering the three-dimensional (3D) orientation/boresight of each antenna. In this demonstration, we present a prototype of the RA-enabled wireless communication system with a visual recognition module to evaluate the performance gains provided by the RA in… ▽ More

    Submitted 23 March, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

  17. arXiv:2412.01035  [pdf, other

    cs.LG eess.SY

    Adaptive Traffic Element-Based Streetlight Control Using Neighbor Discovery Algorithm Based on IoT Events

    Authors: Yupeng Tan, Sheng Xu, Chengyue Su

    Abstract: Intelligent streetlight systems divide the streetlight network into multiple sectors, activating only the streetlights in the corresponding sectors when traffic elements pass by, rather than all streetlights, effectively reducing energy waste. This strategy requires streetlights to understand their neighbor relationships to illuminate only the streetlights in their respective sectors. However, man… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

  18. arXiv:2409.11391  [pdf, other

    eess.IV eess.SY

    Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements

    Authors: Jipeng Yan, Qingyuan Tan, Shusei Kawara, Jingwen Zhu, Bingxue Wang, Matthieu Toulemonde, Honghai Liu, Ying Tan, Meng-Xing Tang

    Abstract: Super-Resolution Ultrasound (SRUS) imaging through localising and tracking microbubbles, also known as Ultrasound Localisation Microscopy (ULM), has demonstrated significant potential for reconstructing microvasculature and flows with sub-diffraction resolution in clinical diagnostics. However, imaging organs with large tissue movements, such as those caused by respiration, presents substantial ch… ▽ More

    Submitted 25 March, 2025; v1 submitted 17 September, 2024; originally announced September 2024.

  19. arXiv:2409.09910  [pdf

    eess.IV

    Self-Supervised Elimination of Non-Independent Noise in Hyperspectral Imaging

    Authors: Guangrui Ding, Chang Liu, Jiaze Yin, Xinyan Teng, Yuying Tan, Hongjian He, Haonan Lin, Lei Tian, Ji-Xin Cheng

    Abstract: Hyperspectral imaging has been widely used for spectral and spatial identification of target molecules, yet often contaminated by sophisticated noise. Current denoising methods generally rely on independent and identically distributed noise statistics, showing corrupted performance for non-independent noise removal. Here, we demonstrate Self-supervised PErmutation Noise2noise Denoising (SPEND), a… ▽ More

    Submitted 15 September, 2024; originally announced September 2024.

  20. arXiv:2409.06580  [pdf, other

    eess.AS cs.SD

    Exploring Differences between Human Perception and Model Inference in Audio Event Recognition

    Authors: Yizhou Tan, Yanru Wu, Yuanbo Hou, Xin Xu, Hui Bu, Shengchen Li, Dick Botteldooren, Mark D. Plumbley

    Abstract: Audio Event Recognition (AER) traditionally focuses on detecting and identifying audio events. Most existing AER models tend to detect all potential events without considering their varying significance across different contexts. This makes the AER results detected by existing models often have a large discrepancy with human auditory perception. Although this is a critical and significant issue, i… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: Dataset homepage: https://github.com/Voltmeter00/MAFAR

  21. arXiv:2408.16315   

    cs.HC cs.LG eess.SP

    Passenger hazard perception based on EEG signals for highly automated driving vehicles

    Authors: Ashton Yu Xuan Tan, Yingkai Yang, Xiaofei Zhang, Bowen Li, Xiaorong Gao, Sifa Zheng, Jianqiang Wang, Xinyu Gu, Jun Li, Yang Zhao, Yuxin Zhang, Tania Stathaki

    Abstract: Enhancing the safety of autonomous vehicles is crucial, especially given recent accidents involving automated systems. As passengers in these vehicles, humans' sensory perception and decision-making can be integrated with autonomous systems to improve safety. This study explores neural mechanisms in passenger-vehicle interactions, leading to the development of a Passenger Cognitive Model (PCM) and… ▽ More

    Submitted 27 March, 2025; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: We have decided to withdraw this submission due to ongoing revisions and further refinements in our research. A revised version may be resubmitted in the future. We appreciate the feedback and interest from the community

  22. arXiv:2407.14111  [pdf, other

    eess.SP cs.IT cs.LG

    A Mirror Descent-Based Algorithm for Corruption-Tolerant Distributed Gradient Descent

    Authors: Shuche Wang, Vincent Y. F. Tan

    Abstract: Distributed gradient descent algorithms have come to the fore in modern machine learning, especially in parallelizing the handling of large datasets that are distributed across several workers. However, scant attention has been paid to analyzing the behavior of distributed gradient descent algorithms in the presence of adversarial corruptions instead of random noise. In this paper, we formulate a… ▽ More

    Submitted 5 February, 2025; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: Accepted to the IEEE Transactions on Signal Processing

  23. arXiv:2406.04881  [pdf, other

    cs.IT eess.SP

    MIMO Capacity Analysis and Channel Estimation for Electromagnetic Information Theory

    Authors: Jieao Zhu, Vincent Y. F. Tan, Linglong Dai

    Abstract: Electromagnetic information theory (EIT) is an interdisciplinary subject that serves to integrate deterministic electromagnetic theory with stochastic Shannon's information theory. Existing EIT analysis operates in the continuous space domain, which is not aligned with the practical algorithms working in the discrete space domain. This mismatch leads to a significant difficulty in application of E… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Submitted to the IEEE TWC. In this paper, we established the discrete-continuous correspondence for electromagnetic information theory (EIT), thus enabling analytical tools in the continuous space domain to be applied to discrete space MIMO architectures. Simulation codes will be provided at http://oa.ee.tsinghua.edu.cn/dailinglong/publications/publications.html

  24. arXiv:2405.06165  [pdf, other

    eess.SY

    Resilient control of networked switched systems subject to deception attack and DoS attack

    Authors: Rui Zhao, Zhiqiang Zuo, Ying Tan, Yijing Wang, Wentao Zhang

    Abstract: In this paper, the resilient control for switched systems in the presence of deception attack and denial-of-service (DoS) attack is addressed. Due to the interaction of two kinds of attacks and the asynchronous phenomenon of controller mode and subsystem mode, the system dynamics becomes much more complex. A criterion is derived to ensure the mean square security level of the closed-loop system. T… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  25. arXiv:2404.15294  [pdf

    eess.SP cs.LG

    Multimodal Physical Fitness Monitoring (PFM) Framework Based on TimeMAE-PFM in Wearable Scenarios

    Authors: Junjie Zhang, Zheming Zhang, Huachen Xiang, Yangquan Tan, Linnan Huo, Fengyi Wang

    Abstract: Physical function monitoring (PFM) plays a crucial role in healthcare especially for the elderly. Traditional assessment methods such as the Short Physical Performance Battery (SPPB) have failed to capture the full dynamic characteristics of physical function. Wearable sensors such as smart wristbands offer a promising solution to this issue. However, challenges exist, such as the computational co… ▽ More

    Submitted 25 March, 2024; originally announced April 2024.

    Comments: 5 pages, 6 figures

  26. arXiv:2403.19168  [pdf

    eess.SY

    Tunable Superconducting Magnetic Levitation with Self-Stability

    Authors: Qi Xu, Yi Lin, Yunfei Tan, Jianzhao Geng

    Abstract: Magnetic levitation based on the flux pinning nature of type II superconductors has the merit of self-stability, making it appealing for applications such as high speed bearings, maglev trains, space generators, etc. However, such levitation systems physically rely on the superconductor pre-capturing magnetic flux (i.e. field cooling process) before establishing the levitation state which is nonad… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 15pages,5 figures

  27. arXiv:2402.09421  [pdf, other

    eess.SP cs.LG

    EEG Based Generative Depression Discriminator

    Authors: Ziming Mao, Hao wu, Yongxi Tan, Yuhe Jin

    Abstract: Depression is a very common but serious mood disorder.In this paper, We built a generative detection network(GDN) in accordance with three physiological laws. Our aim is that we expect the neural network to learn the relevant brain activity based on the EEG signal and, at the same time, to regenerate the target electrode signal based on the brain activity. We trained two generators, the first one… ▽ More

    Submitted 19 January, 2024; originally announced February 2024.

  28. arXiv:2401.03615  [pdf, other

    eess.IV cs.CV cs.LG

    Automated Detection of Myopic Maculopathy in MMAC 2023: Achievements in Classification, Segmentation, and Spherical Equivalent Prediction

    Authors: Yihao Li, Philippe Zhang, Yubo Tan, Jing Zhang, Zhihan Wang, Weili Jiang, Pierre-Henri Conze, Mathieu Lamard, Gwenolé Quellec, Mostafa El Habib Daho

    Abstract: Myopic macular degeneration is the most common complication of myopia and the primary cause of vision loss in individuals with pathological myopia. Early detection and prompt treatment are crucial in preventing vision impairment due to myopic maculopathy. This was the focus of the Myopic Maculopathy Analysis Challenge (MMAC), in which we participated. In task 1, classification of myopic maculopath… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 18 pages

  29. arXiv:2312.09521  [pdf, other

    eess.SY

    Multi-Objective Complementary Control

    Authors: Jiapeng Xu, Xiang Chen, Ying Tan, Kemin Zhou

    Abstract: This paper proposes a novel multi-objective control framework for linear time-invariant systems in which performance and robustness can be achieved in a complementary way instead of a trade-off. In particular, a state-space solution is first established for a new stabilizing control structure consisting of two independently designed controllers coordinated with a Youla-type operator ${\bm Q}$. It… ▽ More

    Submitted 13 November, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

  30. arXiv:2312.06178  [pdf, other

    eess.SY

    Adaptive Event-triggered Control For Strict-feedback Systems With Time-varying Parameters

    Authors: Yan Tan, Liucang Wu, Wenqi Liu

    Abstract: In this article, we develop a new adaptive event-triggered asymptotic control scheme for strict-feedback systems with fast time-varying parameters. To deal with time-varying parameters with unknown variation boundaries in the feedback path and the input path, we construct three adaptive laws for parameter estimation, two for the uncertain parameters in the feedback path and one for the uncertain p… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 9 pages, 6 figures

  31. arXiv:2311.06552  [pdf, other

    eess.IV cs.CV cs.LG

    Stain Consistency Learning: Handling Stain Variation for Automatic Digital Pathology Segmentation

    Authors: Michael Yeung, Todd Watts, Sean YW Tan, Pedro F. Ferreira, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Stain variation is a unique challenge associated with automated analysis of digital pathology. Numerous methods have been developed to improve the robustness of machine learning methods to stain variation, but comparative studies have demonstrated limited benefits to performance. Moreover, methods to handle stain variation were largely developed for H&E stained data, with evaluation generally limi… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

  32. arXiv:2310.08292  [pdf, other

    eess.SP cs.AI

    Concealed Electronic Countermeasures of Radar Signal with Adversarial Examples

    Authors: Ruinan Ma, Canjie Zhu, Mingfeng Lu, Yunjie Li, Yu-an Tan, Ruibin Zhang, Ran Tao

    Abstract: Electronic countermeasures involving radar signals are an important aspect of modern warfare. Traditional electronic countermeasures techniques typically add large-scale interference signals to ensure interference effects, which can lead to attacks being too obvious. In recent years, AI-based attack methods have emerged that can effectively solve this problem, but the attack scenarios are currentl… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  33. arXiv:2310.08089  [pdf, other

    cs.GT eess.SY stat.ML

    Learning Regularized Monotone Graphon Mean-Field Games

    Authors: Fengzhuo Zhang, Vincent Y. F. Tan, Zhaoran Wang, Zhuoran Yang

    Abstract: This paper studies two fundamental problems in regularized Graphon Mean-Field Games (GMFGs). First, we establish the existence of a Nash Equilibrium (NE) of any $λ$-regularized GMFG (for $λ\geq 0$). This result relies on weaker conditions than those in previous works for analyzing both unregularized GMFGs ($λ=0$) and $λ$-regularized MFGs, which are special cases of GMFGs. Second, we propose provab… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  34. arXiv:2307.05893  [pdf, ps, other

    eess.SP cs.LG

    Deep Unrolling for Nonconvex Robust Principal Component Analysis

    Authors: Elizabeth Z. C. Tan, Caroline Chaux, Emmanuel Soubies, Vincent Y. F. Tan

    Abstract: We design algorithms for Robust Principal Component Analysis (RPCA) which consists in decomposing a matrix into the sum of a low rank matrix and a sparse matrix. We propose a deep unrolled algorithm based on an accelerated alternating projection algorithm which aims to solve RPCA in its nonconvex form. The proposed procedure combines benefits of deep neural networks and the interpretability of the… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 7 pages, 3 figures; Accepted to the 2023 IEEE International Workshop on Machine Learning for Signal Processing

  35. arXiv:2307.00824   

    eess.SY cs.MA

    Sufficient Conditions on Bipartite Consensus of Weakly Connected Matrix-weighted Networks

    Authors: Chongzhi Wang, Haibin Shao, Ying Tan, Dewei Li

    Abstract: Recent advancements in bipartite consensus, a scenario where agents are divided into two disjoint sets with agents in the same set agreeing on a certain value and those in different sets agreeing on opposite or specifically related values, have highlighted its potential applications across various fields. Traditional research typically relies on the presence of a positive-negative spanning tree, w… ▽ More

    Submitted 28 September, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: There is a misstatement in Section 3.2 about the condition of the main Theorem, as in "Assumption 2 is a necessary condition". In addition, example in Fig. 2 needs to be adjusted

  36. arXiv:2305.19557  [pdf, other

    math.OC cs.LG eess.SP stat.ML

    Dictionary Learning under Symmetries via Group Representations

    Authors: Subhroshekhar Ghosh, Aaron Y. R. Low, Yong Sheng Soh, Zhuohang Feng, Brendan K. Y. Tan

    Abstract: The dictionary learning problem can be viewed as a data-driven process to learn a suitable transformation so that data is sparsely represented directly from example data. In this paper, we examine the problem of learning a dictionary that is invariant under a pre-specified group of transformations. Natural settings include Cryo-EM, multi-object tracking, synchronization, pose estimation, etc. We s… ▽ More

    Submitted 25 July, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 29 pages, 2 figures

  37. arXiv:2305.12301  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Sentence Embedder Guided Utterance Encoder (SEGUE) for Spoken Language Understanding

    Authors: Yi Xuan Tan, Navonil Majumder, Soujanya Poria

    Abstract: The pre-trained speech encoder wav2vec 2.0 performs very well on various spoken language understanding (SLU) tasks. However, on many tasks, it trails behind text encoders with textual input. To improve the understanding capability of SLU encoders, various studies have used knowledge distillation to transfer knowledge from natural language understanding (NLU) encoders. We use a very simple method o… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: Interspeech 2023

  38. arXiv:2305.08882  [pdf, other

    eess.IV physics.med-ph physics.optics

    Model-driven CT reconstruction algorithm for nano-resolution X-ray phase contrast imaging

    Authors: Xuebao Cai, Yuhang Tan, Ting Su, Dong Liang, Hairong Zheng, Jinyou Xu, Peiping Zhu, Yongshuai Ge

    Abstract: The low-density imaging performance of a zone plate based nano-resolution hard X-ray computed tomography (CT) system can be significantly improved by incorporating a grating-based Lau interferometer. Due to the diffraction, however, the acquired nano-resolution phase signal may suffer splitting problem, which impedes the direct reconstruction of phase contrast CT (nPCT) images. To overcome, a new… ▽ More

    Submitted 13 October, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

  39. arXiv:2304.00138  [pdf, other

    eess.SY

    Robust Tracking Control for Nonlinear Systems: Performance optimization via extremum seeking

    Authors: Jiapeng Xu, Ying Tan, Xiang Chen

    Abstract: This paper presents a controller design and optimization framework for nonlinear dynamic systems to track a given reference signal in the presence of disturbances when the task is repeated over a finite-time interval. This novel framework mainly consists of two steps. The first step is to design a robust linear quadratic tracking controller based on the existing control structure with a Youla-type… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

  40. arXiv:2302.14677  [pdf, other

    cs.CV cs.CR eess.IV

    Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger

    Authors: Yi Yu, Yufei Wang, Wenhan Yang, Shijian Lu, Yap-peng Tan, Alex C. Kot

    Abstract: Recent deep-learning-based compression methods have achieved superior performance compared with traditional approaches. However, deep learning models have proven to be vulnerable to backdoor attacks, where some specific trigger patterns added to the input can lead to malicious behavior of the models. In this paper, we present a novel backdoor attack with multiple triggers against learned image com… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted by CVPR 2023

    ACM Class: I.4

  41. arXiv:2302.00812  [pdf, other

    eess.SY

    Event-triggered Hybrid Energy-aware Scheduling in Manufacturing Systems

    Authors: Zhean Shao, Wen Li, Ying Tan

    Abstract: Incorporating renewable energy sources (RESs) into manufacturing systems has been an active research area in order to address many challenges originating from the unpredictable nature of RESs such as photovoltaics.In the energy-aware scheduling for manufacturing systems, the traditional off-line scheduling techniques cannot always work well due to their lack of robustness with respect to uncertain… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  42. arXiv:2301.09362  [pdf, other

    cs.SD cs.LG eess.AS

    A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era

    Authors: Zhao Ren, Yi Chang, Thanh Tam Nguyen, Yang Tan, Kun Qian, Björn W. Schuller

    Abstract: Heart sound auscultation has been applied in clinical usage for early screening of cardiovascular diseases. Due to the high demand for auscultation expertise, automatic auscultation can help with auxiliary diagnosis and reduce the burden of training professional clinicians. Nevertheless, there is a limit to classic machine learning's performance improvement in the era of big data. Deep learning ha… ▽ More

    Submitted 11 May, 2024; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted by IEEE Computational Intelligence Magazine

  43. arXiv:2301.00934  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Finding the Most Transferable Tasks for Brain Image Segmentation

    Authors: Yicong Li, Yang Tan, Jingyun Yang, Yang Li, Xiao-Ping Zhang

    Abstract: Although many studies have successfully applied transfer learning to medical image segmentation, very few of them have investigated the selection strategy when multiple source tasks are available for transfer. In this paper, we propose a prior knowledge guided and transferability based framework to select the best source tasks among a collection of brain image segmentation tasks, to improve the tr… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: Accepted by BIBM 2022

  44. arXiv:2211.16433   

    eess.SY

    On Robust Observer Design for System Motion on SE(3) Using Onboard Visual Sensors

    Authors: Tong Zhang, Ying Tan, Xiang Chen, Zike Lei

    Abstract: Onboard visual sensing has been widely used in the unmanned ground vehicle (UGV) and/or unmanned aerial vehicle (UAV), which can be modeled as dynamic systems on SE(3). The onboard sensing outputs of the dynamic system can usually be applied to derive the relative position between the feature marks and the system, but bearing with explicit geometrical constraint. Such a visual geometrical constrai… ▽ More

    Submitted 21 March, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Need Further Improvement

  45. arXiv:2211.11509  [pdf, other

    eess.IV cs.CV cs.LG

    Segmentation, Classification, and Quality Assessment of UW-OCTA Images for the Diagnosis of Diabetic Retinopathy

    Authors: Yihao Li, Rachid Zeghlache, Ikram Brahim, Hui Xu, Yubo Tan, Pierre-Henri Conze, Mathieu Lamard, Gwenolé Quellec, Mostafa El Habib Daho

    Abstract: Diabetic Retinopathy (DR) is a severe complication of diabetes that can cause blindness. Although effective treatments exist (notably laser) to slow the progression of the disease and prevent blindness, the best treatment remains prevention through regular check-ups (at least once a year) with an ophthalmologist. Optical Coherence Tomography Angiography (OCTA) allows for the visualization of the r… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  46. arXiv:2210.15152  [pdf, ps, other

    eess.SY

    Robust output regulation of linear system subject to modeled and unmodeled uncertainty

    Authors: Zhicheng Zhang, Zhiqiang Zuo, Xiang Chen, Ying Tan, Yijing Wang

    Abstract: In this paper, a novel robust output regulation control framework is proposed for the system subject to noise, modeled disturbance and unmodeled disturbance to seek tracking performance and robustness simultaneously. The output regulation scheme is utilized in the framework to track the reference in the presence of modeled disturbance, and the effect of unmodeled disturbance is reduced by an… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  47. arXiv:2210.06664  [pdf

    eess.IV cs.AI cs.CV

    Are Macula or Optic Nerve Head Structures better at Diagnosing Glaucoma? An Answer using AI and Wide-Field Optical Coherence Tomography

    Authors: Charis Y. N. Chiang, Fabian Braeu, Thanadet Chuangsuwanich, Royston K. Y. Tan, Jacqueline Chua, Leopold Schmetterer, Alexandre Thiery, Martin Buist, Michaël J. A. Girard

    Abstract: Purpose: (1) To develop a deep learning algorithm to automatically segment structures of the optic nerve head (ONH) and macula in 3D wide-field optical coherence tomography (OCT) scans; (2) To assess whether 3D macula or ONH structures (or the combination of both) provide the best diagnostic power for glaucoma. Methods: A cross-sectional comparative study was performed which included wide-field sw… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 23 pages, 5 figures

  48. arXiv:2208.00025  [pdf, other

    eess.SP

    Six-center Assessment of CNN-Transformer with Belief Matching Loss for Patient-independent Seizure Detection in EEG

    Authors: Wei Yan Peh, Prasanth Thangavel, Yuanyuan Yao, John Thomas, Yee Leng Tan, Justin Dauwels

    Abstract: Neurologists typically identify epileptic seizures from electroencephalograms (EEGs) by visual inspection. This process is often time-consuming, especially for EEG recordings that last hours or days. To expedite the process, a reliable, automated, and patient-independent seizure detector is essential. However, developing a patient-independent seizure detector is challenging as seizures exhibit div… ▽ More

    Submitted 22 November, 2022; v1 submitted 29 July, 2022; originally announced August 2022.

    Comments: Submitting to IJNS

  49. arXiv:2206.09620  [pdf, other

    cs.IT eess.SP

    Asymptotic Nash Equilibrium for the $M$-ary Sequential Adversarial Hypothesis Testing Game

    Authors: Jiachun Pan, Yonglong Li, Vincent Y. F. Tan

    Abstract: In this paper, we consider a novel $M$-ary sequential hypothesis testing problem in which an adversary is present and perturbs the distributions of the samples before the decision maker observes them. This problem is formulated as a sequential adversarial hypothesis testing game played between the decision maker and the adversary. This game is a zero-sum and strategic one. We assume the adversary… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: The paper was presented in part at the 2022 International Symposium on Information Theory (ISIT). It has been submitted to IEEE Transactions on Information Forensics and Security

  50. arXiv:2204.00630  [pdf, other

    eess.IV cs.CV

    Extremely Low-light Image Enhancement with Scene Text Restoration

    Authors: Pohao Hsu, Che-Tsung Lin, Chun Chet Ng, Jie-Long Kew, Mei Yih Tan, Shang-Hong Lai, Chee Seng Chan, Christopher Zach

    Abstract: Deep learning-based methods have made impressive progress in enhancing extremely low-light images - the image quality of the reconstructed images has generally improved. However, we found out that most of these methods could not sufficiently recover the image details, for instance, the texts in the scene. In this paper, a novel image enhancement framework is proposed to precisely restore the scene… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.