Skip to main content

Showing 1–50 of 50 results for author: Tan, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2509.24395  [pdf, ps, other

    eess.AS cs.SD

    Unsupervised Single-Channel Speech Separation with a Diffusion Prior under Speaker-Embedding Guidance

    Authors: Runwu Shi, Kai Li, Chang Li, Jiang Wang, Sihan Tan, Kazuhiro Nakadai

    Abstract: Speech separation is a fundamental task in audio processing, typically addressed with fully supervised systems trained on paired mixtures. While effective, such systems typically rely on synthetic data pipelines, which may not reflect real-world conditions. Instead, we revisit the source-model paradigm, training a diffusion generative model solely on anechoic speech and formulating separation as a… ▽ More

    Submitted 29 September, 2025; originally announced September 2025.

    Comments: 5 pages, 2 figures, submitted to ICASSP 2026

  2. arXiv:2509.10055  [pdf

    eess.SY

    Data-driven optimization of sparse sensor placement in thermal hydraulic experiments

    Authors: Xicheng Wang, Yun. Feng, Dmitry Grishchenko, Pavel Kudinov, Ruifeng Tian, Sichao Tan

    Abstract: Thermal-Hydraulic (TH) experiments provide valuable insight into the physics of heat and mass transfer and qualified data for code development, calibration and validation. However, measurements are typically collected from sparsely distributed sensors, offering limited coverage over the domain of interest and phenomena of interest. Determination of the spatial configuration of these sensors is cru… ▽ More

    Submitted 12 September, 2025; originally announced September 2025.

  3. arXiv:2508.11115  [pdf, ps, other

    cs.CV cs.HC eess.SP

    UWB-PostureGuard: A Privacy-Preserving RF Sensing System for Continuous Ergonomic Sitting Posture Monitoring

    Authors: Haotang Li, Zhenyu Qi, Sen He, Kebin Peng, Sheng Tan, Yili Ren, Tomas Cerny, Jiyue Zhao, Zi Wang

    Abstract: Improper sitting posture during prolonged computer use has become a significant public health concern. Traditional posture monitoring solutions face substantial barriers, including privacy concerns with camera-based systems and user discomfort with wearable sensors. This paper presents UWB-PostureGuard, a privacy-preserving ultra-wideband (UWB) sensing system that advances mobile technologies for… ▽ More

    Submitted 14 August, 2025; originally announced August 2025.

  4. arXiv:2507.15385  [pdf, ps, other

    eess.SY

    Transformer-based Deep Learning Model for Joint Routing and Scheduling with Varying Electric Vehicle Numbers

    Authors: Jun Kang Yap, Vishnu Monn Baskaran, Wen Shan Tan, Ze Yang Ding, Hao Wang, David L. Dowe

    Abstract: The growing integration of renewable energy sources in modern power systems has introduced significant operational challenges due to their intermittent and uncertain outputs. In recent years, mobile energy storage systems (ESSs) have emerged as a popular flexible resource for mitigating these challenges. Compared to stationary ESSs, mobile ESSs offer additional spatial flexibility, enabling cost-e… ▽ More

    Submitted 21 July, 2025; originally announced July 2025.

    Comments: Accepted at Industry Applications Society Annual Meeting (IAS 2025)

  5. arXiv:2507.15307  [pdf, ps, other

    eess.SY

    Joint Optimisation of Electric Vehicle Routing and Scheduling: A Deep Learning-Driven Approach for Dynamic Fleet Sizes

    Authors: Jun Kang Yap, Vishnu Monn Baskaran, Wen Shan Tan, Ze Yang Ding, Hao Wang, David L. Dowe

    Abstract: Electric Vehicles (EVs) are becoming increasingly prevalent nowadays, with studies highlighting their potential as mobile energy storage systems to provide grid support. Realising this potential requires effective charging coordination, which are often formulated as mixed-integer programming (MIP) problems. However, MIP problems are NP-hard and often intractable when applied to time-sensitive task… ▽ More

    Submitted 21 July, 2025; originally announced July 2025.

    Comments: Accepted at International Joint Conference on Neural Networks (IJCNN 2025)

  6. arXiv:2507.00270  [pdf, ps, other

    eess.SY

    EMSpice 2.1: A Coupled EM and IR Drop Analysis Tool with Joule Heating and Thermal Map Integration for VLSI Reliability

    Authors: Subed Lamichhane, Haotian Lu, Sheldon X. -D. Tan

    Abstract: Electromigration (EM) remains a critical reliability concern in current and future copper-based VLSI circuits. As technology scales down, EM-induced IR drop becomes increasingly severe. While several EM-aware IR drop analysis tools have been proposed, few incorporate the real impact of temperature distribution on both EM and IR drop effects. In this work, we introduce EMSpice 2.1, an enhanced tool… ▽ More

    Submitted 30 June, 2025; originally announced July 2025.

    Comments: 4 Pages, accepted to SMACD 2025

  7. arXiv:2506.00564  [pdf, ps, other

    eess.IV cs.CV

    Image Restoration Learning via Noisy Supervision in the Fourier Domain

    Authors: Haosen Liu, Jiahao Liu, Shan Tan, Edmund Y. Lam

    Abstract: Noisy supervision refers to supervising image restoration learning with noisy targets. It can alleviate the data collection burden and enhance the practical applicability of deep learning techniques. However, existing methods suffer from two key drawbacks. Firstly, they are ineffective in handling spatially correlated noise commonly observed in practical applications such as low-light imaging and… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  8. arXiv:2505.16807  [pdf, ps, other

    eess.SP

    Chirp Delay-Doppler Domain Modulation: A New Paradigm of Integrated Sensing and Communication for Autonomous Vehicles

    Authors: Zhuoran Li, Shufeng Tan, Zhen Gao, Yi Tao, Zhonghuai Wu, Zhongxiang Li, Chun Hu, Dezhi Zheng

    Abstract: Autonomous driving is reshaping the way humans travel, with millimeter wave (mmWave) radar playing a crucial role in this transformation to enabe vehicle-to-everything (V2X). Although chirp is widely used in mmWave radar systems for its strong sensing capabilities, the lack of integrated communication functions in existing systems may limit further advancement of autonomous driving. In light of th… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  9. arXiv:2503.11324  [pdf, other

    cs.MM cs.CV eess.IV

    Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking

    Authors: Ziyi Wang, Songbai Tan, Gang Xu, Xuerui Qiu, Hongbin Xu, Xin Meng, Ming Li, Fei Richard Yu

    Abstract: With the success of autoregressive learning in large language models, it has become a dominant approach for text-to-image generation, offering high efficiency and visual quality. However, invisible watermarking for visual autoregressive (VAR) models remains underexplored, despite its importance in misuse prevention. Existing watermarking methods, designed for diffusion models, often struggle to ad… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  10. arXiv:2503.02685  [pdf, other

    q-bio.NC cs.CV eess.SP q-bio.QM

    TReND: Transformer derived features and Regularized NMF for neonatal functional network Delineation

    Authors: Sovesh Mohapatra, Minhui Ouyang, Shufang Tan, Jianlin Guo, Lianglong Sun, Yong He, Hao Huang

    Abstract: Precise parcellation of functional networks (FNs) of early developing human brain is the fundamental basis for identifying biomarker of developmental disorders and understanding functional development. Resting-state fMRI (rs-fMRI) enables in vivo exploration of functional changes, but adult FN parcellations cannot be directly applied to the neonates due to incomplete network maturation. No standar… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 10 Pages, 5 figures

  11. arXiv:2412.19990  [pdf, other

    eess.IV cs.CV

    SegKAN: High-Resolution Medical Image Segmentation with Long-Distance Dependencies

    Authors: Shengbo Tan, Rundong Xue, Shipeng Luo, Zeyu Zhang, Xinran Wang, Lei Zhang, Daji Ergu, Zhang Yi, Yang Zhao, Ying Cai

    Abstract: Hepatic vessels in computed tomography scans often suffer from image fragmentation and noise interference, making it difficult to maintain vessel integrity and posing significant challenges for vessel segmentation. To address this issue, we propose an innovative model: SegKAN. First, we improve the conventional embedding module by adopting a novel convolutional network structure for image embeddin… ▽ More

    Submitted 2 January, 2025; v1 submitted 27 December, 2024; originally announced December 2024.

  12. arXiv:2411.13862  [pdf, other

    eess.IV cs.CV cs.RO

    Image Compression Using Novel View Synthesis Priors

    Authors: Luyuan Peng, Mandar Chitre, Hari Vishnu, Yuen Min Too, Bharath Kalyan, Rajat Mishra, Soo Pieng Tan

    Abstract: Real-time visual feedback is essential for tetherless control of remotely operated vehicles, particularly during inspection and manipulation tasks. Though acoustic communication is the preferred choice for medium-range communication underwater, its limited bandwidth renders it impractical to transmit images or videos in real-time. To address this, we propose a model-based image compression techniq… ▽ More

    Submitted 27 November, 2024; v1 submitted 21 November, 2024; originally announced November 2024.

    Comments: Preprint submitted to IEEE Journal of Oceanic Engineering

  13. arXiv:2410.18461  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    Uncertainty-Error correlations in Evidential Deep Learning models for biomedical segmentation

    Authors: Hai Siong Tan, Kuancheng Wang, Rafe Mcbeth

    Abstract: In this work, we examine the effectiveness of an uncertainty quantification framework known as Evidential Deep Learning applied in the context of biomedical image segmentation. This class of models involves assigning Dirichlet distributions as priors for segmentation labels, and enables a few distinct definitions of model uncertainties. Using the cardiac and prostate MRI images available in the Me… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

    Comments: 15 pages

    Journal ref: Published in Proceedings of TAAI 2024

  14. arXiv:2409.14413  [pdf

    eess.IV

    Real-time Detection and Auto focusing of Beam Profiles from Silicon Photonics Gratings using YOLO model

    Authors: Yu Dian Lim, Hong Yu Li, Simon Chun Kiat Goh, Xiangyu Wang, Peng Zhao, Chuan Seng Tan

    Abstract: When observing the chip-to-free-space light beams from silicon photonics (SiPh) to free-space, manual adjustment of camera lens is often required to obtain a focused image of the light beams. In this letter, we demonstrated an auto-focusing system based on you-only-look-once (YOLO) model. The trained YOLO model exhibits high classification accuracy of 99.7% and high confidence level >0.95 when det… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

  15. arXiv:2409.00204  [pdf, other

    eess.IV cs.CV

    MedDet: Generative Adversarial Distillation for Efficient Cervical Disc Herniation Detection

    Authors: Zeyu Zhang, Nengmin Yi, Shengbo Tan, Ying Cai, Yi Yang, Lei Xu, Qingtai Li, Zhang Yi, Daji Ergu, Yang Zhao

    Abstract: Cervical disc herniation (CDH) is a prevalent musculoskeletal disorder that significantly impacts health and requires labor-intensive analysis from experts. Despite advancements in automated detection of medical imaging, two significant challenges hinder the real-world application of these methods. First, the computational complexity and resource demands present a significant gap for real-time app… ▽ More

    Submitted 18 October, 2024; v1 submitted 30 August, 2024; originally announced September 2024.

    Comments: Accepted to BIBM 2024 Oral

  16. arXiv:2408.10287  [pdf

    physics.optics cs.AI eess.IV

    Recognizing Beam Profiles from Silicon Photonics Gratings using Transformer Model

    Authors: Yu Dian Lim, Hong Yu Li, Simon Chun Kiat Goh, Xiangyu Wang, Peng Zhao, Chuan Seng Tan

    Abstract: Over the past decade, there has been extensive work in developing integrated silicon photonics (SiPh) gratings for the optical addressing of trapped ion qubits in the ion trap quantum computing community. However, when viewing beam profiles from infrared (IR) cameras, it is often difficult to determine the corresponding heights where the beam profiles are located. In this work, we developed transf… ▽ More

    Submitted 22 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  17. arXiv:2407.19544  [pdf

    cond-mat.mtrl-sci eess.IV

    Deep Generative Models-Assisted Automated Labeling for Electron Microscopy Images Segmentation

    Authors: Wenhao Yuan, Bingqing Yao, Shengdong Tan, Fengqi You, Qian He

    Abstract: The rapid advancement of deep learning has facilitated the automated processing of electron microscopy (EM) big data stacks. However, designing a framework that eliminates manual labeling and adapts to domain gaps remains challenging. Current research remains entangled in the dilemma of pursuing complete automation while still requiring simulations or slight manual annotations. Here we demonstrate… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  18. arXiv:2407.16961  [pdf, other

    cs.CV cs.RO eess.IV

    Pose Estimation from Camera Images for Underwater Inspection

    Authors: Luyuan Peng, Hari Vishnu, Mandar Chitre, Yuen Min Too, Bharath Kalyan, Rajat Mishra, Soo Pieng Tan

    Abstract: High-precision localization is pivotal in underwater reinspection missions. Traditional localization methods like inertial navigation systems, Doppler velocity loggers, and acoustic positioning face significant challenges and are not cost-effective for some applications. Visual localization is a cost-effective alternative in such cases, leveraging the cameras already equipped on inspection vehicle… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE Journal of Oceanic Engineering

  19. arXiv:2404.17126  [pdf, other

    cs.LG cs.AI eess.IV physics.med-ph

    Deep Evidential Learning for Radiotherapy Dose Prediction

    Authors: Hai Siong Tan, Kuancheng Wang, Rafe Mcbeth

    Abstract: In this work, we present a novel application of an uncertainty-quantification framework called Deep Evidential Learning in the domain of radiotherapy dose prediction. Using medical images of the Open Knowledge-Based Planning Challenge dataset, we found that this model can be effectively harnessed to yield uncertainty estimates that inherited correlations with prediction errors upon completion of n… ▽ More

    Submitted 23 September, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 28 pages

    Journal ref: Computers in Biology and Medicine, Vol. 182, Nov 2024, 109172

  20. arXiv:2404.15163  [pdf, other

    cs.CV eess.IV

    Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment

    Authors: Tianwei Zhou, Songbai Tan, Wei Zhou, Yu Luo, Yuan-Gen Wang, Guanghui Yue

    Abstract: With the increasing maturity of the text-to-image and image-to-image generative models, AI-generated images (AGIs) have shown great application potential in advertisement, entertainment, education, social media, etc. Although remarkable advancements have been achieved in generative models, very few efforts have been paid to design relevant quality assessment models. In this paper, we propose a nov… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: IEEE Transactions on Broadcasting (TBC)

  21. arXiv:2403.15132  [pdf, other

    cs.CV eess.IV

    Transfer CLIP for Generalizable Image Denoising

    Authors: Jun Cheng, Dong Liang, Shan Tan

    Abstract: Image denoising is a fundamental task in computer vision. While prevailing deep learning-based supervised and self-supervised methods have excelled in eliminating in-distribution noise, their susceptibility to out-of-distribution (OOD) noise remains a significant challenge. The recent emergence of contrastive language-image pre-training (CLIP) model has showcased exceptional capabilities in open-w… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  22. arXiv:2403.01229  [pdf, other

    cs.CV cs.AI cs.LG eess.SP

    REWIND Dataset: Privacy-preserving Speaking Status Segmentation from Multimodal Body Movement Signals in the Wild

    Authors: Jose Vargas Quiros, Chirag Raman, Stephanie Tan, Ekin Gedik, Laura Cabrera-Quiros, Hayley Hung

    Abstract: Recognizing speaking in humans is a central task towards understanding social interactions. Ideally, speaking would be detected from individual voice recordings, as done previously for meeting scenarios. However, individual voice recordings are hard to obtain in the wild, especially in crowded mingling scenarios due to cost, logistics, and privacy concerns. As an alternative, machine learning mode… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  23. arXiv:2402.18600  [pdf

    eess.IV cs.AI q-bio.TO

    Artificial Intelligence and Diabetes Mellitus: An Inside Look Through the Retina

    Authors: Yasin Sadeghi Bazargani, Majid Mirzaei, Navid Sobhi, Mirsaeed Abdollahi, Ali Jafarizadeh, Siamak Pedrammehr, Roohallah Alizadehsani, Ru San Tan, Sheikh Mohammed Shariful Islam, U. Rajendra Acharya

    Abstract: Diabetes mellitus (DM) predisposes patients to vascular complications. Retinal images and vasculature reflect the body's micro- and macrovascular health. They can be used to diagnose DM complications, including diabetic retinopathy (DR), neuropathy, nephropathy, and atherosclerotic cardiovascular disease, as well as forecast the risk of cardiovascular events. Artificial intelligence (AI)-enabled s… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 44 Pages, 6 figures, 1 table, 166 references

    ACM Class: J.3.2; J.3.3

  24. arXiv:2401.13587  [pdf, other

    cs.IT eess.SP

    Deep Learning Based Adaptive Joint mmWave Beam Alignment

    Authors: Daniel Tandler, Marc Gauger, Ahmet Serdar Tan, Sebastian Dörner, Stephan ten Brink

    Abstract: The challenging propagation environment, combined with the hardware limitations of mmWave systems, gives rise to the need for accurate initial access beam alignment strategies with low latency and high achievable beamforming gain. Much of the recent work in this area either focuses on one-sided beam alignment, or, joint beam alignment methods where both sides of the link perform a sequence of fixe… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  25. arXiv:2311.06572  [pdf, other

    eess.IV cs.CV

    Swin UNETR++: Advancing Transformer-Based Dense Dose Prediction Towards Fully Automated Radiation Oncology Treatments

    Authors: Kuancheng Wang, Hai Siong Tan, Rafe Mcbeth

    Abstract: The field of Radiation Oncology is uniquely positioned to benefit from the use of artificial intelligence to fully automate the creation of radiation treatment plans for cancer therapy. This time-consuming and specialized task combines patient imaging with organ and tumor segmentation to generate a 3D radiation dose distribution to meet clinical treatment goals, similar to voxel-level dense predic… ▽ More

    Submitted 12 October, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 16 pages

  26. arXiv:2311.06552  [pdf, other

    eess.IV cs.CV cs.LG

    Stain Consistency Learning: Handling Stain Variation for Automatic Digital Pathology Segmentation

    Authors: Michael Yeung, Todd Watts, Sean YW Tan, Pedro F. Ferreira, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Stain variation is a unique challenge associated with automated analysis of digital pathology. Numerous methods have been developed to improve the robustness of machine learning methods to stain variation, but comparative studies have demonstrated limited benefits to performance. Moreover, methods to handle stain variation were largely developed for H&E stained data, with evaluation generally limi… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

  27. arXiv:2308.16742  [pdf, other

    eess.IV cs.CV

    Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains

    Authors: Xuan Liu, Yaoqin Xie, Songhui Diao, Shan Tan, Xiaokun Liang

    Abstract: During the process of computed tomography (CT), metallic implants often cause disruptive artifacts in the reconstructed images, impeding accurate diagnosis. Several supervised deep learning-based approaches have been proposed for reducing metal artifacts (MAR). However, these methods heavily rely on training with simulated data, as obtaining paired metal artifact CT and clean CT data in clinical s… ▽ More

    Submitted 5 January, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

  28. arXiv:2305.15887  [pdf, other

    eess.IV cs.CV

    Diffusion Probabilistic Priors for Zero-Shot Low-Dose CT Image Denoising

    Authors: Xuan Liu, Yaoqin Xie, Jun Cheng, Songhui Diao, Shan Tan, Xiaokun Liang

    Abstract: Denoising low-dose computed tomography (CT) images is a critical task in medical image computing. Supervised deep learning-based approaches have made significant advancements in this area in recent years. However, these methods typically require pairs of low-dose and normal-dose CT images for training, which are challenging to obtain in clinical settings. Existing unsupervised deep learning-based… ▽ More

    Submitted 13 July, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  29. arXiv:2305.02493  [pdf, other

    cs.LG cs.AI eess.SY

    RCP-RF: A Comprehensive Road-car-pedestrian Risk Management Framework based on Driving Risk Potential Field

    Authors: Shuhang Tan, Zhiling Wang, Yan Zhong

    Abstract: Recent years have witnessed the proliferation of traffic accidents, which led wide researches on Automated Vehicle (AV) technologies to reduce vehicle accidents, especially on risk assessment framework of AV technologies. However, existing time-based frameworks can not handle complex traffic scenarios and ignore the motion tendency influence of each moving objects on the risk distribution, leading… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  30. An Electromagnetic-Information-Theory Based Model for Efficient Characterization of MIMO Systems in Complex Space

    Authors: Ruifeng Li, Da Li, Jinyan Ma, Zhaoyang Feng, Ling Zhang, Shurun Tan, Wei E. I. Sha, Hongsheng Chen, Er-Ping Li

    Abstract: It is the pursuit of a multiple-input-multiple-output (MIMO) system to approach and even break the limit of channel capacity. However, it is always a big challenge to efficiently characterize the MIMO systems in complex space and get better propagation performance than the conventional MIMO systems considering only free space, which is important for guiding the power and phase allocation of antenn… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: 13 pages, 14 figures

    Journal ref: IEEE Transactions on Antennas and Propagation, 2023

  31. arXiv:2301.01703  [pdf, other

    cs.IT eess.SP

    Technology Trends for Massive MIMO towards 6G

    Authors: Yiming Huo, Xingqin Lin, Boya Di, Hongliang Zhang, Francisco Javier Lorca Hernando, Ahmet Serdar Tan, Shahid Mumtaz, Özlem Tuğfe Demir, Kun Chen-Hu

    Abstract: At the dawn of the next-generation wireless systems and networks, massive multiple-input multiple-output (MIMO) has been envisioned as one of the enabling technologies. With the continued success of being applied in the 5G and beyond, the massive MIMO technology has demonstrated its advantageousness, integrability, and extendibility. Moreover, several evolutionary features and revolutionizing tren… ▽ More

    Submitted 5 January, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: 7 pages, 5 figures. This work has been submitted to the IEEE for possible publication

  32. A Remote Baby Surveillance System with RFID and GPS Tracking

    Authors: Ruven A/L Sundarajoo, Gwo Chin Chung, Wai Leong Pang, Soo Fun Tan

    Abstract: In the 21st century, sending babies or children to daycare centres has become more and more common among young guardians. The balance between full-time work and child care is increasingly challenging nowadays. In Malaysia, thousands of child abuse cases have been reported from babysitting centres every year, which indeed triggers the anxiety and stress of the guardians. Hence, this paper proposes… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

    Comments: 12 pages, 13 figures Published with International Journal of Engineering Trends and Technology (IJETT)

    Journal ref: International Journal of Engineering Trends and Technology, vol. 70, no. 11, pp. 81-92, 2022

  33. arXiv:2210.14446  [pdf, other

    cs.CL cs.SD eess.AS

    Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead

    Authors: Piyush Behre, Naveen Parihar, Sharman Tan, Amy Shah, Eva Sharma, Geoffrey Liu, Shuangyu Chang, Hosam Khalil, Chris Basoglu, Sayan Pathak

    Abstract: Segmentation for continuous Automatic Speech Recognition (ASR) has traditionally used silence timeouts or voice activity detectors (VADs), which are both limited to acoustic features. This segmentation is often overly aggressive, given that people naturally pause to think as they speak. Consequently, segmentation happens mid-sentence, hindering both punctuation and downstream tasks like machine tr… ▽ More

    Submitted 27 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

  34. arXiv:2201.05766  [pdf, ps, other

    cs.IT eess.SP

    Integrated Sensing and Communication with mmWave Massive MIMO: A Compressed Sampling Perspective

    Authors: Zhen Gao, Ziwei Wan, Dezhi Zheng, Shufeng Tan, Christos Masouros, Derrick Wing Kwan Ng, Sheng Chen

    Abstract: Integrated sensing and communication (ISAC) has opened up numerous game-changing opportunities for realizing future wireless systems. In this paper, we propose an ISAC processing framework relying on millimeter-wave (mmWave) massive multiple-input multiple-output (MIMO) systems. Specifically, we provide a compressed sampling (CS) perspective to facilitate ISAC processing, which can not only recove… ▽ More

    Submitted 9 September, 2022; v1 submitted 15 January, 2022; originally announced January 2022.

    Comments: 32 pages, 15 figures, accepted by IEEE Transactions on Wireless Communications

  35. arXiv:2110.05319  [pdf, other

    cs.CV eess.IV physics.geo-ph

    MD Loss: Efficient Training of 3D Seismic Fault Segmentation Network under Sparse Labels by Weakening Anomaly Annotation

    Authors: Yimin Dou, Kewen Li, Jianbing Zhu, Timing Li, Shaoquan Tan, Zongchao Huang

    Abstract: Data-driven fault detection has been regarded as a 3D image segmentation task. The models trained from synthetic data are difficult to generalize in some surveys. Recently, training 3D fault segmentation using sparse manual 2D slices is thought to yield promising results, but manual labeling has many false negative labels (abnormal annotations), which is detrimental to training and consequently to… ▽ More

    Submitted 21 June, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: This work has been submitted to the IEEE for possible publication

  36. Dynamic Response and Stability Margin Improvement of Wireless Power Receiver Systems via Right-Half-Plane Zero Elimination

    Authors: Kerui Li, Siew-Chong Tan, Ron Shu Yuen Hui

    Abstract: The series-series compensation topology is widely adopted in many wireless power transfer applications. For such systems, their wireless power receiver part typically involves a DC-DC converter with front-stage full-bridge diode rectifier, to process the high-frequency transmitted AC power into a DC output voltage for the load. It is recently reported that the current source nature of the series-s… ▽ More

    Submitted 17 April, 2021; originally announced June 2021.

    Comments: IEEE Transactions on Power Electronics, 2021

  37. arXiv:2104.04641  [pdf, other

    cs.CV eess.IV physics.optics

    CodedStereo: Learned Phase Masks for Large Depth-of-field Stereo

    Authors: Shiyu Tan, Yicheng Wu, Shoou-I Yu, Ashok Veeraraghavan

    Abstract: Conventional stereo suffers from a fundamental trade-off between imaging volume and signal-to-noise ratio (SNR) -- due to the conflicting impact of aperture size on both these variables. Inspired by the extended depth of field cameras, we propose a novel end-to-end learning-based technique to overcome this limitation, by introducing a phase mask at the aperture plane of the cameras in a stereo ima… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021 as an oral presentation

  38. On Effect of Right-Half-Plane Zero Present in Buck Converters with Input Current Source in Wireless Power Receiver Systems

    Authors: Kerui Li, Siew-Chong Tan, Ron Shu Yuen Hui

    Abstract: In wireless power receiver systems, the buck converter is widely used to step down the higher rectified voltage derived from the wireless receiver coil, to a lower output voltage for the immediate battery charging process. In this work, the presence and effect of the right-half-plane (RHP) zeros found in the small-signal inductor-current-to-duty-ratio and output-voltage-to-duty ratio transfer func… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: 11 pages. IEEE Transactions on Power Electronics 2020 (Early access)

    Journal ref: IEEE Trans. Power Electron., vol. 36, no. 6, pp. 6364-6374, June 2021

  39. arXiv:2008.03715  [pdf, other

    eess.SP cs.DC cs.MM

    A Modular Approach for Synchronized Wireless Multimodal Multisensor Data Acquisition in Highly Dynamic Social Settings

    Authors: Chirag Raman, Stephanie Tan, Hayley Hung

    Abstract: Existing data acquisition literature for human behavior research provides wired solutions, mainly for controlled laboratory setups. In uncontrolled free-standing conversation settings, where participants are free to walk around, these solutions are unsuitable. While wireless solutions are employed in the broadcasting industry, they can be prohibitively expensive. In this work, we propose a modular… ▽ More

    Submitted 9 August, 2020; originally announced August 2020.

    Comments: 9 pages, 8 figures, Proceedings of the 28th ACM International Conference on Multimedia (MM '20), October 12--16, 2020, Seattle, WA, USA. First two authors contributed equally

  40. Implementation of UAV Coordination Based on a Hierarchical Multi-UAV Simulation Platform

    Authors: Kun Xiao, Lan Ma, Shaochang Tan, Yirui Cong, Xiangke Wang

    Abstract: In this paper, a hierarchical multi-UAV simulation platform,called XTDrone, is designed for UAV swarms, which is completely open-source 4 . There are six layers in XTDrone: communication, simulator,low-level control, high-level control, coordination, and human interac-tion layers. XTDrone has three advantages. Firstly, the simulation speedcan be adjusted to match the computer performance, based on… ▽ More

    Submitted 30 May, 2022; v1 submitted 3 May, 2020; originally announced May 2020.

    Comments: 12 pages, 10 figures. And for the, see https://gitee.com/robin_shaun/XTDrone

    Journal ref: Proceedings of 2020 International Conference on Guidance Navigation and Control, ICGNC 2020, Tianjin, China, October 23-25, 2020

  41. On Beat Frequency Oscillation of Two-Stage Wireless Power Receivers

    Authors: Kerui Li, Siew-Chong Tan, Ron Shu Yuen Hui

    Abstract: Two-stage wireless power receivers, which typically include an AC-DC diode rectifier and a DC-DC regulator, are popular solutions in low-power wireless power transfer applications. However, the interaction between the rectifier and the regulator may introduce beat frequency oscillation on both the DC-link and output capacitors. In this paper, the cause of the beat frequency oscillation and its rel… ▽ More

    Submitted 5 October, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Journal ref: in IEEE Transactions on Power Electronics, vol. 35, no. 12, pp. 12741-12751, Dec. 2020

  42. arXiv:2004.13181  [pdf, ps, other

    cs.LG cs.NE eess.IV stat.ML

    EM-GAN: Fast Stress Analysis for Multi-Segment Interconnect Using Generative Adversarial Networks

    Authors: Wentian Jin, Sheriff Sadiqbatcha, Jinwei Zhang, Sheldon X. -D. Tan

    Abstract: In this paper, we propose a fast transient hydrostatic stress analysis for electromigration (EM) failure assessment for multi-segment interconnects using generative adversarial networks (GANs). Our work leverages the image synthesis feature of GAN-based generative deep neural networks. The stress evaluation of multi-segment interconnects, modeled by partial differential equations, can be viewed as… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

  43. Highly-Efficient Single-Switch-Regulated Resonant Wireless Power Receiver with Hybrid Modulation

    Authors: Kerui Li, Albert Ting Leung Lee, Siew-Chong Tan, Ron Shu Yuen Hui

    Abstract: In this paper, a highly-efficient single-switch-regulated resonant wireless power receiver with hybrid modulation is proposed. To achieve both high efficiency and good output voltage regulation, phase shift and pulse width hybrid modulation are simultaneously applied. The soft switching operation in this topology is achieved by the cycle-by-cycle phase shift adjustment between the input current an… ▽ More

    Submitted 5 January, 2021; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: in IEEE Journal of Emerging and Selected Topics in Power Electronics. 2020

  44. arXiv:2002.12588  [pdf, other

    eess.IV cs.CV cs.LG

    Regional Registration of Whole Slide Image Stacks Containing Highly Deformed Artefacts

    Authors: Mahsa Paknezhad, Sheng Yang Michael Loh, Yukti Choudhury, Valerie Koh Cui Koh, TimothyTay Kwang Yong, Hui Shan Tan, Ravindran Kanesvaran, Puay Hoon Tan, John Yuen Shyi Peng, Weimiao Yu, Yongcheng Benjamin Tan, Yong Zhen Loy, Min-Han Tan, Hwee Kuan Lee

    Abstract: Motivation: High resolution 2D whole slide imaging provides rich information about the tissue structure. This information can be a lot richer if these 2D images can be stacked into a 3D tissue volume. A 3D analysis, however, requires accurate reconstruction of the tissue volume from the 2D image stack. This task is not trivial due to the distortions that each individual tissue slice experiences wh… ▽ More

    Submitted 28 February, 2020; originally announced February 2020.

  45. Single-Switch-Regulated Resonant WPT Receiver

    Authors: Kerui Li, Siew Chong Tan, Ron Shu Yuen Hui

    Abstract: A single-switch-regulated wireless power transfer (WPT) receiver is presented in this letter. Aiming at low-cost applications, the system involves only a single-switch class-E resonant rectifier, a frequency synchronization circuit, and a microcontroller. The number of power semiconductor devices required in this circuit is minimal. Only one active switch is used and no diode is required. As a sin… ▽ More

    Submitted 18 December, 2019; v1 submitted 12 December, 2019; originally announced December 2019.

    Journal ref: in IEEE Transactions on Power Electronics, vol. 34, no. 11, pp. 10386-10391, Nov. 2019

  46. Single-Stage Regulated Resonant WPT Receiver with Low Input Harmonic Distortion

    Authors: Kerui Li, Siew Chong Tan, Ron Shu Yuen Hui

    Abstract: Resonant rectifier topologies would be a promising candidate for achieving simple, compact, and reliable single-stage wireless power transfer (WPT) receiver if not for the lack of good DC regulation capability. This paper investigates the problems that prevent the feasibility of single-stage DC regulation in resonant rectifier topologies. A possible solution is the proposed differential resonant r… ▽ More

    Submitted 6 January, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Journal ref: in IEEE Transactions on Power Electronics, vol. 35, no. 7, pp. 6820-6829, July 2020

  47. arXiv:1911.12796  [pdf, other

    cs.CV cs.LG eess.IV

    Light-weight Calibrator: a Separable Component for Unsupervised Domain Adaptation

    Authors: Shaokai Ye, Kailu Wu, Mu Zhou, Yunfei Yang, Sia huat Tan, Kaidi Xu, Jiebo Song, Chenglong Bao, Kaisheng Ma

    Abstract: Existing domain adaptation methods aim at learning features that can be generalized among domains. These methods commonly require to update source classifier to adapt to the target domain and do not properly handle the trade off between the source domain and the target domain. In this work, instead of training a classifier to adapt to the target domain, we use a separable component called data cal… ▽ More

    Submitted 28 February, 2020; v1 submitted 28 November, 2019; originally announced November 2019.

    Comments: Accepted by CVPR2020

  48. arXiv:1911.04657  [pdf, other

    cs.MM eess.IV

    CALPA-NET: Channel-pruning-assisted Deep Residual Network for Steganalysis of Digital Images

    Authors: Shunquan Tan, Weilong Wu, Zilong Shao, Qiushi Li, Bin Li, Jiwu Huang

    Abstract: Over the past few years, detection performance improvements of deep-learning based steganalyzers have been usually achieved through structure expansion. However, excessive expanded structure results in huge computational cost, storage overheads, and consequently difficulty in training and deployment. In this paper we propose CALPA-NET, a ChAnneL-Pruning-Assisted deep residual network architecture… ▽ More

    Submitted 23 June, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: Accepted by IEEE Transactions on Information Forensics & Security

  49. arXiv:1910.09570  [pdf, other

    q-bio.QM cs.CV eess.SP stat.AP stat.ML

    Icentia11K: An Unsupervised Representation Learning Dataset for Arrhythmia Subtype Discovery

    Authors: Shawn Tan, Guillaume Androz, Ahmad Chamseddine, Pierre Fecteau, Aaron Courville, Yoshua Bengio, Joseph Paul Cohen

    Abstract: We release the largest public ECG dataset of continuous raw signals for representation learning containing 11 thousand patients and 2 billion labelled beats. Our goal is to enable semi-supervised ECG models to be made as well as to discover unknown subtypes of arrhythmia and anomalous ECG signal events. To this end, we propose an unsupervised representation learning task, evaluated in a semi-super… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: Under Review

  50. arXiv:1908.07406  [pdf, ps, other

    eess.SP

    Multi-Objective Optimization for Drone Delivery

    Authors: Suttinee Sawadsitang, Dusit Niyato, Puay Siew Tan, Sarana Nutanong

    Abstract: Recently, an unmanned aerial vehicle (UAV), as known as drone, has become an alternative means of package delivery. Although the drone delivery scheduling has been studied in recent years, most existing models are formulated as a single objective optimization problem. However, in practice, the drone delivery scheduling has multiple objectives that the shipper has to achieve. Moreover, drone delive… ▽ More

    Submitted 24 July, 2019; originally announced August 2019.

    Comments: 5 pages, 4 figures

    Journal ref: 2019 IEEE 90th Vehicular Technology Conference: VTC2019-Fall