Skip to main content

Showing 1–50 of 56 results for author: Liang, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.20475  [pdf, ps, other

    eess.SY eess.IV

    Learning-based safety lifting monitoring system for cranes on construction sites

    Authors: Hao Chen, Yu Hin Ng, Ching-Wei Chang, Haobo Liang, Yanke Wang

    Abstract: Lifting on construction sites, as a frequent operation, works still with safety risks, especially for modular integrated construction (MiC) lifting due to its large weight and size, probably leading to accidents, causing damage to the modules, or more critically, posing safety hazards to on-site workers. Aiming to reduce the safety risks in lifting scenarios, we design an automated safe lifting mo… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 20 pages, 10 figures

  2. arXiv:2506.18938  [pdf, ps, other

    cs.CV eess.SY

    Bird's-eye view safety monitoring for the construction top under the tower crane

    Authors: Yanke Wang, Yu Hin Ng, Haobo Liang, Ching-Wei Chang, Hao Chen

    Abstract: The tower crane is involving more automated and intelligent operation procedure, and importantly, the application of automation technologies to the safety issues is imperative ahead of the utilization of any other advances. Among diverse risk management tasks on site, it is essential to protect the human workers on the workspace between the tower crane and constructed building top area (constructi… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

  3. arXiv:2506.01947  [pdf, ps, other

    eess.IV cs.CV

    RAW Image Reconstruction from RGB on Smartphones. NTIRE 2025 Challenge Report

    Authors: Marcos V. Conde, Radu Timofte, Radu Berdan, Beril Besbinar, Daisuke Iso, Pengzhou Ji, Xiong Dun, Zeying Fan, Chen Wu, Zhansheng Wang, Pengbo Zhang, Jiazi Huang, Qinglin Liu, Wei Yu, Shengping Zhang, Xiangyang Ji, Kyungsik Kim, Minkyung Kim, Hwalmin Lee, Hekun Ma, Huan Zheng, Yanyan Wei, Zhao Zhang, Jing Fang, Meilin Gao , et al. (8 additional authors not shown)

    Abstract: Numerous low-level vision tasks operate in the RAW domain due to its linear properties, bit depth, and sensor designs. Despite this, RAW image datasets are scarce and more expensive to collect than the already large and public sRGB datasets. For this reason, many approaches try to generate realistic RAW images using sensor information and sRGB images. This paper covers the second challenge on RAW… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: CVPR 2025 - New Trends in Image Restoration and Enhancement (NTIRE)

  4. arXiv:2505.03078  [pdf, other

    cs.GT cs.SI eess.SY

    Coevolution of Actions and Opinions in Networks of Coordinating and Anti-Coordinating Agents

    Authors: Hong Liang, Mengbin Ye, Lorenzo Zino, Weiguo Xia

    Abstract: In this paper, we investigate the dynamics of coordinating and anti-coordinating agents in a coevolutionary model for actions and opinions. In the model, the individuals of a population interact on a two-layer network, sharing their opinions and observing others' action, while revising their own opinions and actions according to a game-theoretic mechanism, grounded in the social psychology literat… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: Manuscript under review as a journal submission

  5. arXiv:2503.20613  [pdf, other

    cs.LG cs.AI cs.NI eess.SY

    State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning

    Authors: Zongyuan Zhang, Tianyang Duan, Zheng Lin, Dong Huang, Zihan Fang, Zekai Sun, Ling Xiong, Hongbin Liang, Heming Cui, Yong Cui

    Abstract: Recently, deep reinforcement learning (DRL) has emerged as a promising approach for robotic control. However, the deployment of DRL in real-world robots is hindered by its sensitivity to environmental perturbations. While existing whitebox adversarial attacks rely on local gradient information and apply uniform perturbations across all states to evaluate DRL robustness, they fail to account for te… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 15 pages, 11 figures

  6. arXiv:2502.13998  [pdf, ps, other

    eess.IV cs.AI cs.CR cs.CV

    A Baseline Method for Removing Invisible Image Watermarks using Deep Image Prior

    Authors: Hengyue Liang, Taihui Li, Ju Sun

    Abstract: Image watermarks have been considered a promising technique to help detect AI-generated content, which can be used to protect copyright or prevent fake image abuse. In this work, we present a black-box method for removing invisible image watermarks, without the need of any dataset of watermarked images or any knowledge about the watermark system. Our approach is simple to implement: given a single… ▽ More

    Submitted 2 July, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Pulished in Transaction of Machine Learning Research (TMLR): https://openreview.net/forum?id=g85Vxlrq0O

  7. arXiv:2502.10058  [pdf, other

    cs.CL eess.AS

    MTLM: Incorporating Bidirectional Text Information to Enhance Language Model Training in Speech Recognition Systems

    Authors: Qingliang Meng, Pengju Ren, Tian Li, Changsong Dai, Huizhi Liang

    Abstract: Automatic speech recognition (ASR) systems normally consist of an acoustic model (AM) and a language model (LM). The acoustic model estimates the probability distribution of text given the input speech, while the language model calibrates this distribution toward a specific knowledge domain to produce the final transcription. Traditional ASR-specific LMs are typically trained in a unidirectional (… ▽ More

    Submitted 14 June, 2025; v1 submitted 14 February, 2025; originally announced February 2025.

  8. arXiv:2501.15368  [pdf, other

    cs.CL cs.SD eess.AS

    Baichuan-Omni-1.5 Technical Report

    Authors: Yadong Li, Jun Liu, Tao Zhang, Tao Zhang, Song Chen, Tianpeng Li, Zehuan Li, Lijun Liu, Lingfeng Ming, Guosheng Dong, Da Pan, Chong Li, Yuanbo Fang, Dongdong Kuang, Mingrui Wang, Chenglin Zhu, Youwei Zhang, Hongyu Guo, Fengyu Zhang, Yuran Wang, Bowen Ding, Wei Song, Xu Li, Yuqi Huo, Zheng Liang , et al. (68 additional authors not shown)

    Abstract: We introduce Baichuan-Omni-1.5, an omni-modal model that not only has omni-modal understanding capabilities but also provides end-to-end audio generation capabilities. To achieve fluent and high-quality interaction across modalities without compromising the capabilities of any modality, we prioritized optimizing three key aspects. First, we establish a comprehensive data cleaning and synthesis pip… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

  9. arXiv:2412.13461  [pdf, other

    cs.CV cs.AI eess.IV

    Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection

    Authors: Hanzhe Liang, Guoyang Xie, Chengbin Hou, Bingshu Wang, Can Gao, Jinbao Wang

    Abstract: 3D anomaly detection has recently become a significant focus in computer vision. Several advanced methods have achieved satisfying anomaly detection performance. However, they typically concentrate on the external structure of 3D samples and struggle to leverage the internal information embedded within samples. Inspired by the basic intuition of why not look inside for more, we introduce a straigh… ▽ More

    Submitted 10 March, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: AAAI2025 Poster

  10. arXiv:2412.08029  [pdf, other

    cs.CV cs.AI cs.HC cs.MM eess.IV

    NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods

    Authors: Qiang Qu, Hanxue Liang, Xiaoming Chen, Yuk Ying Chung, Yiran Shen

    Abstract: Neural View Synthesis (NVS) has demonstrated efficacy in generating high-fidelity dense viewpoint videos using a image set with sparse views. However, existing quality assessment methods like PSNR, SSIM, and LPIPS are not tailored for the scenes with dense viewpoints synthesized by NVS and NeRF variants, thus, they often fall short in capturing the perceptual quality, including spatial and angular… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 5, pp. 2129-2139, May 2024

  11. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Enhancing Diagnostic Accuracy in Rare and Common Fundus Diseases with a Knowledge-Rich Vision-Language Model

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for fundus images were pre-trained with limited disease categories and knowledge base. Here we introduce a knowledge-rich vision-language model (RetiZero) that leverages knowledge from more than 400 fundus diseases. For RetiZero's pretraining, we compiled 341,896 fundus images paired with texts, sourced from public datasets, ophthalmic literature, and online resources, e… ▽ More

    Submitted 10 April, 2025; v1 submitted 13 June, 2024; originally announced June 2024.

  12. arXiv:2406.08782   

    eess.IV cs.CV

    Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising

    Authors: Hao Liang, Chengjie, Kun Li, Xin Tian

    Abstract: Hyperspectral image (HSI) denoising is an essential procedure for HSI applications. Unfortunately, the existing Transformer-based methods mainly focus on non-local modeling, neglecting the importance of locality in image denoising. Moreover, deep learning methods employ complex spectral learning mechanisms, thus introducing large computation costs. To address these problems, we propose a hybrid… ▽ More

    Submitted 1 August, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: There are some errors in professional theory

  13. arXiv:2406.04685  [pdf, other

    eess.SY cs.NI

    Statistical QoS Provisioning Architecture for 6G Satellite-Terrestrial Integrated Networks

    Authors: Jingqing Wang, Wenchi Cheng, Wei Zhang, Hui Liang

    Abstract: The emergence of massive ultra-reliable and low latency communications (mURLLC) as a category of time/reliability-sensitive service over 6G networks has received considerable research attention, which has presented unprecedented challenges. As one of the key enablers for 6G, satellite-terrestrial integrated networks (STIN) have been developed to offer more expansive connectivity and comprehensive… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  14. arXiv:2404.11171  [pdf, other

    cs.LG cs.AI eess.SP

    Personalized Heart Disease Detection via ECG Digital Twin Generation

    Authors: Yaojun Hu, Jintai Chen, Lianting Hu, Dantong Li, Jiahuan Yan, Haochao Ying, Huiying Liang, Jian Wu

    Abstract: Heart diseases rank among the leading causes of global mortality, demonstrating a crucial need for early diagnosis and intervention. Most traditional electrocardiogram (ECG) based automated diagnosis methods are trained at population level, neglecting the customization of personalized ECGs to enhance individual healthcare management. A potential solution to address this limitation is to employ dig… ▽ More

    Submitted 11 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  15. arXiv:2402.01380  [pdf, other

    cs.CV eess.IV

    Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization

    Authors: Zhiyu Zhang, Guo Lu, Huanxiong Liang, Anni Tang, Qiang Hu, Li Song

    Abstract: Volumetric videos, benefiting from immersive 3D realism and interactivity, hold vast potential for various applications, while the tremendous data volume poses significant challenges for compression. Recently, NeRF has demonstrated remarkable potential in volumetric video compression thanks to its simple representation and powerful 3D modeling capabilities, where a notable work is ReRF. However, R… ▽ More

    Submitted 7 November, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE ICME 2024

  16. Arithmetic Average Density Fusion -- Part IV: Distributed Heterogeneous Fusion of RFS and LRFS Filters via Variational Approximation

    Authors: Tiancheng Li, Haozhe Liang, Guchong Li, Jesús García Herrero, Quan Pan

    Abstract: This paper, the fourth part of a series of papers on the arithmetic average (AA) density fusion approach and its application for target tracking, addresses the intricate challenge of distributed heterogeneous multisensor multitarget tracking, where each inter-connected sensor operates a probability hypothesis density (PHD) filter, a multiple Bernoulli (MB) filter or a labeled MB (LMB) filter and t… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

    Comments: 13 pages,14 figures

    Journal ref: IEEE Transactions on Signal Processing, 2025

  17. arXiv:2401.00225  [pdf

    eess.AS cs.AI eess.SP

    Enhancing dysarthria speech feature representation with empirical mode decomposition and Walsh-Hadamard transform

    Authors: Ting Zhu, Shufei Duan, Camille Dingam, Huizhi Liang, Wei Zhang

    Abstract: Dysarthria speech contains the pathological characteristics of vocal tract and vocal fold, but so far, they have not yet been included in traditional acoustic feature sets. Moreover, the nonlinearity and non-stationarity of speech have been ignored. In this paper, we propose a feature enhancement algorithm for dysarthria speech called WHFEMD. It combines empirical mode decomposition (EMD) and fast… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  18. arXiv:2312.16057  [pdf, other

    cs.IT eess.SP

    Semantic Importance-Aware Based for Multi-User Communication Over MIMO Fading Channels

    Authors: Haotai Liang, Zhicheng Bao, Wannian An, Chen Dong, Xiaodong Xu

    Abstract: Semantic communication, as a novel communication paradigm, has attracted the interest of many scholars, with multi-user, multi-input multi-output (MIMO) scenarios being one of the critical contexts. This paper presents a semantic importance-aware based communication system (SIA-SC) over MIMO Rayleigh fading channels. Combining the semantic symbols' inequality and the equivalent subchannels of MIMO… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  19. arXiv:2312.10051  [pdf, other

    eess.SP

    Semantic Synchronization for Enhanced Reliability in Communication Systems

    Authors: Xiaoyi Liu, Haotai Liang, Chen Dong, Xiaodong Xu

    Abstract: As a new communication paradigm, semantic communication has received widespread attention in communication fields. However, since the decoding of semantic signals relies on contextual knowledge, misalignment between the starting position of the semantic signal and the AI-based semantic decoder would prevent source signal recovery and reconstruction. To achieve more precise semantic communication,… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  20. arXiv:2312.08998  [pdf

    eess.AS cs.AI cs.SD eess.SP

    Design, construction and evaluation of emotional multimodal pathological speech database

    Authors: Ting Zhu, Shufei Duan, Huizhi Liang, Wei Zhang

    Abstract: The lack of an available emotion pathology database is one of the key obstacles in studying the emotion expression status of patients with dysarthria. The first Chinese multimodal emotional pathological speech database containing multi-perspective information is constructed in this paper. It includes 29 controls and 39 patients with different degrees of motor dysarthria, expressing happy, sad, ang… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  21. arXiv:2309.12849  [pdf, other

    cs.LG eess.SY

    DeepOPF-U: A Unified Deep Neural Network to Solve AC Optimal Power Flow in Multiple Networks

    Authors: Heng Liang, Changhong Zhao

    Abstract: The traditional machine learning models to solve optimal power flow (OPF) are mostly trained for a given power network and lack generalizability to today's power networks with varying topologies and growing plug-and-play distributed energy resources (DERs). In this paper, we propose DeepOPF-U, which uses one unified deep neural network (DNN) to solve alternating-current (AC) OPF problems in differ… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: 3 pages, 2 figures

  22. arXiv:2308.16738  [pdf, other

    eess.IV cs.CV cs.LG

    SFUSNet: A Spatial-Frequency domain-based Multi-branch Network for diagnosis of Cervical Lymph Node Lesions in Ultrasound Images

    Authors: Yubiao Yue, Jun Xue, Haihua Liang, Bingchun Luo, Zhenzhang Li

    Abstract: Booming deep learning has substantially improved the diagnosis for diverse lesions in ultrasound images, but a conspicuous research gap concerning cervical lymph node lesions still remains. The objective of this work is to diagnose cervical lymph node lesions in ultrasound images by leveraging a deep learning model. To this end, we first collected 3392 cervical ultrasound images containing normal… ▽ More

    Submitted 4 October, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  23. arXiv:2308.14081   

    eess.IV cs.CV

    U-SEANNet: A Simple, Efficient and Applied U-Shaped Network for Diagnosis of Nasal Diseases on Nasal Endoscopic Images

    Authors: Yubiao Yue, Jun Xue, Chao Wang, Haihua Liang, Zhenzhang Li

    Abstract: Numerous studies have affirmed that deep learning models can facilitate early diagnosis of lesions in endoscopic images. However, the lack of available datasets stymies advancements in research on nasal endoscopy, and existing models fail to strike a good trade-off between model diagnosis performance, model complexity and parameters size, rendering them unsuitable for real-world application. To br… ▽ More

    Submitted 11 February, 2024; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: There are some descriptive errors in the manuscript

  24. arXiv:2308.04805  [pdf, other

    cs.IR cs.SD eess.AS

    DiVa: An Iterative Framework to Harvest More Diverse and Valid Labels from User Comments for Music

    Authors: Hongru Liang, Jingyao Liu, Yuanxin Xiang, Jiachen Du, Lanjun Zhou, Shushen Pan, Wenqiang Lei

    Abstract: Towards sufficient music searching, it is vital to form a complete set of labels for each song. However, current solutions fail to resolve it as they cannot produce diverse enough mappings to make up for the information missed by the gold labels. Based on the observation that such missing information may already be presented in user comments, we propose to study the automated music labeling in an… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 11 pages, 5 figures, published to ACM MM 2023

  25. arXiv:2306.10772  [pdf, other

    cs.SD eess.AS

    Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming

    Authors: Hao Liang, Guanxing Zhou, Xiaotong Tu, Andreas Jakobsson, Xinghao Ding, Yue Huang

    Abstract: Recently, many forms of audio industrial applications, such as sound monitoring and source localization, have begun exploiting smart multi-modal devices equipped with a microphone array. Regrettably, model-based methods are often difficult to employ for such devices due to their high computational complexity, as well as the difficulty of appropriately selecting the user-determined parameters. As a… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: 12 pages, 9 figures

  26. arXiv:2305.00149  [pdf, other

    eess.IV cs.CV

    X-ray Recognition: Patient identification from X-rays using a contrastive objective

    Authors: Hao Liang, Kevin Ni, Guha Balakrishnan

    Abstract: Recent research demonstrates that deep learning models are capable of precisely extracting bio-information (e.g. race, gender and age) from patients' Chest X-Rays (CXRs). In this paper, we further show that deep learning models are also surprisingly accurate at recognition, i.e., distinguishing CXRs belonging to the same patient from those belonging to different patients. These findings suggest po… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

  27. arXiv:2305.00147  [pdf, other

    eess.IV cs.CV

    Visualizing chest X-ray dataset biases using GANs

    Authors: Hao Liang, Kevin Ni, Guha Balakrishnan

    Abstract: Recent work demonstrates that images from various chest X-ray datasets contain visual features that are strongly correlated with protected demographic attributes like race and gender. This finding raises issues of fairness, since some of these factors may be used by downstream algorithms for clinical predictions. In this work, we propose a framework, using generative adversarial networks (GANs), t… ▽ More

    Submitted 5 September, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

    Comments: Medical Imaging with Deep Learning(MIDL) 2023

  28. arXiv:2303.15206  [pdf, other

    cs.CV eess.IV

    Perceptual Quality Assessment of NeRF and Neural View Synthesis Methods for Front-Facing Views

    Authors: Hanxue Liang, Tianhao Wu, Param Hanji, Francesco Banterle, Hongyun Gao, Rafal Mantiuk, Cengiz Oztireli

    Abstract: Neural view synthesis (NVS) is one of the most successful techniques for synthesizing free viewpoint videos, capable of achieving high fidelity from only a sparse set of captured images. This success has led to many variants of the techniques, each evaluated on a set of test views typically using image quality metrics such as PSNR, SSIM, or LPIPS. There has been a lack of research on how NVS metho… ▽ More

    Submitted 24 October, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

  29. arXiv:2303.11692  [pdf, other

    cs.SD cs.IR eess.AS

    ByteCover3: Accurate Cover Song Identification on Short Queries

    Authors: Xingjian Du, Zijie Wang, Xia Liang, Huidong Liang, Bilei Zhu, Zejun Ma

    Abstract: Deep learning based methods have become a paradigm for cover song identification (CSI) in recent years, where the ByteCover systems have achieved state-of-the-art results on all the mainstream datasets of CSI. However, with the burgeon of short videos, many real-world applications require matching short music excerpts to full-length music tracks in the database, which is still under-explored and w… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepeted by ICASSP 2023

  30. Non-Orthogonal Multiple Access Enhanced Multi-User Semantic Communication

    Authors: Weizhi Li, Haotai Liang, Chen Dong, Xiaodong Xu, Ping Zhang, Kaijun Liu

    Abstract: Semantic communication serves as a novel paradigm and attracts the broad interest of researchers. One critical aspect of it is the multi-user semantic communication theory, which can further promote its application to the practical network environment. While most existing works focused on the design of end-to-end single-user semantic transmission, a novel non-orthogonal multiple access (NOMA)-base… ▽ More

    Submitted 20 November, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: accepted by IEEE Transactions on Cognitive Communications and Networking

  31. arXiv:2303.01175  [pdf, ps, other

    math.AC cs.IT eess.SP

    A Field-Theoretic View of Unlabeled Sensing

    Authors: Hao Liang, Jingyu Lu, Manolis C. Tsakiris, Lihong Zhi

    Abstract: Unlabeled sensing is the problem of solving a linear system of equations, where the right-hand-side vector is known only up to a permutation. In this work, we study fields of rational functions related to symmetric polynomials and their images under a linear projection of the variables; as a consequence, we establish that the solution to an n-dimensional unlabeled sensing problem with generic data… ▽ More

    Submitted 4 November, 2024; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: 12 pages

  32. arXiv:2301.03331  [pdf, other

    cs.CV cs.AI eess.IV

    A Specific Task-oriented Semantic Image Communication System for substation patrol inspection

    Authors: Senran Fan, Haotai Liang, Chen Dong, Xiaodong Xu, Geng Liu

    Abstract: Intelligent inspection robots are widely used in substation patrol inspection, which can help check potential safety hazards by patrolling the substation and sending back scene images. However, when patrolling some marginal areas with weak signal, the scene images cannot be sucessfully transmissted to be used for hidden danger elimination, which greatly reduces the quality of robots'daily work. To… ▽ More

    Submitted 13 April, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: 9 pages, 8 figures

    Journal ref: IEEE Transactions on Power Delivery; vol. 39; no. 2; pp. 835-844; April 2024

  33. arXiv:2212.03093  [pdf

    eess.SY

    Cooperative Guidance Strategy for Active Defense Spacecraft with Imperfect Information via Deep Reinforcement Learning

    Authors: Li Zhi, Haizhao Liang, Jinze Wu, Jianying Wang, Yu Zheng

    Abstract: In this paper, an adaptive cooperative guidance strategy for the active protection of a target spacecraft trying to evade an interceptor was developed. The target spacecraft performs evasive maneuvers, launching an active defense vehicle to divert the interceptor. Instead of classical strategies, which are based on optimal control or differential game theory, the problem was solved by using the de… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  34. arXiv:2211.02320  [pdf, other

    eess.SY

    Aircraft Ground Taxiing Deduction and Conflict Early Warning Method Based on Control Command Information

    Authors: Jingchang Zhuge, Huiyuan Liang, Yiming Zhang, Shichao Li, Xinyu Yang, Jun Wu

    Abstract: Aircraft taxiing conflict is a threat to the safety of airport operations, mainly due to the human error in control command infor-mation. In order to solve the problem, The aircraft taxiing deduction and conflict early warning method based on control order information is proposed. This method does not need additional equipment and operating costs, and is completely based on his-torical data and co… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  35. arXiv:2210.00621  [pdf, other

    cs.LG cs.CV eess.SP math.OC

    Optimization for Robustness Evaluation beyond $\ell_p$ Metrics

    Authors: Hengyue Liang, Buyun Liang, Ying Cui, Tim Mitchell, Ju Sun

    Abstract: Empirical evaluation of deep learning models against adversarial attacks entails solving nontrivial constrained optimization problems. Popular algorithms for solving these constrained problems rely on projected gradient descent (PGD) and require careful tuning of multiple hyperparameters. Moreover, PGD can only handle $\ell_1$, $\ell_2$, and $\ell_\infty$ attack models due to the use of analytical… ▽ More

    Submitted 13 November, 2022; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: 5 pages, 1 figure, 3 tables, accepted by the 14th International OPT Workshop on Optimization for Machine Learning, and submitted to the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

  36. arXiv:2203.16988  [pdf

    cs.SD cs.LG eess.AS

    Acoustic-Net: A Novel Neural Network for Sound Localization and Quantification

    Authors: Guanxing Zhou, Hao Liang, Xinghao Ding, Yue Huang, Xiaotong Tu, Saqlain Abbas

    Abstract: Acoustic source localization has been applied in different fields, such as aeronautics and ocean science, generally using multiple microphones array data to reconstruct the source location. However, the model-based beamforming methods fail to achieve the high-resolution of conventional beamforming maps. Deep neural networks are also appropriate to locate the sound source, but in general, these met… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

  37. arXiv:2203.10674  [pdf, other

    cs.LG cs.CR cs.NI eess.SY

    RareGAN: Generating Samples for Rare Classes

    Authors: Zinan Lin, Hao Liang, Giulia Fanti, Vyas Sekar

    Abstract: We study the problem of learning generative adversarial networks (GANs) for a rare class of an unlabeled dataset subject to a labeling budget. This problem is motivated from practical applications in domains including security (e.g., synthesizing packets for DNS amplification attacks), systems and networking (e.g., synthesizing workloads that trigger high resource usage), and machine learning (e.g… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Published in AAAI 2022

  38. arXiv:2203.05087  [pdf, other

    eess.SY

    False Data Injection Attack on Electric Vehicle-Assisted Voltage Regulation

    Authors: Yuan Liu, Omid Ardakanian, Ioanis Nikolaidis, Hao Liang

    Abstract: With the large scale penetration of electric vehicles (EVs) and the advent of bidirectional chargers, EV aggregators will become a major player in the voltage regulation market. This paper proposes a novel false data injection attack (FDIA) against the voltage regulation capacity estimation of EV charging stations, the process that underpins voltage regulation in distribution system. The proposed… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 10 pages

  39. arXiv:2202.09595  [pdf, other

    eess.SP

    Innovative semantic communication system

    Authors: Chen Dong, Haotai Liang, Xiaodong Xu, Shujun Han, Bizhu Wang, Ping Zhang

    Abstract: Traditional communication systems focus on the transmission process, and the context-dependent meaning has been ignored. The fact that 5G system has approached Shannon limit and the increasing amount of data will cause communication bottleneck, such as the increased delay problems. Inspired by the ability of artificial intelligence to understand semantics, we propose a new communication paradigm,… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

  40. arXiv:2112.06074  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Early Stopping for Deep Image Prior

    Authors: Hengkang Wang, Taihui Li, Zhong Zhuang, Tiancong Chen, Hengyue Liang, Ju Sun

    Abstract: Deep image prior (DIP) and its variants have showed remarkable potential for solving inverse problems in computer vision, without any extra training data. Practical DIP models are often substantially overparameterized. During the fitting process, these models learn mostly the desired visual content first, and then pick up the potential modeling and observational noise, i.e., overfitting. Thus, the… ▽ More

    Submitted 11 December, 2023; v1 submitted 11 December, 2021; originally announced December 2021.

    Comments: Published in TMLR (https://openreview.net/forum?id=231ZzrLC8X)

    Journal ref: Transactions on Machine Learning Research (TMLR), 2835-8856 (12/2023)

  41. arXiv:2112.05844  [pdf, other

    eess.SY

    Economic MPC-based planning for marine vehicles: Tuning safety and energy efficiency

    Authors: Haojiao Liang, Huiping Li, Jian Gao, Rongxin Cui, Demin Xu

    Abstract: Energy efficiency and safety are two critical objectives for marine vehicles operating in environments with obstacles, and they generally conflict with each other. In this paper, we propose a novel online motion planning method of marine vehicles which can make trade-offs between the two design objectives based on the framework of economic model predictive control (EMPC). Firstly, the feasible tra… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

  42. arXiv:2110.12271  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Self-Validation: Early Stopping for Single-Instance Deep Generative Priors

    Authors: Taihui Li, Zhong Zhuang, Hengyue Liang, Le Peng, Hengkang Wang, Ju Sun

    Abstract: Recent works have shown the surprising effectiveness of deep generative models in solving numerous image reconstruction (IR) tasks, even without training data. We call these models, such as deep image prior and deep decoder, collectively as single-instance deep generative priors (SIDGPs). The successes, however, often hinge on appropriate early stopping (ES), which by far has largely been handled… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: To appear in British Machine Vision Conference (BMVC) 2021

  43. arXiv:2107.07988  [pdf, other

    cs.CV cs.LG cs.SD eess.AS eess.IV

    Controlled AutoEncoders to Generate Faces from Voices

    Authors: Hao Liang, Lulan Yu, Guikang Xu, Bhiksha Raj, Rita Singh

    Abstract: Multiple studies in the past have shown that there is a strong correlation between human vocal characteristics and facial features. However, existing approaches generate faces simply from voice, without exploring the set of features that contribute to these observed correlations. A computational methodology to explore this can be devised by rephrasing the question to: "how much would a target face… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

  44. arXiv:2106.12511  [pdf

    eess.IV cs.CV cs.LG

    High-Throughput Precision Phenotyping of Left Ventricular Hypertrophy with Cardiovascular Deep Learning

    Authors: Grant Duffy, Paul P Cheng, Neal Yuan, Bryan He, Alan C. Kwan, Matthew J. Shun-Shin, Kevin M. Alexander, Joseph Ebinger, Matthew P. Lungren, Florian Rader, David H. Liang, Ingela Schnittger, Euan A. Ashley, James Y. Zou, Jignesh Patel, Ronald Witteles, Susan Cheng, David Ouyang

    Abstract: Left ventricular hypertrophy (LVH) results from chronic remodeling caused by a broad range of systemic and cardiovascular disease including hypertension, aortic stenosis, hypertrophic cardiomyopathy, and cardiac amyloidosis. Early detection and characterization of LVH can significantly impact patient care but is limited by under-recognition of hypertrophy, measurement error and variability, and di… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

  45. arXiv:2106.05152  [pdf, other

    eess.IV cs.CV cs.LG

    Rethinking Transfer Learning for Medical Image Classification

    Authors: Le Peng, Hengyue Liang, Gaoxiang Luo, Taihui Li, Ju Sun

    Abstract: Transfer learning (TL) from pretrained deep models is a standard practice in modern medical image classification (MIC). However, what levels of features to be reused are problem-dependent, and uniformly finetuning all layers of pretrained models may be suboptimal. This insight has partly motivated the recent differential TL strategies, such as TransFusion (TF) and layer-wise finetuning (LWFT), whi… ▽ More

    Submitted 26 May, 2024; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted by BMVC2023 (oral)

  46. arXiv:2103.00345  [pdf, other

    cs.RO cs.CR cs.LG eess.SY

    End-to-end Uncertainty-based Mitigation of Adversarial Attacks to Automated Lane Centering

    Authors: Ruochen Jiao, Hengyi Liang, Takami Sato, Junjie Shen, Qi Alfred Chen, Qi Zhu

    Abstract: In the development of advanced driver-assistance systems (ADAS) and autonomous vehicles, machine learning techniques that are based on deep neural networks (DNNs) have been widely used for vehicle perception. These techniques offer significant improvement on average perception accuracy over traditional methods, however, have been shown to be susceptible to adversarial attacks, where small perturba… ▽ More

    Submitted 27 February, 2021; originally announced March 2021.

    Comments: 8 pages for conference

  47. arXiv:2012.09154  [pdf

    eess.IV cs.CV physics.optics

    Exploration of Whether Skylight Polarization Patterns Contain Three-dimensional Attitude Information

    Authors: Huaju Liang, Hongyang Bai, Tong Zhou

    Abstract: Our previous work has demonstrated that Rayleigh model, which is widely used in polarized skylight navigation to describe skylight polarization patterns, does not contain three-dimensional (3D) attitude information [1]. However, it is still necessary to further explore whether the skylight polarization patterns contain 3D attitude information. So, in this paper, a social spider optimization (SSO)… ▽ More

    Submitted 30 November, 2020; originally announced December 2020.

  48. arXiv:2010.08091  [pdf, other

    cs.SD cs.MM eess.AS

    PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music

    Authors: Hongru Liang, Wenqiang Lei, Paul Yaozhu Chan, Zhenglu Yang, Maosong Sun, Tat-Seng Chua

    Abstract: Definitive embeddings remain a fundamental challenge of computational musicology for symbolic music in deep learning today. Analogous to natural language, music can be modeled as a sequence of tokens. This motivates the majority of existing solutions to explore the utilization of word embedding models to build music embeddings. However, music differs from natural languages in two key aspects: (1)… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: ACM Multimedia 2020 -- best paper

  49. Leveraging Weakly-hard Constraints for Improving System Fault Tolerance with Functional and Timing Guarantees

    Authors: Hengyi Liang, Zhilu Wang, Ruochen Jiao, Qi Zhu

    Abstract: Many safety-critical real-time systems operate under harsh environment and are subject to soft errors caused by transient or intermittent faults. It is critical and yet often very challenging to apply fault tolerance techniques in these systems, due to their resource limitations and stringent constraints on timing and functionality. In this work, we leverage the concept of weakly-hard constraints,… ▽ More

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: ICCAD 2020

  50. arXiv:2007.12578  [pdf, other

    eess.IV cs.CV cs.LG

    Stain Style Transfer of Histopathology Images Via Structure-Preserved Generative Learning

    Authors: Hanwen Liang, Konstantinos N. Plataniotis, Xingyu Li

    Abstract: Computational histopathology image diagnosis becomes increasingly popular and important, where images are segmented or classified for disease diagnosis by computers. While pathologists do not struggle with color variations in slides, computational solutions usually suffer from this critical issue. To address the issue of color variations in histopathology images, this study proposes two stain styl… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.