Skip to main content

Showing 1–50 of 65 results for author: Jin, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.04510  [pdf, ps, other

    eess.IV cs.CV

    Dynamic Frequency Feature Fusion Network for Multi-Source Remote Sensing Data Classification

    Authors: Yikang Zhao, Feng Gao, Xuepeng Jin, Junyu Dong, Qian Du

    Abstract: Multi-source data classification is a critical yet challenging task for remote sensing image interpretation. Existing methods lack adaptability to diverse land cover types when modeling frequency domain features. To this end, we propose a Dynamic Frequency Feature Fusion Network (DFFNet) for hyperspectral image (HSI) and Synthetic Aperture Radar (SAR) / Light Detection and Ranging (LiDAR) data joi… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: Accepted by IEEE GRSL

  2. arXiv:2507.04100  [pdf, ps, other

    cs.LG cs.AI eess.SY

    Hierarchical Testing with Rabbit Optimization for Industrial Cyber-Physical Systems

    Authors: Jinwei Hu, Zezhi Tang, Xin Jin, Benyuan Zhang, Yi Dong, Xiaowei Huang

    Abstract: This paper presents HERO (Hierarchical Testing with Rabbit Optimization), a novel black-box adversarial testing framework for evaluating the robustness of deep learning-based Prognostics and Health Management systems in Industrial Cyber-Physical Systems. Leveraging Artificial Rabbit Optimization, HERO generates physically constrained adversarial examples that align with real-world data distributio… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: Preprint accepted by IEEE Transactions on Industrial Cyber Physical Systems

  3. arXiv:2506.07909  [pdf, ps, other

    eess.SP

    Double Low-Rank 4D Tensor Decomposition for Circular RIS-Aided mmWave MIMO-NOMA System Channel Estimation in Mobility Scenarios

    Authors: Wanyuan Cai, Xiaoping Jin, Youming Li, Menglei Sheng, Mingjun Huang, Qinke Qi, Qiang Guo

    Abstract: Channel estimation is not only essential to highly reliable data transmission and massive device access but also an important component of the integrated sensing and communication (ISAC) in the sixth-generation (6G) mobile communication systems. In this paper, we consider a downlink channel estimation problem for circular reconfigurable intelligent surface (RIS)-aided millimeter-wave (mmWave) mult… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  4. arXiv:2506.06190  [pdf, ps, other

    cs.SD cs.GR eess.AS

    NAT: Neural Acoustic Transfer for Interactive Scenes in Real Time

    Authors: Xutong Jin, Bo Pang, Chenxi Xu, Xinyun Hou, Guoping Wang, Sheng Li

    Abstract: Previous acoustic transfer methods rely on extensive precomputation and storage of data to enable real-time interaction and auditory feedback. However, these methods struggle with complex scenes, especially when dynamic changes in object position, material, and size significantly alter sound effects. These continuous variations lead to fluctuating acoustic transfer distributions, making it challen… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  5. arXiv:2505.12089  [pdf, ps, other

    eess.IV cs.AI cs.CV

    NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results

    Authors: Sangmin Lee, Eunpil Park, Angel Canelo, Hyunhee Park, Youngjo Kim, Hyung-Ju Chun, Xin Jin, Chongyi Li, Chun-Le Guo, Radu Timofte, Qi Wu, Tianheng Qiu, Yuchun Dong, Shenglin Ding, Guanghua Pan, Weiyu Zhou, Tao Hu, Yixu Feng, Duwei Dai, Yu Cao, Peng Wu, Wei Dong, Yanning Zhang, Qingsen Yan, Simon J. Larsen , et al. (11 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 Efficient Burst HDR and Restoration Challenge, which aims to advance efficient multi-frame high dynamic range (HDR) and restoration techniques. The challenge is based on a novel RAW multi-frame fusion dataset, comprising nine noisy and misaligned RAW frames with various exposure levels per scene. Participants were tasked with developing solutions capable of effect… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  6. arXiv:2504.12711  [pdf, other

    cs.CV cs.AI eess.IV

    NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

    Authors: Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou , et al. (112 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images. This challenge received a wide range of impressive solutions, which are developed and evaluated using our collected real-world Raindrop Clarity dataset. Unlike existing deraining datasets, our Raindrop Clarity dataset is more diverse and challenging in degradation types and contents, which includ… ▽ More

    Submitted 19 April, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: Challenge Report of CVPR NTIRE 2025; 26 pages; Methods from 32 teams

  7. arXiv:2503.08835  [pdf, other

    eess.SY

    High-Precision Overlay Registration via Spatial-Terminal Iterative Learning in Roll-to-Roll Manufacturing

    Authors: Zifeng Wang, Xiaoning Jin

    Abstract: Roll-to-roll (R2R) printing technologies are promising for high-volume continuous production of substrate-based electronic products. One of the major challenges in R2R flexible electronics printing is achieving tight alignment tolerances, as specified by the device resolution (usually at the micro-meter level), for multi-layer printed electronics. The alignment of the printed patterns in different… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  8. arXiv:2502.10187  [pdf, other

    eess.SY

    Reinforcement Learning based Constrained Optimal Control: an Interpretable Reward Design

    Authors: Jingjie Ni, Fangfei Li, Xin Jin, Xianlun Peng, Yang Tang

    Abstract: This paper presents an interpretable reward design framework for reinforcement learning based constrained optimal control problems with state and terminal constraints. The problem is formalized within a standard partially observable Markov decision process framework. The reward function is constructed from four weighted components: a terminal constraint reward, a guidance reward, a penalty for sta… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  9. arXiv:2412.18933  [pdf, other

    cs.CV cs.MM eess.IV

    TINQ: Temporal Inconsistency Guided Blind Video Quality Assessment

    Authors: Yixiao Li, Xiaoyuan Yang, Weide Liu, Xin Jin, Xu Jia, Yukun Lai, Haotao Liu, Paul L Rosin, Wei Zhou

    Abstract: Blind video quality assessment (BVQA) has been actively researched for user-generated content (UGC) videos. Recently, super-resolution (SR) techniques have been widely applied in UGC. Therefore, an effective BVQA method for both UGC and SR scenarios is essential. Temporal inconsistency, referring to irregularities between consecutive frames, is relevant to video quality. Current BVQA approaches ty… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

  10. arXiv:2412.18158  [pdf, other

    cs.CV eess.IV

    Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task

    Authors: Jinming Liu, Yuntao Wei, Junyan Lin, Shengyang Zhao, Heming Sun, Zhibo Chen, Wenjun Zeng, Xin Jin

    Abstract: While learned image compression methods have achieved impressive results in either human visual perception or machine vision tasks, they are often specialized only for one domain. This drawback limits their versatility and generalizability across scenarios and also requires retraining to adapt to new applications-a process that adds significant complexity and cost in real-world scenarios. In this… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

  11. arXiv:2411.06357  [pdf

    eess.IV

    A Diffuse Light Field Imaging Model for Forward-Scattering Photon-Coded Signal Retrieval

    Authors: Hongkun Cao, Xin Jin, Junjie Wei, Yihui Fan, Dongyu Du

    Abstract: Scattering imaging is often hindered by extremely low signal-to-noise ratios (SNRs) due to the prevalence of scattering noise. Light field imaging has been shown to be effective in suppressing noise and collect more ballistic photons as signals. However, to overcome the SNR limit in super-strong scattering environments, even with light field framework, only rare ballistic signals are insufficient.… ▽ More

    Submitted 9 November, 2024; originally announced November 2024.

  12. arXiv:2410.18094  [pdf, other

    q-bio.QM cs.AI cs.LG eess.SP

    Self-supervised inter-intra period-aware ECG representation learning for detecting atrial fibrillation

    Authors: Xiangqian Zhu, Mengnan Shi, Xuexin Yu, Chang Liu, Xiaocong Lian, Jintao Fei, Jiangying Luo, Xin Jin, Ping Zhang, Xiangyang Ji

    Abstract: Atrial fibrillation is a commonly encountered clinical arrhythmia associated with stroke and increased mortality. Since professional medical knowledge is required for annotation, exploiting a large corpus of ECGs to develop accurate supervised learning-based atrial fibrillation algorithms remains challenging. Self-supervised learning (SSL) is a promising recipe for generalized ECG representation l… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: Preprint submitted to Biomedical Signal Processing and Control

  13. DiffSound: Differentiable Modal Sound Rendering and Inverse Rendering for Diverse Inference Tasks

    Authors: Xutong Jin, Chenxi Xu, Ruohan Gao, Jiajun Wu, Guoping Wang, Sheng Li

    Abstract: Accurately estimating and simulating the physical properties of objects from real-world sound recordings is of great practical importance in the fields of vision, graphics, and robotics. However, the progress in these directions has been limited -- prior differentiable rigid or soft body simulation techniques cannot be directly applied to modal sound synthesis due to the high sampling rate of audi… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: 12 pages, 10 figures. Published in Siggraph 2024. Project page: https://hellojxt.github.io/DiffSound/

  14. arXiv:2409.07891  [pdf, other

    cs.CL cs.SD eess.AS

    A corpus-based investigation of pitch contours of monosyllabic words in conversational Taiwan Mandarin

    Authors: Xiaoyun Jin, Mirjam Ernestus, R. Harald Baayen

    Abstract: In Mandarin, the tonal contours of monosyllabic words produced in isolation or in careful speech are characterized by four lexical tones: a high-level tone (T1), a rising tone (T2), a dipping tone (T3) and a falling tone (T4). However, in spontaneous speech, the actual tonal realization of monosyllabic words can deviate significantly from these canonical tones due to intra-syllabic co-articulation… ▽ More

    Submitted 19 October, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

  15. arXiv:2408.14255  [pdf, other

    eess.IV

    MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification

    Authors: Feng Gao, Xuepeng Jin, Xiaowei Zhou, Junyu Dong, Qian Du

    Abstract: In the field of multi-source remote sensing image classification, remarkable progress has been made by using Convolutional Neural Network (CNN) and Transformer. Recently, Mamba-based methods built upon the State Space Model (SSM) have shown great potential for long-range dependency modeling with linear complexity, but they have rarely been explored for multi-source remote sensing image classificat… ▽ More

    Submitted 26 January, 2025; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: IEEE TGRS 2025

  16. arXiv:2407.11700  [pdf, other

    cs.CV eess.IV

    Rate-Distortion-Cognition Controllable Versatile Neural Image Compression

    Authors: Jinming Liu, Ruoyu Feng, Yunpeng Qi, Qiuyu Chen, Zhibo Chen, Wenjun Zeng, Xin Jin

    Abstract: Recently, the field of Image Coding for Machines (ICM) has garnered heightened interest and significant advances thanks to the rapid progress of learning-based techniques for image compression and analysis. Previous studies often require training separate codecs to support various bitrate levels, machine tasks, and networks, thus lacking both flexibility and practicality. To address these challeng… ▽ More

    Submitted 17 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: ECCV2024

  17. arXiv:2406.01245  [pdf, other

    eess.IV

    Sparse Focus Network for Multi-Source Remote Sensing Data Classification

    Authors: Xuepeng Jin, Junyan Lin, Feng Gao, Lin Qi, Yang Zhou

    Abstract: Multi-source remote sensing data classification has emerged as a prominent research topic with the advancement of various sensors. Existing multi-source data classification methods are susceptible to irrelevant information interference during multi-source feature extraction and fusion. To solve this issue, we propose a sparse focus network for multi-source data classification. Sparse attention is… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE IGARSS 2024

  18. arXiv:2406.01235  [pdf, other

    eess.IV

    Boosting Spatial-Spectral Masked Auto-Encoder Through Mining Redundant Spectra for HSI-SAR/LiDAR Classification

    Authors: Junyan Lin, Xuepeng Jin, Feng Gao, Junyu Dong, Hui Yu

    Abstract: Although recent masked image modeling (MIM)-based HSI-LiDAR/SAR classification methods have gradually recognized the importance of the spectral information, they have not adequately addressed the redundancy among different spectra, resulting in information leakage during the pretraining stage. This issue directly impairs the representation ability of the model. To tackle the problem, we propose a… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by IGARSS 2024

  19. arXiv:2404.15750  [pdf, other

    eess.SP

    A Reconfigurable Subarray Architecture and Hybrid Beamforming for Millimeter-Wave Dual-Function-Radar-Communication Systems

    Authors: Xin Jin, Tiejun Lv, Wei Ni, Zhipeng Lin, Qiuming Zhu, Ekram Hossain, H. Vincent Poor

    Abstract: Dual-function-radar-communication (DFRC) is a promising candidate technology for next-generation networks. By integrating hybrid analog-digital (HAD) beamforming into a multi-user millimeter-wave (mmWave) DFRC system, we design a new reconfigurable subarray (RS) architecture and jointly optimize the HAD beamforming to maximize the communication sum-rate and ensure a prescribed signal-to-clutter-pl… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 14 pages, 9 figures, Accepted by IEEE TWC

  20. arXiv:2404.10235  [pdf, ps, other

    eess.SP

    Integrated Sensing and Communication for Edge Inference with End-to-End Multi-View Fusion

    Authors: Xibin Jin, Guoliang Li, Shuai Wang, Miaowen Wen, Chengzhong Xu, H. Vincent Poor

    Abstract: Integrated sensing and communication (ISAC) is a promising solution to accelerate edge inference via the dual use of wireless signals. However, this paradigm needs to minimize the inference error and latency under ISAC co-functionality interference, for which the existing ISAC or edge resource allocation algorithms become inefficient, as they ignore the inter-dependency between low-level ISAC desi… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  21. arXiv:2404.01632  [pdf, other

    cs.LG eess.SY

    Enhancing Functional Safety in Automotive AMS Circuits through Unsupervised Machine Learning

    Authors: Ayush Arunachalam, Ian Kintz, Suvadeep Banerjee, Arnab Raha, Xiankun Jin, Fei Su, Viswanathan Pillai Prasanth, Rubin A. Parekhji, Suriyaprakash Natarajan, Kanad Basu

    Abstract: Given the widespread use of safety-critical applications in the automotive field, it is crucial to ensure the Functional Safety (FuSa) of circuits and components within automotive systems. The Analog and Mixed-Signal (AMS) circuits prevalent in these systems are more vulnerable to faults induced by parametric perturbations, noise, environmental stress, and other factors, in comparison to their dig… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 12 figures

  22. arXiv:2401.14750  [pdf, ps, other

    eess.SY

    A Stochastic Hybrid Approach to Decentralized Networked Control: Stochastic Network Delays and Poisson Pulsing Attacks

    Authors: Dandan Zhang, Xin Jin, Hongye Su

    Abstract: By designing the decentralized time-regularized (Zeno-free) event-triggered strategies for the state-feedback control law, this paper considers the stochastic stabilization of a class of networked control systems, where two sources of randomness exist in multiple decentralized networks that operate asynchronously and independently: the communication channels are constrained by the stochastic netwo… ▽ More

    Submitted 12 June, 2025; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 17 pages, 11 figures

  23. arXiv:2401.02678  [pdf, other

    cs.SD cs.MM eess.AS

    MusicAOG: an Energy-Based Model for Learning and Sampling a Hierarchical Representation of Symbolic Music

    Authors: Yikai Qian, Tianle Wang, Xinyi Tong, Xin Jin, Duo Xu, Bo Zheng, Tiezheng Ge, Feng Yu, Song-Chun Zhu

    Abstract: In addressing the challenge of interpretability and generalizability of artificial music intelligence, this paper introduces a novel symbolic representation that amalgamates both explicit and implicit musical information across diverse traditions and granularities. Utilizing a hierarchical and-or graph representation, the model employs nodes and edges to encapsulate a broad spectrum of musical ele… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  24. arXiv:2310.19288  [pdf, other

    eess.IV cs.CV

    EDiffSR: An Efficient Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution

    Authors: Yi Xiao, Qiangqiang Yuan, Kui Jiang, Jiang He, Xianyu Jin, Liangpei Zhang

    Abstract: Recently, convolutional networks have achieved remarkable development in remote sensing image Super-Resoltuion (SR) by minimizing the regression objectives, e.g., MSE loss. However, despite achieving impressive performance, these methods often suffer from poor visual quality with over-smooth issues. Generative adversarial networks have the potential to infer intricate details, but they are easy to… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Submitted to IEEE TGRS

  25. arXiv:2310.14965  [pdf, ps, other

    eess.IV physics.optics

    Parallel compressive super-resolution imaging with wide field-of-view based on physics enhanced network

    Authors: Xiao-Peng Jin, An-Dong Xiong, Wei Zhang, Xiao-Qing Wang, Fan Liu, Chang-Heng Li, Xu-Ri Yao, Xue-Feng Liu, Qing Zhao

    Abstract: Achieving both high-performance and wide field-of-view (FOV) super-resolution imaging has been attracting increasing attention in recent years. However, such goal suffers from long reconstruction time and huge storage space. Parallel compressive imaging (PCI) provides an efficient solution, but the super-resolution quality and imaging speed are strongly dependent on precise optical transfer functi… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  26. arXiv:2308.03448  [pdf, other

    cs.CV eess.IV

    Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model

    Authors: Xin Jin, Jia-Wen Xiao, Ling-Hao Han, Chunle Guo, Xialei Liu, Chongyi Li, Ming-Ming Cheng

    Abstract: Explicit calibration-based methods have dominated RAW image denoising under extremely low-light environments. However, these methods are impeded by several critical limitations: a) the explicit calibration process is both labor- and time-intensive, b) challenge exists in transferring denoisers across different camera models, and c) the disparity between synthetic and real noise is exacerbated by d… ▽ More

    Submitted 25 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  27. arXiv:2307.00954  [pdf, other

    cs.CV eess.IV

    HODINet: High-Order Discrepant Interaction Network for RGB-D Salient Object Detection

    Authors: Kang Yi, Jing Xu, Xiao Jin, Fu Guo, Yan-Feng Wu

    Abstract: RGB-D salient object detection (SOD) aims to detect the prominent regions by jointly modeling RGB and depth information. Most RGB-D SOD methods apply the same type of backbones and fusion modules to identically learn the multimodality and multistage features. However, these features contribute differently to the final saliency results, which raises two issues: 1) how to model discrepant characteri… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  28. arXiv:2306.02309  [pdf, other

    eess.SY

    Synchronization of multiple rigid body systems: a survey

    Authors: X. Jin, Daniel W. C. Ho, Y. Tang

    Abstract: The multi-agent system has been a hot topic in the past few decades owing to its lower cost, higher robustness, and higher flexibility. As a particular multi-agent system, the multiple rigid body system received a growing interest for its wide applications in transportation, aerospace, and ocean exploration. Due to the non-Euclidean configuration space of attitudes and the inherent nonlinearity of… ▽ More

    Submitted 27 August, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

  29. arXiv:2305.11715  [pdf

    eess.IV cs.CV physics.med-ph

    A quality assurance framework for real-time monitoring of deep learning segmentation models in radiotherapy

    Authors: Xiyao Jin, Yao Hao, Jessica Hilliard, Zhehao Zhang, Maria A. Thomas, Hua Li, Abhinav K. Jha, Geoffrey D. Hugo

    Abstract: To safely deploy deep learning models in the clinic, a quality assurance framework is needed for routine or continuous monitoring of input-domain shift and the models' performance without ground truth contours. In this work, cardiac substructure segmentation was used as an example task to establish a QA framework. A benchmark dataset consisting of Computed Tomography (CT) images along with manual… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  30. arXiv:2305.02586  [pdf, other

    eess.IV cs.CV

    Semantically Structured Image Compression via Irregular Group-Based Decoupling

    Authors: Ruoyu Feng, Yixin Gao, Xin Jin, Runsen Feng, Zhibo Chen

    Abstract: Image compression techniques typically focus on compressing rectangular images for human consumption, however, resulting in transmitting redundant content for downstream applications. To overcome this limitation, some previous works propose to semantically structure the bitstream, which can meet specific application requirements by selective transmission and reconstruction. Nevertheless, they divi… ▽ More

    Submitted 2 March, 2025; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accept by ICCV2023

  31. arXiv:2304.11521  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    An Order-Complexity Model for Aesthetic Quality Assessment of Homophony Music Performance

    Authors: Xin Jin, Wu Zhou, Jinyu Wang, Duo Xu, Yiqing Rong, Jialin Sun

    Abstract: Although computational aesthetics evaluation has made certain achievements in many fields, its research of music performance remains to be explored. At present, subjective evaluation is still a ultimate method of music aesthetics research, but it will consume a lot of human and material resources. In addition, the music performance generated by AI is still mechanical, monotonous and lacking in bea… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Journal ref: AIART 2023 ICME Workshop

  32. arXiv:2303.06859  [pdf, other

    cs.CV cs.MM eess.IV

    Learning Distortion Invariant Representation for Image Restoration from A Causality Perspective

    Authors: Xin Li, Bingchen Li, Xin Jin, Cuiling Lan, Zhibo Chen

    Abstract: In recent years, we have witnessed the great advancement of Deep neural networks (DNNs) in image restoration. However, a critical limitation is that they cannot generalize well to real-world degradations with different degrees or types. In this paper, we are the first to propose a novel training strategy for image restoration from the causality perspective, to improve the generalization ability of… ▽ More

    Submitted 31 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR2023

  33. arXiv:2303.05744  [pdf

    eess.IV cs.AI cs.MM

    QVRF: A Quantization-error-aware Variable Rate Framework for Learned Image Compression

    Authors: Kedeng Tong, Yaojun Wu, Yue Li, Kai Zhang, Li Zhang, Xin Jin

    Abstract: Learned image compression has exhibited promising compression performance, but variable bitrates over a wide range remain a challenge. State-of-the-art variable rate methods compromise the loss of model performance and require numerous additional parameters. In this paper, we present a Quantization-error-aware Variable Rate Framework (QVRF) that utilizes a univariate quantization regulator a to ac… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: 7 pages, 6 figures

  34. arXiv:2301.05908  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    An Order-Complexity Model for Aesthetic Quality Assessment of Symbolic Homophony Music Scores

    Authors: Xin Jin, Wu Zhou, Jinyu Wang, Duo Xu, Yiqing Rong, Shuai Cui

    Abstract: Computational aesthetics evaluation has made great achievements in the field of visual arts, but the research work on music still needs to be explored. Although the existing work of music generation is very substantial, the quality of music score generated by AI is relatively poor compared with that created by human composers. The music scores created by AI are usually monotonous and devoid of emo… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

  35. arXiv:2206.15077  [pdf, ps, other

    eess.SY

    A perspective on Attitude Control Issues and Techniques

    Authors: Dandan Zhang, Xin Jin, Hongye Su

    Abstract: This paper reviews the attitude control problems for rigid-body systems, starting from the attitude representation for rigid body kinematics. Highly redundant rotation matrix defines the attitude orientation globally and uniquely by 9 parameters, which is the most fundamental one, without any singularities; minimum 3-parameter Euler angles or (modified) Rodrigues parameters define the attitude ori… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: 13 pages, 6 figures, 2 tables

    MSC Class: 70E50 ACM Class: J.2.1; J.2.7; A.1

  36. arXiv:2203.06882  [pdf, other

    eess.SY

    Robust Event Triggering Control for Lateral Dynamics of Intelligent Vehicles with Designable Inter-event Times

    Authors: Xing Chu, Zhi Liu, Lei Mao, Xin Jin, Zhaoxia Peng, Guoguang Wen

    Abstract: In this brief, an improved event-triggered update mechanism (ETM) for the linear quadratic regulator is proposed to solve the lateral motion control problem of intelligent vehicle under bounded disturbances. Based on a novel event function using a clock-like variable to determine the triggering time, we further introduce two new design parameters to improve control performance. Distinct from exist… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 5pages, 4 figures

  37. arXiv:2202.10837  [pdf

    eess.IV cs.CV

    SADN: Learned Light Field Image Compression with Spatial-Angular Decorrelation

    Authors: Kedeng Tong, Xin Jin, Chen Wang, Fan Jiang

    Abstract: Light field image becomes one of the most promising media types for immersive video applications. In this paper, we propose a novel end-to-end spatial-angular-decorrelated network (SADN) for high-efficiency light field image compression. Different from the existing methods that exploit either spatial or angular consistency in the light field image, SADN decouples the angular and spatial informatio… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  38. Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression

    Authors: Zongyu Guo, Runsen Feng, Zhizheng Zhang, Xin Jin, Zhibo Chen

    Abstract: Neural video codecs have demonstrated great potential in video transmission and storage applications. Existing neural hybrid video coding approaches rely on optical flow or Gaussian-scale flow for prediction, which cannot support fine-grained adaptation to diverse motion content. Towards more content-adaptive prediction, we propose a novel cross-scale prediction module that achieves more effective… ▽ More

    Submitted 15 March, 2023; v1 submitted 25 December, 2021; originally announced December 2021.

    Comments: Preprint. Revised after peer-reviewimg

  39. arXiv:2111.13078  [pdf, other

    cs.CV eess.IV

    A Close Look at Few-shot Real Image Super-resolution from the Distortion Relation Perspective

    Authors: Xin Li, Xin Jin, Jun Fu, Xiaoyuan Yu, Bei Tong, Zhibo Chen

    Abstract: Collecting amounts of distorted/clean image pairs in the real world is non-trivial, which seriously limits the practical applications of these supervised learning-based methods on real-world image super-resolution (RealSR). Previous works usually address this problem by leveraging unsupervised learning-based technologies to alleviate the dependency on paired training samples. However, these method… ▽ More

    Submitted 18 April, 2023; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: 12 pages, first paper for few-shot real image super-resolution

  40. arXiv:2109.04887  [pdf, ps, other

    eess.IV physics.optics

    Mid-wave infrared super-resolution imaging based on compressive calibration and sampling

    Authors: Xiao-Peng Jin, Qing Zhao, Xue-Feng Liu, An-Dong Xiong

    Abstract: Mid-wave infrared (MWIR) cameras for large number pixels are extremely expensive compared with their counterparts in visible light, thus, super-resolution imaging (SRI) for MWIR by increasing imaging pixels has always been a research hotspot in recent years. Over the last decade, with the extensively investigation of the compressed sensing (CS) method, focal plane array (FPA) based compressive ima… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

  41. arXiv:2108.13249  [pdf, ps, other

    cs.SD eess.AS

    RSKNet-MTSP: Effective and Portable Deep Architecture for Speaker Verification

    Authors: Yanfeng Wu, Chenkai Guo, Junan Zhao, Xiao Jin, Jing Xu

    Abstract: The convolutional neural network (CNN) based approaches have shown great success for speaker verification (SV) tasks, where modeling long temporal context and reducing information loss of speaker characteristics are two important challenges significantly affecting the verification performance. Previous works have introduced dilated convolution and multi-scale aggregation methods to address above c… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: submitted to Neurocomputing

  42. Integrated Decision and Control at Multi-Lane Intersections with Mixed Traffic Flow

    Authors: Jianhua Jiang, Yangang Ren, Yang Guan, Shengbo Eben Li, Yuming Yin, Xiaoping Jin

    Abstract: Autonomous driving at intersections is one of the most complicated and accident-prone traffic scenarios, especially with mixed traffic participants such as vehicles, bicycles and pedestrians. The driving policy should make safe decisions to handle the dynamic traffic conditions and meet the requirements of on-board computation. However, most of the current researches focuses on simplified intersec… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: 8 pages, 10 figures, 11 equations and 14 conferences

  43. arXiv:2108.07425  [pdf, other

    cs.SD cs.GR eess.AS

    NeuralSound: Learning-based Modal Sound Synthesis With Acoustic Transfer

    Authors: Xutong Jin, Sheng Li, Guoping Wang, Dinesh Manocha

    Abstract: We present a novel learning-based modal sound synthesis approach that includes a mixed vibration solver for modal analysis and an end-to-end sound radiation network for acoustic transfer. Our mixed vibration solver consists of a 3D sparse convolution network and a Locally Optimal Block Preconditioned Conjugate Gradient module (LOBPCG) for iterative optimization. Moreover, we highlight the correlat… ▽ More

    Submitted 28 May, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

  44. Joint Secure Design of Downlink and D2D Cooperation Strategies for Multi-User Systems

    Authors: Seok-Hwan Park, Xianglan Jin

    Abstract: This work studies the role of inter-user device-to-device (D2D) cooperation for improving physical-layer secret communication in multi-user downlink systems. It is assumed that there are out-of-band D2D channels, on each of which a selected legitimate user transmits an amplified version of the received downlink signal to other legitimate users. A key technical challenge for designing such systems… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: Accepted for publication on IEEE Signal Processing Letters

  45. arXiv:2104.01592  [pdf, other

    eess.IV cs.CV

    Synthesizing MR Image Contrast Enhancement Using 3D High-resolution ConvNets

    Authors: Chao Chen, Catalina Raymond, Bill Speier, Xinyu Jin, Timothy F. Cloughesy, Dieter Enzmann, Benjamin M. Ellingson, Corey W. Arnold

    Abstract: \textit{Objective:} Gadolinium-based contrast agents (GBCAs) have been widely used to better visualize disease in brain magnetic resonance imaging (MRI). However, gadolinium deposition within the brain and body has raised safety concerns about the use of GBCAs. Therefore, the development of novel approaches that can decrease or even eliminate GBCA exposure while providing similar contrast informat… ▽ More

    Submitted 16 July, 2022; v1 submitted 4 April, 2021; originally announced April 2021.

    Comments: This paper is accpted by IEEE TBME, Code is available at \url{https://github.com/chenchao666/Contrast-enhanced-MRI-Synthesis}

  46. arXiv:2103.06677  [pdf, other

    eess.SP

    Plane Spiral OAM Mode-Group Based MIMO Communications: An Experimental Study

    Authors: Xiaowen Xiong, Shilie Zheng, Zelin Zhu, Yuqi Chen, Hongzhe Shi, Bingchen Pan, Cheng Ren, Xianbin Yu, Xiaofeng Jin, Wei E. I. Sha, Xianmin Zhang

    Abstract: Spatial division multiplexing using conventional orbital angular momentum (OAM) has become a well-known physical layer transmission method over the past decade. The mode-group (MG) superposed by specific single mode plane spiral OAM (PSOAM) waves has been proved to be a flexible beamforming method to achieve the azimuthal pattern diversity, which inherits the spiral phase distribution of conventio… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

  47. arXiv:2012.09550  [pdf, other

    eess.IV cs.CV

    Learned Block-based Hybrid Image Compression

    Authors: Yaojun Wu, Xin Li, Zhizheng Zhang, Xin Jin, Zhibo Chen

    Abstract: Recent works on learned image compression perform encoding and decoding processes in a full-resolution manner, resulting in two problems when deployed for practical applications. First, parallel acceleration of the autoregressive entropy model cannot be achieved due to serial decoding. Second, full-resolution inference often causes the out-of-memory(OOM) problem with limited GPU resources, especia… ▽ More

    Submitted 11 October, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: 13 pages, 13 figures, accepted by IEEE Trans. on Circuits and Systems for Video Technology

  48. arXiv:2012.06131  [pdf, other

    cs.CV eess.IV

    Learning Omni-frequency Region-adaptive Representations for Real Image Super-Resolution

    Authors: Xin Li, Xin Jin, Tao Yu, Yingxue Pang, Simeng Sun, Zhizheng Zhang, Zhibo Chen

    Abstract: Traditional single image super-resolution (SISR) methods that focus on solving single and uniform degradation (i.e., bicubic down-sampling), typically suffer from poor performance when applied into real-world low-resolution (LR) images due to the complicated realistic degradations. The key to solving this more challenging real image super-resolution (RealSR) problem lies in learning feature repres… ▽ More

    Submitted 10 January, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: Accepted by AAAI2021

  49. arXiv:2010.06718  [pdf, other

    eess.SY

    Grid-Interactive Multi-Zone Building Control Using Reinforcement Learning with Global-Local Policy Search

    Authors: Xiangyu Zhang, Rohit Chintala, Andrey Bernstein, Peter Graf, Xin Jin

    Abstract: In this paper, we develop a grid-interactive multi-zone building controller based on a deep reinforcement learning (RL) approach. The controller is designed to facilitate building operation during normal conditions and demand response events, while ensuring occupants comfort and energy efficiency. We leverage a continuous action space RL formulation, and devise a two-stage global-local RL training… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

  50. arXiv:2009.14547  [pdf, other

    eess.IV cs.CV cs.LG

    FAN: Frequency Aggregation Network for Real Image Super-resolution

    Authors: Yingxue Pang, Xin Li, Xin Jin, Yaojun Wu, Jianzhao Liu, Sen Liu, Zhibo Chen

    Abstract: Single image super-resolution (SISR) aims to recover the high-resolution (HR) image from its low-resolution (LR) input image. With the development of deep learning, SISR has achieved great progress. However, It is still a challenge to restore the real-world LR image with complicated authentic degradations. Therefore, we propose FAN, a frequency aggregation network, to address the real-world image… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

    Comments: 14 pages, 7 figures, presented as a workshop paper at AIM 2020 Challenge @ ECCV 2020