
Showing 1–50 of 69 results for author: Qi, J

Searching in archive eess.
  1. arXiv:2506.13971  [pdf, other]

    eess.AS cs.CL cs.HC cs.LG cs.MM

    Multimodal Fusion with Semi-Supervised Learning Minimizes Annotation Quantity for Modeling Videoconference Conversation Experience

    Authors: Andrew Chang, Chenkai Hu, Ji Qi, Zhuojian Wei, Kexin Zhang, Viswadruth Akkaraju, David Poeppel, Dustin Freeman

    Abstract: Group conversations over videoconferencing are a complex social behavior. However, the subjective moments of negative experience, where the conversation loses fluidity or enjoyment, remain understudied. These moments are infrequent in naturalistic data, and thus training a supervised learning (SL) model requires costly manual data annotation. We applied semi-supervised learning (SSL) to leverage ta…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: Interspeech 2025

  2. arXiv:2505.13102  [pdf, ps, other]

    cs.LG cs.AI eess.SP

    Lightweight Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast

    Authors: Ji Qi, Tam Thuc Do, Mingxiao Liu, Zhuoshi Pan, Yuzhe Li, Gene Cheung, H. Vicky Zhao

    Abstract: To forecast traffic with both spatial and temporal dimensions, we unroll a mixed-graph-based optimization algorithm into a lightweight and interpretable transformer-like neural net. Specifically, we construct two graphs: an undirected graph $\mathcal{G}^u$ capturing spatial correlations across geography, and a directed graph $\mathcal{G}^d$ capturing sequential relationships over time. We formulat…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 19 pages, 5 figures, 8 tables

  3. arXiv:2505.06025  [pdf, ps, other]

    cs.NI cs.DC eess.SY

    Efficient Information Updates in Compute-First Networking via Reinforcement Learning with Joint AoI and VoI

    Authors: Jianpeng Qi, Chao Liu, Chengxiang Xu, Rui Wang, Junyu Dong, Yanwei Yu

    Abstract: Timely and efficient dissemination of service information is critical in compute-first networking systems, where user requests arrive dynamically and computing resources are constrained. In such systems, the access point (AP) plays a key role in forwarding user requests to a server based on its latest received service information. This paper considers a single-source, single-destination system and…

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 11 pages, 40 figures

  4. arXiv:2504.08504  [pdf, other]

    eess.SP

    STF-GCN: A Multi-Domain Graph Convolution Network Method for Automatic Modulation Recognition via Adaptive Correlation

    Authors: Mingyuan Shao, Zhengqiu Fu, Dingzhao Li, Fuqing Zhang, Yilin Cai, Shaohua Hong, Lin Cao, Yuan Peng, Jie Qi

    Abstract: Automatic Modulation Recognition (AMR) is an essential part of Intelligent Transportation System (ITS) dynamic spectrum allocation. However, current deep learning-based AMR (DL-AMR) methods are challenged to extract discriminative and robust features at low signal-to-noise ratios (SNRs), where the representation of modulation symbols is highly interfered by noise. Furthermore, current research on…

    Submitted 11 April, 2025; originally announced April 2025.

  5. arXiv:2503.00760  [pdf, other]

    eess.IV cs.CV

    NCF: Neural Correspondence Field for Medical Image Registration

    Authors: Lei Zhou, Nimu Yuan, Katjana Ehrlich, Jinyi Qi

    Abstract: Deformable image registration is a fundamental task in medical image processing. Traditional optimization-based methods often struggle with accuracy in dealing with complex deformation. Recently, learning-based methods have achieved good performance on public datasets, but the scarcity of medical image data makes it challenging to build a generalizable model to handle diverse real-world scenarios.…

    Submitted 2 March, 2025; originally announced March 2025.

  6. arXiv:2501.18201  [pdf, other]

    cs.AI eess.SY

    Neural Operator based Reinforcement Learning for Control of first-order PDEs with Spatially-Varying State Delay

    Authors: Jiaqi Hu, Jie Qi, Jing Zhang

    Abstract: Control of distributed parameter systems affected by delays is a challenging task, particularly when the delays depend on spatial variables. The idea of integrating analytical control theory with learning-based control within a unified control scheme is becoming increasingly promising and advantageous. In this paper, we address the problem of controlling an unstable first-order hyperbolic PDE with…

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: 6 Pages, 7 Figures

  7. arXiv:2501.09759  [pdf]

    eess.SP physics.app-ph

    A wideband amplifying and filtering reconfigurable intelligent surface for wireless relay

    Authors: Lijie Wu, Qun Yan Zhou, Jun Yan Dai, Siran Wang, Junwei Zhang, Zhen Jie Qi, Hanqing Yang, Ruizhe Jiang, Zheng Xing Wang, Huidong Li, Zhen Zhang, Jiang Luo, Qiang Cheng, Tie Jun Cui

    Abstract: Programmable metasurfaces have garnered significant attention due to their exceptional ability to manipulate electromagnetic (EM) waves in real time, leading to the emergence of a prominent area in wireless communication, namely reconfigurable intelligent surfaces (RISs), to control the signal propagation and coverage. However, the existing RISs usually suffer from limited operating distance and b…

    Submitted 31 December, 2024; originally announced January 2025.

  8. arXiv:2501.05961  [pdf, other]

    cs.CV eess.IV

    Swin-X2S: Reconstructing 3D Shape from 2D Biplanar X-ray with Swin Transformers

    Authors: Kuan Liu, Zongyuan Ying, Jie Jin, Dongyan Li, Ping Huang, Wenjian Wu, Zhe Chen, Jin Qi, Yong Lu, Lianfu Deng, Bo Chen

    Abstract: The conversion from 2D X-ray to 3D shape holds significant potential for improving diagnostic efficiency and safety. However, existing reconstruction methods often rely on hand-crafted features, manual intervention, and prior knowledge, resulting in unstable shape errors and additional processing costs. In this paper, we introduce Swin-X2S, an end-to-end deep learning method for directly reconstru…

    Submitted 10 January, 2025; originally announced January 2025.

  9. arXiv:2412.08219  [pdf, other]

    eess.SY

    Neural Operator Feedback for a First-Order PIDE with Spatially-Varying State Delay

    Authors: Jie Qi, Jiaqi Hu, Jing Zhang, Miroslav Krstic

    Abstract: A transport PDE with a spatial integral and recirculation with constant delay has been a benchmark for neural operator approximations of PDE backstepping controllers. Introducing a spatially-varying delay into the model gives rise to a gain operator defined through integral equations which the operator's input -- the varying delay function -- enters in previously unencountered manners, including i…

    Submitted 14 December, 2024; v1 submitted 11 December, 2024; originally announced December 2024.

    Comments: This 14 page paper contains 1 table and 20 figures

  10. arXiv:2410.18610  [pdf, other]

    eess.IV cs.CV

    A Joint Representation Using Continuous and Discrete Features for Cardiovascular Diseases Risk Prediction on Chest CT Scans

    Authors: Minfeng Xu, Chen-Chen Fan, Yan-Jie Zhou, Wenchao Guo, Pan Liu, Jing Qi, Le Lu, Hanqing Chao, Kunlun He

    Abstract: Cardiovascular diseases (CVD) remain a leading health concern and contribute significantly to global mortality rates. While clinical advancements have led to a decline in CVD mortality, accurately identifying individuals who could benefit from preventive interventions remains an unsolved challenge in preventive cardiology. Current CVD risk prediction models, recommended by guidelines, are based on…

    Submitted 15 November, 2024; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: 23 pages, 9 figures

  11. arXiv:2410.06115  [pdf, other]

    cs.IT eess.SP

    A physics-based perspective for understanding and utilizing spatial resources of wireless channels

    Authors: Hui Xu, Jun Wei Wu, Zhen Jie Qi, Hao Tian Wu, Rui Wen Shao, Qiang Cheng, Jieao Zhu, Linglong Dai, Tie Jun Cui

    Abstract: To satisfy the increasing demands for transmission rates of wireless communications, it is necessary to use spatial resources of electromagnetic (EM) waves. In this context, EM information theory (EIT) has become a hot topic by integrating the theoretical framework of deterministic mathematics and stochastic statistics to explore the transmission mechanisms of continuous EM waves. However, the pre…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 31 pages, 8 figures

  12. arXiv:2409.15596  [pdf, other]

    eess.SP

    Computational Ghost Imaging with Low-Density Parity-Check Code

    Authors: Shuang Liu, Yunkai Hu, Jinquan Qi, Shensheng Han, Zihuai Lin

    Abstract: Ghost imaging (GI) is a high-resolution imaging technology that has been a subject of interest to many fields in the past 20 years. Most GI researchers focus on the reconstruction of signal under-sampling; nevertheless, how to use information redundancy to improve the result's belief in a complex environment has hardly been studied. Motivated by this, we propose a computational GI system based on…

    Submitted 23 September, 2024; originally announced September 2024.

  13. arXiv:2311.03557  [pdf, other]

    cs.LG cs.CV eess.IV

    Spatio-Temporal Similarity Measure based Multi-Task Learning for Predicting Alzheimer's Disease Progression using MRI Data

    Authors: Xulong Wang, Yu Zhang, Menghui Zhou, Tong Liu, Jun Qi, Po Yang

    Abstract: Identifying and utilising various biomarkers for tracking Alzheimer's disease (AD) progression has received much recent attention and helps clinicians make prompt decisions. Traditional progression models focus on extracting morphological biomarkers in regions of interest (ROIs) from MRI/PET images, such as regional average cortical thickness and regional volume. They are effective…

    Submitted 6 November, 2023; originally announced November 2023.

  14. arXiv:2307.11436  [pdf, other]

    math.OC cs.LG eess.SY math.AP

    Neural Operators for PDE Backstepping Control of First-Order Hyperbolic PIDE with Recycle and Delay

    Authors: Jie Qi, Jing Zhang, Miroslav Krstic

    Abstract: The recently introduced DeepONet operator-learning framework for PDE control is extended from the results for basic hyperbolic and parabolic PDEs to an advanced hyperbolic class that involves delays on both the state and the system output or input. The PDE backstepping design produces gain functions that are outputs of a nonlinear operator, mapping functions on a spatial domain into functions on a…

    Submitted 14 June, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: 20 pages

    Journal ref: Systems & Control Letters, 2024

  15. arXiv:2307.11424  [pdf, ps, other]

    math.OC eess.SY math.AP physics.class-ph physics.flu-dyn

    Robust stabilization of $2 \times 2$ first-order hyperbolic PDEs with uncertain input delay

    Authors: Jing Zhang, Jie Qi

    Abstract: A backstepping-based compensator design is developed for a system of $2\times2$ first-order linear hyperbolic partial differential equations (PDE) in the presence of an uncertain long input delay at the boundary. We introduce a transport PDE to represent the delayed input, which leads to three coupled first-order hyperbolic PDEs. A novel backstepping transformation, composed of two Volterra transforma…

    Submitted 21 July, 2023; originally announced July 2023.

  16. arXiv:2307.04212  [pdf, other]

    math.AP eess.SY math.OC physics.class-ph physics.flu-dyn

    Delay-Adaptive Control of First-order Hyperbolic PIDEs

    Authors: Shanshan Wang, Jie Qi, Miroslav Krstic

    Abstract: We develop a delay-adaptive controller for a class of first-order hyperbolic partial integro-differential equations (PIDEs) with an unknown input delay. By employing a transport PDE to represent delayed actuator states, the system is transformed into a transport partial differential equation (PDE) with unknown propagation speed cascaded with a PIDE. A parameter update law is designed using a Lyapu…

    Submitted 9 July, 2023; originally announced July 2023.

  17. arXiv:2307.03727  [pdf, ps, other]

    math.OC eess.SY math.AP physics.class-ph physics.flu-dyn

    Bilateral boundary control of an input delayed 2-D reaction-diffusion equation

    Authors: Dandan Guan, Yanmei Chen, Jie Qi, Linglong Du

    Abstract: In this paper, a delay compensation design method based on PDE backstepping is developed for a two-dimensional reaction-diffusion partial differential equation (PDE) with bilateral input delays. The PDE is defined in a rectangular domain, and the bilateral control is imposed on a pair of opposite sides of the rectangle. To represent the delayed bilateral inputs, we introduce two 2-D transport PDEs…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 11 pages, 3 figures (including 8 sub-figures)

  18. arXiv:2306.07090  [pdf, other]

    eess.AS cs.SD q-bio.QM

    Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion and Householder Transformation

    Authors: Jinzi Qi, Hugo Van hamme

    Abstract: In dysarthric speech recognition, data scarcity and the vast diversity between dysarthric speakers pose significant challenges. While finetuning has been a popular solution, it can lead to overfitting and low parameter efficiency. Adapter modules offer a better solution, with their small size and easy applicability. Additionally, Adapter Fusion can facilitate knowledge transfer from multiple learn…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Accepted by Interspeech 2023

  19. arXiv:2305.12838  [pdf, other]

    eess.AS cs.SD

    An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

    Authors: Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen, Jiajun Qi

    Abstract: Effective fusion of multi-scale features is crucial for improving speaker verification performance. Most existing methods aggregate multi-scale features in a layer-wise manner via simple operations, such as summation or concatenation. This paper proposes a novel architecture called Enhanced Res2Net (ERes2Net), which incorporates both local and global feature fusion techniques to improve the…

    Submitted 3 August, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  20. arXiv:2305.00127  [pdf, other]

    cs.LG cs.AI eess.SY

    Optimal Scheduling in IoT-Driven Smart Isolated Microgrids Based on Deep Reinforcement Learning

    Authors: Jiaju Qi, Lei Lei, Kan Zheng, Simon X. Yang, Xuemin Shen

    Abstract: In this paper, we investigate the scheduling issue of diesel generators (DGs) in an Internet of Things (IoT)-driven isolated microgrid (MG) by deep reinforcement learning (DRL). The renewable energy is fully exploited under the uncertainty of renewable generation and load demand. The DRL agent learns an optimal policy from historical renewable and load data of previous days, where the policy can gene…

    Submitted 28 April, 2023; originally announced May 2023.

  21. arXiv:2302.00676  [pdf]

    physics.optics eess.SY physics.app-ph

    Enhancing Light Extraction of Organic Light Emitting Diodes by Deep-Groove High-index Dielectric Nanomesh Using Large-area Nanoimprint

    Authors: Ji Qi, Wei Ding, Qi Zhang, Yuxuan Wang, Hao Chen, Stephen Y. Chou

    Abstract: To solve the conventional conflict between maintaining good charge transport property and achieving high light extraction efficiency when using micro/nanostructure patterned substrates to extract light from organic light emitting diodes (OLEDs), we developed a novel OLED structure, termed High-index Deep-Groove Dielectric Nanomesh OLED (HDNM-OLED), fabricated by large-area nanoimprint lithography…

    Submitted 31 January, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2302.00044

  22. arXiv:2211.13939  [pdf, other]

    cs.SD cs.LG eess.AS

    Efficient Incremental Text-to-Speech on GPUs

    Authors: Muyang Du, Chuan Liu, Jiaxing Qi, Junjie Lai

    Abstract: Incremental text-to-speech, also known as streaming TTS, has been increasingly applied to online speech applications that require ultra-low response latency to provide an optimal user experience. However, most of the existing speech synthesis pipelines deployed on GPU are still non-incremental, which uncovers limitations in high-concurrency scenarios, especially when the pipeline is built with end…

    Submitted 5 December, 2022; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: 5 pages, 4 figures

  23. arXiv:2210.13144  [pdf, other]

    eess.AS cs.SD

    Weak-Supervised Dysarthria-invariant Features for Spoken Language Understanding using an FHVAE and Adversarial Training

    Authors: Jinzi Qi, Hugo Van hamme

    Abstract: The scarcity of training data and the large speaker variation in dysarthric speech lead to poor accuracy and poor speaker generalization of spoken language understanding systems for dysarthric speech. Through work on the speech features, we focus on improving the model generalization ability with limited dysarthric data. Factorized Hierarchical Variational Auto-Encoders (FHVAE) trained unsupervise…

    Submitted 24 October, 2022; originally announced October 2022.

  24. arXiv:2210.06382  [pdf, other]

    eess.AS cs.AI cs.LG cs.SD eess.SP

    An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition

    Authors: Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee

    Abstract: We propose an ensemble learning framework with Poisson sub-sampling to effectively train a collection of teacher models to issue some differential privacy (DP) guarantee for training data. Through boosting under DP, a student model derived from the training data suffers little model degradation from the models trained with no privacy protection. Our proposed solution leverages two mechanisms,…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to ISCA, ISCSLP 2022, Singapore. 5 Pages

  25. arXiv:2205.12459  [pdf, other]

    cs.CV eess.IV

    A CNN with Noise Inclined Module and Denoise Framework for Hyperspectral Image Classification

    Authors: Zhiqiang Gong, Ping Zhong, Jiahao Qi, Panhe Hu

    Abstract: Deep Neural Networks have been successfully applied in hyperspectral image classification. However, most prior works adopt general deep architectures while ignoring the intrinsic structure of the hyperspectral image, such as the physical noise generation. This would make these deep models unable to generate discriminative features and provide impressive classification performance. To leverage suc…

    Submitted 24 May, 2022; originally announced May 2022.

    Journal ref: IET Image Processing, 2022

  26. arXiv:2205.09987  [pdf, other]

    cs.RO eess.SY

    Model Predictive Manipulation of Compliant Objects with Multi-Objective Optimizer and Adversarial Network for Occlusion Compensation

    Authors: Jiaming Qi, Dongyu Li, Yufeng Gao, Peng Zhou, David Navarro-Alarcon

    Abstract: The robotic manipulation of compliant objects is currently one of the most active problems in robotics due to its potential to automate many important applications. Despite the progress achieved by the robotics community in recent years, the 3D shaping of these types of materials remains an open research problem. In this paper, we propose a new vision-based controller to automatically regulate the…

    Submitted 20 May, 2022; originally announced May 2022.

  27. arXiv:2203.07659  [pdf]

    eess.IV cs.CV

    Breast Cancer Molecular Subtypes Prediction on Pathological Images with Discriminative Patch Selecting and Multi-Instance Learning

    Authors: Hong Liu, Wen-Dong Xu, Zi-Hao Shang, Xiang-Dong Wang, Hai-Yan Zhou, Ke-Wen Ma, Huan Zhou, Jia-Lin Qi, Jia-Rui Jiang, Li-Lan Tan, Hui-Min Zeng, Hui-Juan Cai, Kuan-Song Wang, Yue-Liang Qian

    Abstract: Molecular subtypes of breast cancer are important references to personalized clinical treatment. For cost and labor savings, only one of the patient's paraffin blocks is usually selected for subsequent immunohistochemistry (IHC) to obtain molecular subtypes. Inevitable sampling error is risky due to tumor heterogeneity and could result in a delay in treatment. Molecular subtype prediction from con…

    Submitted 15 March, 2022; originally announced March 2022.

  28. arXiv:2203.06031  [pdf, other]

    cs.LG cs.AI cs.SD eess.AS

    Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing

    Authors: Jun Qi, Chao-Han Huck Yang, Pin-Yu Chen, Javier Tejedor

    Abstract: This work focuses on designing low complexity hybrid tensor networks by considering trade-offs between the model complexity and practical performance. Firstly, we exploit a low-rank tensor-train deep neural network (TT-DNN) to build an end-to-end deep learning pipeline, namely LR-TT-DNN. Secondly, a hybrid model combining LR-TT-DNN with a convolutional neural network (CNN), which is denoted as CNN…

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 10 pages, 10 Figures

  29. arXiv:2203.03550  [pdf, other]

    cs.CL cs.AI cs.DC cs.NE eess.AS

    When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing

    Authors: Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Yu Tsao, Pin-Yu Chen

    Abstract: The rapid development of quantum computing has demonstrated many unique characteristics of quantum advantages, such as richer feature representation and more secured protection on model parameters. This work proposes a vertical federated learning architecture based on variational quantum circuits to demonstrate the competitive performance of a quantum-enhanced pre-trained BERT model for text class…

    Submitted 17 February, 2022; originally announced March 2022.

    Comments: Accepted to ICASSP 2022

  30. arXiv:2202.06727   

    cs.LG eess.SY

    STG-GAN: A spatiotemporal graph generative adversarial networks for short-term passenger flow prediction in urban rail transit systems

    Authors: Jinlei Zhang, Hua Li, Lixing Yang, Guangyin Jin, Jianguo Qi, Ziyou Gao

    Abstract: Short-term passenger flow prediction is an important but challenging task for better managing urban rail transit (URT) systems. Some emerging deep learning models provide good insights to improve short-term prediction accuracy. However, there exist many complex spatiotemporal dependencies in URT systems. Most previous methods only consider the absolute error between ground truth and predictions as…

    Submitted 16 August, 2023; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: There are some errors that might mislead readers for this version. There is no new version right now

    ACM Class: E.0

  31. arXiv:2201.10609  [pdf, other]

    cs.SD cs.LG eess.AS

    Exploiting Hybrid Models of Tensor-Train Networks for Spoken Command Recognition

    Authors: Jun Qi, Javier Tejedor

    Abstract: This work aims to design a low complexity spoken command recognition (SCR) system by considering different trade-offs between the number of model parameters and classification accuracy. More specifically, we exploit a deep hybrid architecture of a tensor-train (TT) network to build an end-to-end SCR pipeline. Our command recognition system, namely CNN+(TT-DNN), is composed of convolutional layers…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: Accepted in Proc. ICASSP 2022

  32. arXiv:2201.01443  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Neural KEM: A Kernel Method with Deep Coefficient Prior for PET Image Reconstruction

    Authors: Siqi Li, Kuang Gong, Ramsey D. Badawi, Edward J. Kim, Jinyi Qi, Guobao Wang

    Abstract: Image reconstruction of low-count positron emission tomography (PET) data is challenging. Kernel methods address the challenge by incorporating image prior information in the forward model of iterative PET image reconstruction. The kernelized expectation-maximization (KEM) algorithm has been developed and demonstrated to be effective and easy to implement. A common approach for a further improveme…

    Submitted 24 October, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:2110.01174

  33. arXiv:2112.09216  [pdf, other]

    eess.IV cs.CV

    A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy

    Authors: Garvit Goel, Jingyuan Qi, Wu-chun Feng, Guohua Cao

    Abstract: We present a deep-learning based computing framework for fast-and-accurate CT (DL-FACT) testing of COVID-19. Our CT-based DL framework was developed to improve the testing speed and accuracy of COVID-19 (plus its variants) via a DL-based approach for CT image enhancement and classification. The image enhancement network is adapted from DDnet, short for DenseNet and Deconvolution based network. To…

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 10 pages

  34. arXiv:2112.01697  [pdf, other]

    cs.CV cs.CL cs.LG cs.SD eess.AS

    LMR-CBT: Learning Modality-fused Representations with CB-Transformer for Multimodal Emotion Recognition from Unaligned Multimodal Sequences

    Authors: Ziwang Fu, Feng Liu, Hanyang Wang, Siyuan Shen, Jiahao Zhang, Jiayin Qi, Xiangling Fu, Aimin Zhou

    Abstract: Learning modality-fused representations and processing unaligned multimodal sequences are meaningful and challenging in multimodal emotion recognition. Existing approaches use directional pairwise attention or a message hub to fuse language, visual, and audio modalities. However, those approaches introduce information redundancy when fusing features and are inefficient without considering the comp…

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: 9 pages, Figure 2, Table 5

  35. Delay-Compensated Distributed PDE Control of Traffic with Connected/Automated Vehicles

    Authors: Jie Qi, Shurong Mo, Miroslav Krstic

    Abstract: We develop an input delay-compensating design for stabilization of an Aw-Rascle-Zhang (ARZ) traffic model in congested regime which is governed by a $2\times 2$ first-order hyperbolic nonlinear PDE. The traffic flow consists of both adaptive cruise control-equipped (ACC-equipped) and manually-driven vehicles. The control input is the time gap of ACC-equipped and connected vehicles, which is subjec…

    Submitted 2 September, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

  36. arXiv:2106.10359  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Direct Reconstruction of Linear Parametric Images from Dynamic PET Using Nonlocal Deep Image Prior

    Authors: Kuang Gong, Ciprian Catana, Jinyi Qi, Quanzheng Li

    Abstract: Direct reconstruction methods have been developed to estimate parametric images directly from the measured PET sinograms by combining the PET imaging model and tracer kinetics in an integrated framework. Due to limited counts received, signal-to-noise-ratio (SNR) and resolution of parametric images produced by direct reconstruction frameworks are still limited. Recently supervised deep learning me…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: 10 pages, 10 figures

  37. arXiv:2106.07337  [pdf, other]

    eess.AS

    Speech Disorder Classification Using Extended Factorized Hierarchical Variational Auto-encoders

    Authors: Jinzi Qi, Hugo Van hamme

    Abstract: Objective speech disorder classification for speakers with communication difficulty is desirable for diagnosis and administering therapy. With the current state of speech technology, it is natural to propose neural networks for this application. But neural network model training is hampered by a lack of labeled disordered speech data. In this research, we apply an extended version of Factorized Hi…

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: 5 pages, 2 figures, submitted to INTERSPEECH2021

  38. arXiv:2106.02424  [pdf, other]

    cs.RO eess.SY

    Contour Moments Based Manipulation of Composite Rigid-Deformable Objects with Finite Time Model Estimation and Shape/Position Control

    Authors: Jiaming Qi, Guangfu Ma, Jihong Zhu, Peng Zhou, Yueyong Lyu, Haibo Zhang, David Navarro-Alarcon

    Abstract: The robotic manipulation of composite rigid-deformable objects (i.e. those with mixed non-homogeneous stiffness properties) is a challenging problem with clear practical applications that, despite the recent progress in the field, has not been sufficiently studied in the literature. To deal with this issue, in this paper we propose a new visual servoing method that has the capability to manipul…

    Submitted 4 June, 2021; originally announced June 2021.

  39. arXiv:2104.00230  [pdf, other]

    eess.AS

    Bidirectional Multiscale Feature Aggregation for Speaker Verification

    Authors: Jiajun Qi, Wu Guo, Bin Gu

    Abstract: In this paper, we propose a novel bidirectional multiscale feature aggregation (BMFA) network with attentional fusion modules for text-independent speaker verification. The feature maps from different stages of the backbone network are iteratively combined and refined in both a bottom-up and top-down manner. Furthermore, instead of simple concatenation or element-wise addition of feature maps from…

    Submitted 31 March, 2021; originally announced April 2021.

  40. arXiv:2012.00803  [pdf, other]

    eess.SY

    Generator Parameter Estimation by Q-Learning Based on PMU Measurements

    Authors: Seyyed Rashid Khazeiynasab, Junjian Qi, Issa Batarseh

    Abstract: In this paper, a novel Q-learning based approach is proposed for estimating the parameters of synchronous generators using PMU measurements. Event playback is used to generate model outputs under different parameters for training the agent in Q-learning. We assume that the exact values of some parameters in the model are not known by the agent in Q-learning. Then, an optimal history-dependent poli…

    Submitted 1 December, 2020; originally announced December 2020.

  41. arXiv:2010.13309  [pdf, other]

    cs.SD cs.LG cs.NE eess.AS quant-ph

    Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition

    Authors: Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

    Abstract: We propose a novel decentralized feature extraction approach in federated learning to address privacy-preservation issues for speech recognition. It is built upon a quantum convolutional neural network (QCNN) composed of a quantum circuit encoder for feature extraction, and a recurrent neural network (RNN) based end-to-end acoustic model (AM). To enhance model parameter protection in a decentraliz…

    Submitted 12 February, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

    Comments: Accepted to IEEE ICASSP 2021. Code is available: https://github.com/huckiyang/QuantumSpeech-QCNN

    Journal ref: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  42. arXiv:2010.10919   

    eess.AS cs.SD

    Multi-task Metric Learning for Text-independent Speaker Verification

    Authors: Yafeng Chen, Wu Guo, Jingjing Shi, Jiajun Qi, Tan Liu

    Abstract: In this work, we introduce metric learning (ML) to enhance the deep embedding learning for text-independent speaker verification (SV). Specifically, the deep speaker embedding network is trained with conventional cross entropy loss and auxiliary pair-based ML loss function. For the auxiliary ML task, training samples of a mini-batch are first arranged into pairs, then positive and negative pairs a…

    Submitted 22 March, 2023; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Not a particularly high-quality work, so we request withdrawal

  43. arXiv:2010.07540  [pdf, other]

    eess.SY

    Multi-Objective PMU Allocation for Resilient Power System Monitoring

    Authors: Hamed Haggi, Wei Sun, Junjian Qi

    Abstract: Phasor measurement units (PMUs) enable better system monitoring and security enhancement in smart grids. In order to enhance power system resilience against outages and blackouts caused by extreme weather events or man-made attacks, it remains a major challenge to determine the optimal number and location of PMUs. In this paper, a multi-objective resilient PMU placement (MORPP) problem is formulat…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: IEEE PES General Meeting 2020

  44. arXiv:2010.06248   

    eess.AS

    Exploring Universal Speech Attributes for Speaker Verification with an Improved Cross-stitch Network

    Authors: Jiajun Qi, Wu Guo, Jingjing Shi, Yafeng Chen, Tan Liu

    Abstract: The universal speech attributes for x-vector based speaker verification (SV) are addressed in this paper. The manner and place of articulation form the fundamental speech attribute unit (SAU), and then new speech attribute (NSA) units for acoustic modeling are generated by tied tri-SAU states. An improved cross-stitch network is adopted as a multitask learning (MTL) framework for integrating these…

    Submitted 31 May, 2023; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: Not a particularly high-quality work, so we request withdrawal

  45. arXiv:2009.14155  [pdf, other]

    eess.SY

    Resilience Analysis and Cascading Failure Modeling of Power Systems under Extreme Temperatures

    Authors: Seyyed Rashid Khazeiynasab, Junjian Qi

    Abstract: In this paper, we propose an AC power flow based cascading failure model that explicitly considers external weather conditions, extreme temperatures in particular, and evaluates the impact of extreme temperature on the initiation and propagation of cascading blackouts. Specifically, load and dynamic line rating changes are modeled due to temperature disturbance, the probabilities for transmission…

    Submitted 29 September, 2020; originally announced September 2020.

  46. arXiv:2009.01003  [pdf, other]

    cs.CL cs.SD eess.AS

    Variational Inference-Based Dropout in Recurrent Neural Networks for Slot Filling in Spoken Language Understanding

    Authors: Jun Qi, Xu Liu, Javier Tejedor

    Abstract: This paper proposes to generalize the variational recurrent neural network (RNN) with variational inference (VI)-based dropout regularization employed for the long short-term memory (LSTM) cells to more advanced RNN architectures like gated recurrent unit (GRU) and bi-directional LSTM/GRU. The new variational RNNs are employed for slot filling, which is an intriguing but challenging task in spoken…

    Submitted 23 August, 2020; originally announced September 2020.

    Comments: conference paper, 5 pages

  47. arXiv:2008.07281  [pdf, ps, other]

    eess.AS cs.LG cs.SD eess.SP stat.ML

    On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression

    Authors: Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

    Abstract: In this paper, we exploit the properties of mean absolute error (MAE) as a loss function for the deep neural network (DNN) based vector-to-vector regression. The goal of this work is two-fold: (i) presenting performance bounds of MAE, and (ii) demonstrating new properties of MAE that make it more appropriate than mean squared error (MSE) as a loss function for DNN based vector-to-vector regression…

    Submitted 12 August, 2020; originally announced August 2020.

    Journal ref: IEEE Signal Processing Letters, 2020

  48. arXiv:2008.06896  [pdf, other]

    cs.RO eess.SY

    Adaptive Shape Servoing of Elastic Rods using Parameterized Regression Features and Auto-Tuning Motion Controls

    Authors: Jiaming Qi, Guangtao Ran, Bohui Wang, Jian Liu, Wanyu Ma, Peng Zhou, David Navarro-Alarcon

    Abstract: The robotic manipulation of deformable linear objects has shown great potential in a wide range of real-world applications. However, it presents many challenges due to the objects' complex nonlinearity and high-dimensional configuration. In this paper, we propose a new shape servoing framework to automatically manipulate elastic rods through visual feedback. Our new method uses parameterized regre…

    Submitted 9 September, 2023; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: 8 pages, 12 figures

  49. arXiv:2008.05459  [pdf, other]

    cs.LG eess.SP stat.ML

    Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression

    Authors: Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

    Abstract: In this paper, we show that, in vector-to-vector regression utilizing deep neural networks (DNNs), a generalized loss of mean absolute error (MAE) between the predicted and expected feature vectors is upper bounded by the sum of an approximation error, an estimation error, and an optimization error. Leveraging upon error decomposition techniques in statistical learning theory and non-convex optimi…

    Submitted 4 August, 2020; originally announced August 2020.

    Journal ref: IEEE Transactions on Signal Processing, Vol 68, pp. 3411-3422, 2020

  50. arXiv:2007.13024  [pdf, other]

    eess.AS cs.CL cs.LG cs.NE cs.SD

    Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement

    Authors: Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee

    Abstract: This paper investigates different trade-offs between the number of model parameters and enhanced speech qualities by employing several deep tensor-to-vector regression models for speech enhancement. We find that a hybrid architecture, namely CNN-TT, is capable of maintaining a good quality performance with a reduced model parameter size. CNN-TT is composed of several convolutional layers at the bo…

    Submitted 2 August, 2020; v1 submitted 25 July, 2020; originally announced July 2020.

    Comments: Accepted to InterSpeech 2020