Skip to main content

Showing 1–15 of 15 results for author: Nie, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2509.15473  [pdf, ps, other

    eess.AS cs.CL cs.LG cs.SD

    Breathing and Semantic Pause Detection and Exertion-Level Classification in Post-Exercise Speech

    Authors: Yuyu Wang, Wuyue Xia, Huaxiu Yao, Jingping Nie

    Abstract: Post-exercise speech contains rich physiological and linguistic cues, often marked by semantic pauses, breathing pauses, and combined breathing-semantic pauses. Detecting these events enables assessment of recovery rate, lung function, and exertion-related abnormalities. However, existing works on identifying and distinguishing different types of pauses in this context are limited. In this work, b… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

    Comments: 6 pages, 3rd ACM International Workshop on Intelligent Acoustic Systems and Applications (IASA 25)

  2. arXiv:2509.14243  [pdf, ps, other

    eess.SP

    InWaveSR: Topography-Aware Super-Resolution Network for Internal Solitary Waves

    Authors: Xinjie Wang, Zhongrui Li, Peng Han, Chunxin Yuan, Jiexin Xu, Zhiqiang Wei, Jie Nie

    Abstract: The effective utilization of observational data is frequently hindered by insufficient resolution. To address this problem, we present a new spatio-temporal super-resolution (STSR) model, called InWaveSR. It is built on a deep learning framework with physical restrictions and can efficiently generate high-resolution data from low-resolution input, especially for data featuring internal solitary wa… ▽ More

    Submitted 3 September, 2025; originally announced September 2025.

  3. arXiv:2506.15947  [pdf, ps, other

    cs.NI eess.SP

    HybridRAG-based LLM Agents for Low-Carbon Optimization in Low-Altitude Economy Networks

    Authors: Jinbo Wen, Cheng Su, Jiawen Kang, Jiangtian Nie, Yang Zhang, Jianhang Tang, Dusit Niyato, Chau Yuen

    Abstract: Low-Altitude Economy Networks (LAENets) are emerging as a promising paradigm to support various low-altitude services through integrated air-ground infrastructure. To satisfy low-latency and high-computation demands, the integration of Unmanned Aerial Vehicles (UAVs) with Mobile Edge Computing (MEC) systems plays a vital role, which offloads computing tasks from terminal devices to nearby UAVs, en… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  4. arXiv:2505.20745  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Foundation Model Hidden Representations for Heart Rate Estimation from Auscultation

    Authors: Jingping Nie, Dung T. Tran, Karan Thakkar, Vasudha Kowtha, Jon Huang, Carlos Avendano, Erdrin Azemi, Vikramjit Mitra

    Abstract: Auscultation, particularly heart sound, is a non-invasive technique that provides essential vital sign information. Recently, self-supervised acoustic representation foundation models (FMs) have been proposed to offer insights into acoustics-based vital signs. However, there has been little exploration of the extent to which auscultation is encoded in these pre-trained FM representations. In this… ▽ More

    Submitted 29 May, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: 5 pages, Interspeech 2025 conference

  5. arXiv:2501.13130  [pdf, other

    eess.IV

    A Novel Scene Coupling Semantic Mask Network for Remote Sensing Image Segmentation

    Authors: Xiaowen Ma, Rongrong Lian, Zhenkai Wu, Renxiang Guan, Tingfeng Hong, Mengjiao Zhao, Mengting Ma, Jiangtao Nie, Zhenhong Du, Siyang Song, Wei Zhang

    Abstract: As a common method in the field of computer vision, spatial attention mechanism has been widely used in semantic segmentation of remote sensing images due to its outstanding long-range dependency modeling capability. However, remote sensing images are usually characterized by complex backgrounds and large intra-class variance that would degrade their analysis performance. While vanilla spatial att… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: Accepted by ISPRS Journal of Photogrammetry and Remote Sensing

  6. arXiv:2409.04050  [pdf, other

    eess.IV cs.CV

    EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-Resolution

    Authors: Xi Su, Xiangfei Shen, Mingyang Wan, Jing Nie, Lihui Chen, Haijun Liu, Xichuan Zhou

    Abstract: Single hyperspectral image super-resolution (single-HSI-SR) aims to improve the resolution of a single input low-resolution HSI. Due to the bottleneck of data scarcity, the development of single-HSI-SR lags far behind that of RGB natural images. In recent years, research on RGB SR has shown that models pre-trained on large-scale benchmark datasets can greatly improve performance on unseen data, wh… ▽ More

    Submitted 30 December, 2024; v1 submitted 6 September, 2024; originally announced September 2024.

    Comments: AAAI 2025 conference paper

  7. arXiv:2407.18424  [pdf, other

    cs.SD cs.LG eess.AS

    Model-driven Heart Rate Estimation and Heart Murmur Detection based on Phonocardiogram

    Authors: Jingping Nie, Ran Liu, Behrooz Mahasseni, Erdrin Azemi, Vikramjit Mitra

    Abstract: Acoustic signals are crucial for health monitoring, particularly heart sounds which provide essential data like heart rate and detect cardiac anomalies such as murmurs. This study utilizes a publicly available phonocardiogram (PCG) dataset to estimate heart rate using model-driven methods and extends the best-performing model to a multi-task learning (MTL) framework for simultaneous heart rate est… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 6 pages, 10 figures

  8. arXiv:2403.19983  [pdf, other

    eess.IV cs.CV

    A multi-stage semi-supervised learning for ankle fracture classification on CT images

    Authors: Hongzhi Liu, Guicheng Li, Jiacheng Nie, Hui Tang, Chunfeng Yang, Qianjin Feng, Hailin Xu, Yang Chen

    Abstract: Because of the complicated mechanism of ankle injury, it is very difficult to diagnose ankle fracture in clinic. In order to simplify the process of fracture diagnosis, an automatic diagnosis model of ankle fracture was proposed. Firstly, a tibia-fibula segmentation network is proposed for the joint tibiofibular region of the ankle joint, and the corresponding segmentation dataset is established o… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  9. arXiv:2309.05927  [pdf, other

    cs.LG cs.AI eess.SP

    Frequency-Aware Masked Autoencoders for Multimodal Pretraining on Biosignals

    Authors: Ran Liu, Ellen L. Zippi, Hadi Pouransari, Chris Sandino, Jingping Nie, Hanlin Goh, Erdrin Azemi, Ali Moin

    Abstract: Leveraging multimodal information from biosignals is vital for building a comprehensive representation of people's physical and mental states. However, multimodal biosignals often exhibit substantial distributional shifts between pretraining and inference datasets, stemming from changes in task specification or variations in modality compositions. To achieve effective pretraining in the presence o… ▽ More

    Submitted 18 April, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Extended version of ICLR 2024 Learning from Time Series for Health workshop

  10. arXiv:2308.03027  [pdf, other

    cs.LG cs.CV eess.SP

    Causal Disentanglement Hidden Markov Model for Fault Diagnosis

    Authors: Rihao Chang, Yongtao Ma, Weizhi Nie, Jie Nie, An-an Liu

    Abstract: In modern industries, fault diagnosis has been widely applied with the goal of realizing predictive maintenance. The key issue for the fault diagnosis system is to extract representative characteristics of the fault signal and then accurately predict the fault type. In this paper, we propose a Causal Disentanglement Hidden Markov model (CDHM) to learn the causality in the bearing fault mechanism a… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

  11. arXiv:2206.12559  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

    Authors: Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie

    Abstract: Expressive speech synthesis, like audiobook synthesis, is still challenging for style representation learning and prediction. Deriving from reference audio or predicting style tags from text requires a huge amount of labeled data, which is costly to acquire and difficult to define and annotate accurately. In this paper, we propose a novel framework for learning style representation from abundant p… ▽ More

    Submitted 25 June, 2022; originally announced June 2022.

    Comments: Accepted by Interspeech 2022

  12. arXiv:2012.01745  [pdf, other

    eess.IV cs.CV

    Unsupervised Alternating Optimization for Blind Hyperspectral Imagery Super-resolution

    Authors: Jiangtao Nie, Lei Zhang, Wei Wei, Zhiqiang Lang, Yanning Zhang

    Abstract: Despite the great success of deep model on Hyperspectral imagery (HSI) super-resolution(SR) for simulated data, most of them function unsatisfactory when applied to the real data, especially for unsupervised HSI SR methods. One of the main reason comes from the fact that the predefined degeneration models (e.g. blur in spatial domain) utilized by most HSI SR methods often exist great discrepancy w… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: 14 page, 13 figure

  13. arXiv:2009.06943  [pdf, other

    eess.IV cs.CV

    AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Xiaotong Luo, Liang Chen, Jiangtao Zhang, Maitreya Suin , et al. (60 additional authors not shown)

    Abstract: This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor x4 based on a set of prior examples of low and corresponding high resolution images. The goal is to devise a network that reduces one or several aspects such as runtime, parameter co… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

  14. Federated Learning in the Sky: Aerial-Ground Air Quality Sensing Framework with UAV Swarms

    Authors: Yi Liu, Jiangtian Nie, Xuandi Li, Syed Hassan Ahmed, Wei Yang Bryan Lim, Chunyan Miao

    Abstract: Due to air quality significantly affects human health, it is becoming increasingly important to accurately and timely predict the Air Quality Index (AQI). To this end, this paper proposes a new federated learning-based aerial-ground air quality sensing framework for fine-grained 3D air quality monitoring and forecasting. Specifically, in the air, this framework leverages a light-weight Dense-Mobil… ▽ More

    Submitted 23 July, 2020; originally announced July 2020.

    Comments: IEEE Internet of Things

  15. arXiv:1911.01249  [pdf, other

    eess.IV cs.CV

    AIM 2019 Challenge on Constrained Super-Resolution: Methods and Results

    Authors: Kai Zhang, Shuhang Gu, Radu Timofte, Zheng Hui, Xiumei Wang, Xinbo Gao, Dongliang Xiong, Shuai Liu, Ruipeng Gang, Nan Nan, Chenghua Li, Xueyi Zou, Ning Kang, Zhan Wang, Hang Xu, Chaofeng Wang, Zheng Li, Linlin Wang, Jun Shi, Wenyu Sun, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Yazhe Niu , et al. (4 additional authors not shown)

    Abstract: This paper reviews the AIM 2019 challenge on constrained example-based single image super-resolution with focus on proposed solutions and results. The challenge had 3 tracks. Taking the three main aspects (i.e., number of parameters, inference/running time, fidelity (PSNR)) of MSRResNet as the baseline, Track 1 aims to reduce the amount of parameters while being constrained to maintain or improve… ▽ More

    Submitted 4 November, 2019; originally announced November 2019.