Showing 1–50 of 55 results for author: Feng, D

Searching in archive eess.
  1. arXiv:2507.06717  [pdf, ps, other]

    eess.IV cs.MM

    QoE Optimization for Semantic Self-Correcting Video Transmission in Multi-UAV Networks

    Authors: Xuyang Chen, Chong Huang, Daquan Feng, Lei Luo, Yao Sun, Xiang-Gen Xia

    Abstract: Real-time unmanned aerial vehicle (UAV) video streaming is essential for time-sensitive applications, including remote surveillance, emergency response, and environmental monitoring. However, it faces challenges such as limited bandwidth, latency fluctuations, and high packet loss. To address these issues, we propose a novel semantic self-correcting video transmission framework with ultra-fine bit… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

    Comments: 13 pages

  2. arXiv:2504.10357  [pdf, ps, other]

    eess.SP

    The Communication and Computation Trade-off in Wireless Semantic Communications

    Authors: Xuyang Chen, Chong Huang, Gaojie Chen, Daquan Feng, Pei Xiao

    Abstract: Semantic communications have emerged as a crucial research direction for future wireless communication networks. However, as wireless systems become increasingly complex, the demands for computation and communication resources in semantic communications continue to grow rapidly. This paper investigates the trade-off between computation and communication in wireless semantic communications, taking… ▽ More

    Submitted 13 May, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

    Comments: Accepted for publication in IEEE Wireless Communications Letters

  3. arXiv:2504.04977  [pdf, other]

    eess.SP

    Low-Rate Semantic Communication with Codebook-based Conditional Generative Models

    Authors: Kailang Ye, Mingze Gong, Shuoyao Wang, Daquan Feng

    Abstract: Generative semantic communication models are reshaping semantic communication frameworks by moving beyond pixel-wise optimization to align with human perception. However, many existing approaches prioritize image-level perceptual quality, often neglecting alignment with downstream tasks, which can lead to suboptimal semantic representation. This paper introduces an Ultra-Low Bitrate Semantic Commu… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  4. arXiv:2502.00700  [pdf, other]

    cs.CV eess.IV

    S2CFormer: Revisiting the RD-Latency Trade-off in Transformer-based Learned Image Compression

    Authors: Yunuo Chen, Qian Li, Bing He, Donghui Feng, Ronghua Wu, Qi Wang, Li Song, Guo Lu, Wenjun Zhang

    Abstract: Transformer-based Learned Image Compression (LIC) suffers from a suboptimal trade-off between decoding latency and rate-distortion (R-D) performance. Moreover, the critical role of the FeedForward Network (FFN)-based channel aggregation module has been largely overlooked. Our research reveals that efficient channel aggregation-rather than complex and time-consuming spatial operations-is the key to… ▽ More

    Submitted 24 March, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

  5. arXiv:2412.17270  [pdf, other]

    eess.IV

    AsymLLIC: Asymmetric Lightweight Learned Image Compression

    Authors: Shen Wang, Zhengxue Cheng, Donghui Feng, Guo Lu, Li Song, Wenjun Zhang

    Abstract: Learned image compression (LIC) methods often employ symmetrical encoder and decoder architectures, inevitably increasing decoding time. However, practical scenarios demand an asymmetric design, where the decoder requires low complexity to cater to diverse low-end devices, while the encoder can accommodate higher complexity to improve coding performance. In this paper, we propose an asymmetric light… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

  6. arXiv:2412.12853  [pdf, other]

    eess.IV cs.CV

    Automatic Left Ventricular Cavity Segmentation via Deep Spatial Sequential Network in 4D Computed Tomography Studies

    Authors: Yuyu Guo, Lei Bi, Zhengbin Zhu, David Dagan Feng, Ruiyan Zhang, Qian Wang, Jinman Kim

    Abstract: Automated segmentation of left ventricular cavity (LVC) in temporal cardiac image sequences (multiple time points) is a fundamental requirement for quantitative analysis of its structural and functional changes. Deep learning based methods for the segmentation of LVC are the state of the art; however, these methods are generally formulated to work on single time points, and fail to exploit the co… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: 9 pages

  7. arXiv:2412.11907  [pdf, other]

    cs.SD eess.AS

    AudioCIL: A Python Toolbox for Audio Class-Incremental Learning with Multiple Scenes

    Authors: Qisheng Xu, Yulin Sun, Yi Su, Qian Zhu, Xiaoyi Tan, Hongyu Wen, Zijian Gao, Kele Xu, Yong Dou, Dawei Feng

    Abstract: Deep learning, with its robust automatic feature extraction capabilities, has demonstrated significant success in audio signal processing. Typically, these methods rely on static, pre-collected large-scale datasets for training, performing well on a fixed number of classes. However, the real world is characterized by constant change, with new audio classes emerging from streaming or temporary avai… ▽ More

    Submitted 18 December, 2024; v1 submitted 16 December, 2024; originally announced December 2024.

  8. arXiv:2409.18701  [pdf]

    eess.IV cs.CV

    3DPX: Single Panoramic X-ray Analysis Guided by 3D Oral Structure Reconstruction

    Authors: Xiaoshuang Li, Zimo Huang, Mingyuan Meng, Eduardo Delamare, Dagan Feng, Lei Bi, Bin Sheng, Lingyong Jiang, Bo Li, Jinman Kim

    Abstract: Panoramic X-ray (PX) is a prevalent modality in dentistry practice owing to its wide availability and low cost. However, as a 2D projection of a 3D structure, PX suffers from anatomical information loss and PX diagnosis is limited compared to that with 3D imaging modalities. 2D-to-3D reconstruction methods have been explored for the ability to synthesize the absent 3D anatomical information from 2… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

  9. arXiv:2408.14089  [pdf, other]

    cs.IT eess.SP

    Mini-Slot-Assisted Short Packet URLLC: Differential or Coherent Detection?

    Authors: Canjian Zheng, Fu-Chun Zheng, Jingjing Luo, Pengcheng Zhu, Xiaohu You, Daquan Feng

    Abstract: One of the primary challenges in short packet ultra-reliable and low-latency communications (URLLC) is to achieve reliable channel estimation and data detection while minimizing the impact on latency performance. Given the small packet size in mini-slot-assisted URLLC, relying solely on pilot-based coherent detection is almost impossible to meet the seemingly contradictory requirements of high cha… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 14 pages, 8 figures, journal

  10. arXiv:2408.06645  [pdf]

    eess.SY

    Dynamic Pricing of Electric Vehicle Charging Station Alliances Under Information Asymmetry

    Authors: Zeyu Liu, Yun Zhou, Donghan Feng, Shaolun Xu, Yin Yi, Hengjie Li, Haojing Wang

    Abstract: Due to the centralization of charging stations (CSs), CSs are organized as charging station alliances (CSAs) in the commercial competition. Under this situation, this paper studies the profit-oriented dynamic pricing strategy of CSAs. As the practicability basis, a privacy-protected bidirectional real-time information interaction framework is designed, under which the status of EVs is utilized as… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  11. arXiv:2408.01292  [pdf]

    eess.IV cs.AI cs.CV

    3DPX: Progressive 2D-to-3D Oral Image Reconstruction with Hybrid MLP-CNN Networks

    Authors: Xiaoshuang Li, Mingyuan Meng, Zimo Huang, Lei Bi, Eduardo Delamare, Dagan Feng, Bin Sheng, Jinman Kim

    Abstract: Panoramic X-ray (PX) is a prevalent modality in dental practice for its wide availability and low cost. However, as a 2D projection image, PX does not contain 3D anatomical information, and therefore has limited use in dental applications that can benefit from 3D information, e.g., tooth angular misalignment detection and classification. Reconstructing 3D structures directly from 2D PX has recent… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: accepted by MICCAI 2024

  12. arXiv:2407.11018  [pdf, other]

    cs.NI eess.SP

    Online Multi-Task Offloading for Semantic-Aware Edge Computing Systems

    Authors: Xuyang Chen, Daquan Feng, Wei Jiang, Qu Luo, Gaojie Chen, Yao Sun

    Abstract: Mobile edge computing (MEC) provides low-latency offloading solutions for computationally intensive tasks, effectively improving the computing efficiency and battery life of mobile devices. However, for data-intensive tasks or scenarios with limited uplink bandwidth, network congestion might occur due to massive simultaneous offloading nodes, increasing transmission latency and affecting task perf… ▽ More

    Submitted 21 April, 2025; v1 submitted 28 June, 2024; originally announced July 2024.

    Comments: 13 pages

  13. arXiv:2406.09356  [pdf, other]

    cs.CV eess.IV

    CMC-Bench: Towards a New Paradigm of Visual Signal Compression

    Authors: Chunyi Li, Xiele Wu, Haoning Wu, Donghui Feng, Zicheng Zhang, Guo Lu, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

    Abstract: Ultra-low bitrate image compression is a challenging and demanding topic. With the development of Large Multimodal Models (LMMs), a Cross Modality Compression (CMC) paradigm of Image-Text-Image has emerged. Compared with traditional codecs, this semantic-level compression can reduce image data size to 0.1% or even lower, which has strong potential applications. However, CMC has certain defects in… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  14. arXiv:2406.00123  [pdf]

    eess.IV cs.CV

    Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration

    Authors: Mingyuan Meng, Dagan Feng, Lei Bi, Jinman Kim

    Abstract: Deformable image registration is a fundamental step for medical image analysis. Recently, transformers have been used for registration and outperformed Convolutional Neural Networks (CNNs). Transformers can capture long-range dependence among image features, which have been shown beneficial for registration. However, due to the high computation/memory loads of self-attention, transformers are typi… ▽ More

    Submitted 12 June, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

    Comments: Accepted at CVPR 2024 as an Oral Presentation and Best Paper Candidate

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 9645-9654

  15. arXiv:2404.18105  [pdf, other]

    cs.RO eess.SP

    Tightly-Coupled VLP/INS Integrated Navigation by Inclination Estimation and Blockage Handling

    Authors: Xiao Sun, Yuan Zhuang, Xiansheng Yang, Jianzhu Huai, Tianming Huang, Daquan Feng

    Abstract: Visible Light Positioning (VLP) has emerged as a promising technology capable of delivering indoor localization with high accuracy. In VLP systems that use Photodiodes (PDs) as light receivers, the Received Signal Strength (RSS) is affected by the incidence angle of light, making the inclination of PDs a critical parameter in the positioning model. Currently, most studies assume the inclination to… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  16. arXiv:2402.16749  [pdf, other]

    cs.CV cs.AI eess.IV

    MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model

    Authors: Chunyi Li, Guo Lu, Donghui Feng, Haoning Wu, Zicheng Zhang, Xiaohong Liu, Guangtao Zhai, Weisi Lin, Wenjun Zhang

    Abstract: With the evolution of storage and communication protocols, ultra-low bitrate image compression has become a highly demanding topic. However, existing compression algorithms must sacrifice either consistency with the ground truth or perceptual quality at ultra-low bitrate. In recent years, the rapid development of the Large Multimodal Model (LMM) has made it possible to balance these two goals. To… ▽ More

    Submitted 17 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 13 pages, 11 figures, 4 tables

  17. arXiv:2311.16707  [pdf]

    eess.IV cs.CV

    Full-resolution MLPs Empower Medical Dense Prediction

    Authors: Mingyuan Meng, Yuxin Xue, Dagan Feng, Lei Bi, Jinman Kim

    Abstract: Dense prediction is a fundamental requirement for many medical vision tasks such as medical image restoration, registration, and segmentation. The most popular vision model, Convolutional Neural Networks (CNNs), has reached bottlenecks due to the intrinsic locality of convolution operations. Recently, transformers have been widely adopted for dense prediction for their capability to capture long-r… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Under Review

  18. arXiv:2310.15550  [pdf]

    eess.IV cs.CV cs.LG

    PET Synthesis via Self-supervised Adaptive Residual Estimation Generative Adversarial Network

    Authors: Yuxin Xue, Lei Bi, Yige Peng, Michael Fulham, David Dagan Feng, Jinman Kim

    Abstract: Positron emission tomography (PET) is a widely used, highly sensitive molecular imaging in clinical diagnosis. There is interest in reducing the radiation exposure from PET but also maintaining adequate image quality. Recent methods using convolutional neural networks (CNNs) to generate synthesized high-quality PET images from low-dose counterparts have been reported to be state-of-the-art for low… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: This work has been submitted to the IEEE for possible publication

  19. arXiv:2309.05271  [pdf]

    eess.IV cs.AI cs.CV

    AutoFuse: Automatic Fusion Networks for Deformable Medical Image Registration

    Authors: Mingyuan Meng, Michael Fulham, Dagan Feng, Lei Bi, Jinman Kim

    Abstract: Deformable image registration aims to find a dense non-linear spatial correspondence between a pair of images, which is a crucial step for many medical tasks such as tumor growth monitoring and population analysis. Recently, Deep Neural Networks (DNNs) have been widely recognized for their ability to perform fast end-to-end registration. However, DNN-based registration needs to explore the spatial… ▽ More

    Submitted 8 January, 2025; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Published at Pattern Recognition

    Journal ref: Pattern Recognition, vol. 161, p. 111338, 2025

  20. arXiv:2307.03427  [pdf]

    eess.IV cs.CV cs.LG

    Merging-Diverging Hybrid Transformer Networks for Survival Prediction in Head and Neck Cancer

    Authors: Mingyuan Meng, Lei Bi, Michael Fulham, Dagan Feng, Jinman Kim

    Abstract: Survival prediction is crucial for cancer patients as it provides early prognostic information for treatment planning. Recently, deep survival models based on deep learning and medical images have shown promising performance for survival prediction. However, existing deep survival models are not well developed in utilizing multi-modality images (e.g., PET-CT) and in extracting region-specific info… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Early Accepted at International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2023)

    Journal ref: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), pp. 400-410, 2023

  21. arXiv:2307.03421  [pdf]

    cs.CV cs.AI eess.IV

    Non-iterative Coarse-to-fine Transformer Networks for Joint Affine and Deformable Image Registration

    Authors: Mingyuan Meng, Lei Bi, Michael Fulham, Dagan Feng, Jinman Kim

    Abstract: Image registration is a fundamental requirement for medical image analysis. Deep registration methods based on deep learning have been widely recognized for their capabilities to perform fast end-to-end registration. Many deep registration methods achieved state-of-the-art performance by performing coarse-to-fine registration, where multiple registration steps were iterated with cascaded networks.… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Accepted at International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2023)

    Journal ref: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), pp. 750-760, 2023

  22. arXiv:2305.09946  [pdf]

    eess.IV cs.CV cs.LG

    AdaMSS: Adaptive Multi-Modality Segmentation-to-Survival Learning for Survival Outcome Prediction from PET/CT Images

    Authors: Mingyuan Meng, Bingxin Gu, Michael Fulham, Shaoli Song, Dagan Feng, Lei Bi, Jinman Kim

    Abstract: Survival prediction is a major concern for cancer management. Deep survival models based on deep learning have been widely adopted to perform end-to-end survival prediction from medical images. Recent deep survival models achieved promising performance by jointly performing tumor segmentation with survival prediction, where the models were guided to extract tumor-related information through Multi-… ▽ More

    Submitted 15 October, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: The extended version of this paper has been published at npj Precision Oncology as "Adaptive segmentation-to-survival learning for survival prediction from multi-modality medical images"

    Journal ref: npj Precision Oncology, vol. 8, p. 232, 2024

  23. arXiv:2305.07584  [pdf, other]

    cs.IT eess.SP

    Proactive Content Caching Scheme in Urban Vehicular Networks

    Authors: Biqian Feng, Chenyuan Feng, Daquan Feng, Yongpeng Wu, Xiang-Gen Xia

    Abstract: Stream media content caching is a key enabling technology to promote the value chain of future urban vehicular networks. Nevertheless, the high mobility of vehicles, intermittency of information transmissions, high dynamics of user requests, limited caching capacities and extreme complexity of business scenarios pose an enormous challenge to content caching and distribution in vehicular networks.… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted by IEEE Transactions on Communications

  24. arXiv:2304.00725  [pdf]

    eess.IV cs.CV

    CG-3DSRGAN: A classification guided 3D generative adversarial network for image quality recovery from low-dose PET images

    Authors: Yuxin Xue, Yige Peng, Lei Bi, Dagan Feng, Jinman Kim

    Abstract: Positron emission tomography (PET) is the most sensitive molecular imaging modality routinely applied in our modern healthcare. High radioactivity caused by the injected tracer dose is a major concern in PET imaging and limits its clinical applications. However, reducing the dose leads to inadequate image quality for diagnostic practice. Motivated by the need to produce high quality images with mi… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  25. xURLLC-Aware Service Provisioning in Vehicular Networks: A Semantic Communication Perspective

    Authors: Le Xia, Yao Sun, Dusit Niyato, Daquan Feng, Lei Feng, Muhammad Ali Imran

    Abstract: Semantic communication (SemCom), as an emerging paradigm focusing on meaning delivery, has recently been considered a promising solution for the inevitable crisis of scarce communication resources. This trend stimulates us to explore the potential of applying SemCom to wireless vehicular networks, which normally consume a tremendous amount of resources to meet stringent reliability and latency req… ▽ More

    Submitted 23 September, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: This paper has been accepted for publication by IEEE Transactions on Wireless Communications

  26. arXiv:2301.01732  [pdf, ps, other]

    eess.IV cs.CV physics.med-ph

    Explicit Abnormality Extraction for Unsupervised Motion Artifact Reduction in Magnetic Resonance Imaging

    Authors: Yusheng Zhou, Hao Li, Jianan Liu, Zhengmin Kong, Tao Huang, Euijoon Ahn, Zhihan Lv, Jinman Kim, David Dagan Feng

    Abstract: Motion artifacts compromise the quality of magnetic resonance imaging (MRI) and pose challenges to achieving diagnostic outcomes and image-guided therapies. In recent years, supervised deep learning approaches have emerged as successful solutions for motion artifact reduction (MAR). One disadvantage of these methods is their dependency on acquiring paired sets of motion artifact-corrupted (MA-corr… ▽ More

    Submitted 14 August, 2024; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: Accepted by IEEE Journal of Biomedical and Health Informatics

  27. arXiv:2212.05808  [pdf, other]

    eess.IV cs.CV

    Z-SSMNet: Zonal-aware Self-supervised Mesh Network for Prostate Cancer Detection and Diagnosis with Bi-parametric MRI

    Authors: Yuan Yuan, Euijoon Ahn, Dagan Feng, Mohamad Khadra, Jinman Kim

    Abstract: Bi-parametric magnetic resonance imaging (bpMRI) has become a pivotal modality in the detection and diagnosis of clinically significant prostate cancer (csPCa). Developing AI-based systems to identify csPCa using bpMRI can transform PCa management by improving efficiency and cost-effectiveness. However, current state-of-the-art methods using convolutional neural networks (CNNs) are limited in lear… ▽ More

    Submitted 22 September, 2024; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: 13 pages, 7 figures

  28. arXiv:2211.05409  [pdf]

    eess.IV cs.CV cs.LG

    Radiomics-enhanced Deep Multi-task Learning for Outcome Prediction in Head and Neck Cancer

    Authors: Mingyuan Meng, Lei Bi, Dagan Feng, Jinman Kim

    Abstract: Outcome prediction is crucial for head and neck cancer patients as it can provide prognostic information for early treatment planning. Radiomics methods have been widely used for outcome prediction from medical images. However, these methods are limited by their reliance on intractable manual segmentation of tumor regions. Recently, deep learning methods have been proposed to perform end-to-end ou… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: HEad and neCK TumOR segmentation and outcome prediction challenge (HECKTOR 2022)

    Journal ref: Head and Neck Tumor Segmentation and Outcome Prediction (HECKTOR 2022), pp. 135-143

  29. arXiv:2211.01241  [pdf, other]

    eess.IV cs.AI eess.SP

    WiserVR: Semantic Communication Enabled Wireless Virtual Reality Delivery

    Authors: Le Xia, Yao Sun, Chengsi Liang, Daquan Feng, Runze Cheng, Yang Yang, Muhammad Ali Imran

    Abstract: Virtual reality (VR) over wireless is expected to be one of the killer applications in next-generation communication networks. Nevertheless, the huge data volume along with stringent requirements on latency and reliability under limited bandwidth resources makes untethered wireless VR delivery increasingly challenging. Such bottlenecks, therefore, motivate this work to seek the potential of using… ▽ More

    Submitted 13 March, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: This magazine article has been accepted for publication by IEEE Wireless Communications

  30. arXiv:2210.15808  [pdf]

    eess.IV cs.CV

    Hyper-Connected Transformer Network for Multi-Modality PET-CT Segmentation

    Authors: Lei Bi, Michael Fulham, Shaoli Song, David Dagan Feng, Jinman Kim

    Abstract: [18F]-Fluorodeoxyglucose (FDG) positron emission tomography - computed tomography (PET-CT) has become the imaging modality of choice for diagnosing many cancers. Co-learning complementary PET-CT imaging features is a fundamental requirement for automatic tumor segmentation and for developing computer aided cancer diagnosis systems. In this study, we propose a hyper-connected transformer (HCT) netw… ▽ More

    Submitted 7 August, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: EMBC 2023

  31. arXiv:2209.07705  [pdf, other]

    eess.IV cs.CV cs.LG

    Automatic Tumor Segmentation via False Positive Reduction Network for Whole-Body Multi-Modal PET/CT Images

    Authors: Yige Peng, Jinman Kim, Dagan Feng, Lei Bi

    Abstract: Multi-modality Fluorodeoxyglucose (FDG) positron emission tomography / computed tomography (PET/CT) has been routinely used in the assessment of common cancers, such as lung cancer, lymphoma, and melanoma. This is mainly attributed to the fact that PET/CT combines the high sensitivity for tumor detection of PET and anatomical information from CT. In PET/CT image assessment, automatic tumor segment… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: Pre-print paper for 2022 MICCAI AutoPET Challenge

  32. arXiv:2205.06891  [pdf, ps, other]

    eess.IV cs.CV physics.med-ph

    Unsupervised Representation Learning for 3D MRI Super Resolution with Degradation Adaptation

    Authors: Jianan Liu, Hao Li, Tao Huang, Euijoon Ahn, Kang Han, Adeel Razi, Wei Xiang, Jinman Kim, David Dagan Feng

    Abstract: High-resolution (HR) magnetic resonance imaging is critical in aiding doctors in their diagnoses and image-guided treatments. However, acquiring HR images can be time-consuming and costly. Consequently, deep learning-based super-resolution reconstruction (SRR) has emerged as a promising solution for generating super-resolution (SR) images from low-resolution (LR) images. Unfortunately, training su… ▽ More

    Submitted 24 April, 2024; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: Accepted by IEEE Transactions on Artificial Intelligence

  33. arXiv:2203.02384  [pdf, other]

    eess.IV cs.CV cs.LG

    AutoMO-Mixer: An automated multi-objective Mixer model for balanced, safe and robust prediction in medicine

    Authors: Xi Chen, Jiahuan Lv, Dehua Feng, Xuanqin Mou, Ling Bai, Shu Zhang, Zhiguo Zhou

    Abstract: Accurately identifying a patient's status through medical images plays an important role in diagnosis and treatment. Artificial intelligence (AI), especially deep learning, has achieved great success in many fields. However, more reliable AI models are needed in image-guided diagnosis and therapy. To achieve this goal, developing a balanced, safe and robust model with a unified framework is desira… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

  34. arXiv:2112.12424  [pdf, other]

    eess.IV

    Complexity-Oriented Per-shot Video Coding Optimization

    Authors: Hongcheng Zhong, Jun Xu, Chen Zhu, Donghui Feng, Li Song

    Abstract: Current per-shot encoding schemes aim to improve the compression efficiency by shot-level optimization. It splits a source video sequence into shots and imposes optimal sets of encoding parameters to each shot. Per-shot encoding achieved approximately 20% bitrate savings over baseline fixed QP encoding at the expense of pre-processing complexity. However, the adjustable parameter space of the curr… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

  35. arXiv:2112.06979  [pdf, other]

    eess.IV cs.CV

    The Brain Tumor Sequence Registration (BraTS-Reg) Challenge: Establishing Correspondence Between Pre-Operative and Follow-up MRI Scans of Diffuse Glioma Patients

    Authors: Bhakti Baheti, Satrajit Chakrabarty, Hamed Akbari, Michel Bilello, Benedikt Wiestler, Julian Schwarting, Evan Calabrese, Jeffrey Rudie, Syed Abidi, Mina Mousa, Javier Villanueva-Meyer, Brandon K. K. Fields, Florian Kofler, Russell Takeshi Shinohara, Juan Eugenio Iglesias, Tony C. W. Mok, Albert C. S. Chung, Marek Wodzinski, Artur Jurgas, Niccolo Marini, Manfredo Atzori, Henning Muller, Christoph Grobroehmer, Hanna Siebert, Lasse Hansen, et al. (48 additional authors not shown)

    Abstract: Registration of longitudinal brain MRI scans containing pathologies is challenging due to dramatic changes in tissue appearance. Although there has been progress in developing general-purpose medical image registration techniques, they have not yet attained the requisite precision and reliability for this task, highlighting its inherent complexity. Here we describe the Brain Tumor Sequence Registr… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 December, 2021; originally announced December 2021.

  36. arXiv:2111.10635  [pdf, other]

    cs.DC cs.AI cs.LG eess.SY

    HeterPS: Distributed Deep Learning With Reinforcement Learning Based Scheduling in Heterogeneous Environments

    Authors: Ji Liu, Zhihua Wu, Dianhai Yu, Yanjun Ma, Danlei Feng, Minxu Zhang, Xinxuan Wu, Xuefeng Yao, Dejing Dou

    Abstract: Deep neural networks (DNNs) exploit many layers and a large number of parameters to achieve excellent performance. The training process of DNN models generally handles large-scale input data with many sparse features, which incurs high Input/Output (IO) cost, while some layers are compute-intensive. The training process generally exploits distributed computing resources to reduce training time. In… ▽ More

    Submitted 7 June, 2023; v1 submitted 20 November, 2021; originally announced November 2021.

    Comments: 14 pages, 11 figures, 2 tables; To appear in Future Generation Computer Systems (FGCS)

  37. arXiv:2109.14805  [pdf, other]

    eess.IV cs.CV

    Unsupervised Landmark Detection Based Spatiotemporal Motion Estimation for 4D Dynamic Medical Images

    Authors: Yuyu Guo, Lei Bi, Dongming Wei, Liyun Chen, Zhengbin Zhu, Dagan Feng, Ruiyan Zhang, Qian Wang, Jinman Kim

    Abstract: Motion estimation is a fundamental step in dynamic medical image processing for the assessment of target organ anatomy and function. However, existing image-based motion estimation methods, which optimize the motion field by evaluating the local image similarity, are prone to produce implausible estimation, especially in the presence of large motion. In this study, we provide a novel motion estima… ▽ More

    Submitted 7 November, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: accepted by IEEE Transactions on Cybernetics

  38. arXiv:2109.07711  [pdf]

    eess.IV cs.CV cs.LG

    DeepMTS: Deep Multi-task Learning for Survival Prediction in Patients with Advanced Nasopharyngeal Carcinoma using Pretreatment PET/CT

    Authors: Mingyuan Meng, Bingxin Gu, Lei Bi, Shaoli Song, David Dagan Feng, Jinman Kim

    Abstract: Nasopharyngeal Carcinoma (NPC) is a malignant epithelial cancer arising from the nasopharynx. Survival prediction is a major concern for NPC patients, as it provides early prognostic information to plan treatments. Recently, deep survival models based on deep learning have demonstrated the potential to outperform traditional radiomics-based survival prediction models. Deep survival models usually… ▽ More

    Submitted 7 June, 2022; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: Accepted at IEEE Journal of Biomedical and Health Informatics (JBHI)

    Journal ref: IEEE Journal of Biomedical and Health Informatics, vol. 26, no. 9, pp. 4497-4507, 2022

  39. arXiv:2107.06170  [pdf]

    eess.SP cs.IT math.OC

    Robust Blind Source Separation by Soft Decision-Directed Non-Unitary Joint Diagonalization

    Authors: Wenjuan Liu, Dazheng Feng, Bingnan Pei, Mengdao Xing, Xinhong Meng, Qianru Wei

    Abstract: Approximate joint diagonalization of a set of matrices provides a powerful framework for numerous statistical signal processing applications. For non-unitary joint diagonalization (NUJD) based on the least-squares (LS) criterion, outliers, also referred to as anomaly or discordant observations, have a negative influence on the performance, since squaring the residuals magnifies the effects of them… ▽ More

    Submitted 28 June, 2021; originally announced July 2021.

    Comments: 19 pages, 9 figures

  40. arXiv:2104.11416  [pdf]

    eess.IV cs.CV cs.LG

    Predicting Distant Metastases in Soft-Tissue Sarcomas from PET-CT scans using Constrained Hierarchical Multi-Modality Feature Learning

    Authors: Yige Peng, Lei Bi, Ashnil Kumar, Michael Fulham, Dagan Feng, Jinman Kim

    Abstract: Distant metastases (DM) refer to the dissemination of tumors, usually, beyond the organ where the tumor originated. They are the leading cause of death in patients with soft-tissue sarcomas (STSs). Positron emission tomography-computed tomography (PET-CT) is regarded as the imaging modality of choice for the management of STSs. It is difficult to determine from imaging studies which STS patients w…

    Submitted 23 April, 2021; originally announced April 2021.

    Comments: Under Review

  41. arXiv:2103.05220  [pdf]

    eess.IV cs.CV cs.LG stat.AP

    Prediction of 5-year Progression-Free Survival in Advanced Nasopharyngeal Carcinoma with Pretreatment PET/CT using Multi-Modality Deep Learning-based Radiomics

    Authors: Bingxin Gu, Mingyuan Meng, Lei Bi, Jinman Kim, David Dagan Feng, Shaoli Song

    Abstract: Objective: Deep Learning-based Radiomics (DLR) has achieved great success in medical image analysis and has been considered a replacement for conventional radiomics that relies on handcrafted features. In this study, we aimed to explore the capability of DLR for the prediction of 5-year Progression-Free Survival (PFS) in Nasopharyngeal Carcinoma (NPC) using pretreatment PET/CT. Methods: A total of…

    Submitted 4 July, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted at Frontiers in Oncology

    Journal ref: Frontiers in Oncology, vol. 12, pp. 899352, 2022

  42. Enhancing Medical Image Registration via Appearance Adjustment Networks

    Authors: Mingyuan Meng, Lei Bi, Michael Fulham, David Dagan Feng, Jinman Kim

    Abstract: Deformable image registration is fundamental for many medical image analyses. A key obstacle for accurate image registration lies in image appearance variations such as the variations in texture, intensities, and noise. These variations are readily apparent in medical images, especially in brain images where registration is frequently used. Recently, deep learning-based registration methods (DLRs)…

    Submitted 3 July, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Published at NeuroImage

    Journal ref: NeuroImage, vol. 259, pp. 119444, 2022

  43. arXiv:2102.02998  [pdf, other]

    eess.AS

    Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output

    Authors: Hangting Chen, Yang Yi, Dang Feng, Pengyuan Zhang

    Abstract: Time-domain audio separation network (TasNet) has achieved remarkable performance in blind source separation (BSS). Classic multi-channel speech processing framework employs signal estimation and beamforming. For example, Beam-TasNet links multi-channel convolutional TasNet (MC-Conv-TasNet) with minimum variance distortionless response (MVDR) beamforming, which leverages the strong modeling abilit…

    Submitted 12 April, 2022; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: Submitted to Interspeech 2022

  44. arXiv:2012.12472  [pdf, ps, other]

    cs.IT eess.SY

    Understanding Age of Information in Large-Scale Wireless Networks

    Authors: Howard H. Yang, Chao Xu, Xijun Wang, Daquan Feng, Tony Q. S. Quek

    Abstract: The notion of age-of-information (AoI) is investigated in the context of large-scale wireless networks, in which transmitters need to send a sequence of information packets, which are generated as independent Bernoulli processes, to their intended receivers over a shared spectrum. Due to interference, the rate of packet depletion at any given node is entangled with both the spatial configurations,…

    Submitted 22 December, 2020; originally announced December 2020.

  45. arXiv:2007.06002  [pdf, other]

    cs.CV cs.LG eess.IV

    Multi-Modality Information Fusion for Radiomics-based Neural Architecture Search

    Authors: Yige Peng, Lei Bi, Michael Fulham, Dagan Feng, Jinman Kim

    Abstract: 'Radiomics' is a method that extracts mineable quantitative features from radiographic images. These features can then be used to determine prognosis, for example, predicting the development of distant metastases (DM). Existing radiomics methods, however, require complex manual effort including the design of hand-crafted radiomic features and their extraction and selection. Recent radiomics method…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted by MICCAI 2020

  46. arXiv:2002.12680  [pdf, other]

    cs.CV eess.IV

    A Spatiotemporal Volumetric Interpolation Network for 4D Dynamic Medical Image

    Authors: Yuyu Guo, Lei Bi, Euijoon Ahn, Dagan Feng, Qian Wang, Jinman Kim

    Abstract: Dynamic medical imaging is usually limited in application due to the large radiation doses and longer image scanning and reconstruction times. Existing methods attempt to reduce the dynamic sequence by interpolating the volumes between the acquired image volumes. However, these methods are limited to either 2D images and/or are unable to support large variations in the motion between the image vol…

    Submitted 24 April, 2020; v1 submitted 28 February, 2020; originally announced February 2020.

    Comments: 10 pages, 8 figures, Conference on Computer Vision and Pattern Recognition (CVPR) 2020

  47. arXiv:1911.10468  [pdf]

    physics.app-ph eess.SP

    Extending the dynamic strain sensing range of phase-OTDR with frequency modulation pulse and frequency interrogation

    Authors: Jingdong Zhang, Haoting Wu, Jingsheng Huang, Hua Zheng, Danqi Feng, Guolu Yin, Tao Zhu

    Abstract: We propose and experimentally demonstrate a technique to extend the dynamic sensing range of phase sensitive optical time domain reflectometry system based on the frequency interrogation. Benefitting from the range Doppler coupling feature, the frequency modulation pulse is capable of measuring the frequency shift induced by the dynamic strain, thus the large dynamic strain can be recovered. The p…

    Submitted 24 November, 2019; originally announced November 2019.

  48. arXiv:1909.00971  [pdf]

    eess.SY math.OC

    Load Forecasting Model and Day-ahead Operation Strategy for City-located EV Quick Charge Stations

    Authors: Zeyu Liu, Yaxin Xie, Donghan Feng, Yun Zhou, Shanshan Shi, Chen Fang

    Abstract: Charging demands of electric vehicles (EVs) are sharply increasing due to the rapid development of EVs. Hence, reliable and convenient quick charge stations are required to respond to the needs of EV drivers. Due to the uncertainty of EV charging loads, load forecasting becomes vital for the operation of quick charge stations to formulate the day-ahead plan. In this paper, based on trip chain theo…

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: This article has been accepted in the 2019 International Conference on Renewable Power Generation (RPG 2019), Shanghai, China, October 24-25, 2019

  49. arXiv:1906.08497  [pdf]

    eess.SY eess.SP math.OC

    Optimal Decision Making Model of Battery Energy Storage-Assisted Electric Vehicle Charging Station Considering Incentive Demand Response

    Authors: Bishal Upadhaya, Donghan Feng, Yun Zhou, Qiang Gui, Xiaojin Zhao, Dan Wu

    Abstract: Considering large scale implementation of electric vehicles (EVs), public EV charging stations are served as fuel tanks for EVs to meet the need of longer travelling distance and overcome the shortage of private charging piles. The allocation of local battery energy storage (BES) can enhance the flexibility of the EV charging station. This paper proposes an optimal decision making model of the BES…

    Submitted 20 June, 2019; originally announced June 2019.

  50. arXiv:1906.08411  [pdf]

    eess.SP eess.SY math.OC

    A novel linear battery energy storage system (BESS) life loss calculation model for BESS-integrated wind farm in scheduled power tracking

    Authors: Qiang Gui, Hao Su, Donghan Feng, Yun Zhou, Ran Xu, Ting Lei

    Abstract: Recently, rapid development of battery technology makes it feasible to integrate renewable generations with battery energy storage system (BESS). The consideration of BESS life loss for different BESS application scenarios is economic imperative. In this paper, a novel linear BESS life loss calculation model for BESS-integrated wind farm in scheduled power tracking is proposed. Firstly, based on t…

    Submitted 27 October, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: This article has been accepted in the 2019 International Conference on Renewable Power Generation (RPG 2019), Shanghai, China, October 24-25, 2019