Skip to main content

Showing 1–18 of 18 results for author: Ji, W

Searching in archive eess. Search in all archives.
.
  1. arXiv:2503.13257  [pdf, other

    eess.IV

    Anatomically and Metabolically Informed Diffusion for Unified Denoising and Segmentation in Low-Count PET Imaging

    Authors: Menghua Xia, Kuan-Yin Ko, Der-Shiun Wang, Ming-Kai Chen, Qiong Liu, Huidong Xie, Liang Guo, Wei Ji, Jinsong Ouyang, Reimund Bayerlein, Benjamin A. Spencer, Quanzheng Li, Ramsey D. Badawi, Georges El Fakhri, Chi Liu

    Abstract: Positron emission tomography (PET) image denoising, along with lesion and organ segmentation, are critical steps in PET-aided diagnosis. However, existing methods typically treat these tasks independently, overlooking inherent synergies between them as correlated steps in the analysis pipeline. In this work, we present the anatomically and metabolically informed diffusion (AMDiff) model, a unified… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  2. arXiv:2503.04258  [pdf, other

    cs.SD cs.AI cs.CV eess.AS

    TAIL: Text-Audio Incremental Learning

    Authors: Yingfei Sun, Xu Gu, Wei Ji, Hanbin Zhao, Hao Fei, Yifang Yin, Roger Zimmermann

    Abstract: Many studies combine text and audio to capture multi-modal information but they overlook the model's generalization ability on new datasets. Introducing new datasets may affect the feature space of the original dataset, leading to catastrophic forgetting. Meanwhile, large model parameters can significantly impact training performance. To address these limitations, we introduce a novel task called… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: 4 figures, 5 tables

    ACM Class: I.2

  3. arXiv:2502.11946  [pdf, other

    cs.CL cs.AI cs.HC cs.SD eess.AS

    Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

    Authors: Ailin Huang, Boyong Wu, Bruce Wang, Chao Yan, Chen Hu, Chengli Feng, Fei Tian, Feiyu Shen, Jingbei Li, Mingrui Chen, Peng Liu, Ruihang Miao, Wang You, Xi Chen, Xuerui Yang, Yechang Huang, Yuxiang Zhang, Zheng Gong, Zixin Zhang, Hongyu Zhou, Jianjian Sun, Brian Li, Chengting Feng, Changyi Wan, Hanpeng Hu , et al. (120 additional authors not shown)

    Abstract: Real-time speech interaction, serving as a fundamental interface for human-machine collaboration, holds immense potential. However, current open-source models face limitations such as high costs in voice data collection, weakness in dynamic control, and limited intelligence. To address these challenges, this paper introduces Step-Audio, the first production-ready open-source solution. Key contribu… ▽ More

    Submitted 18 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  4. arXiv:2412.16573  [pdf, other

    eess.IV physics.med-ph

    A Generalizable 3D Diffusion Framework for Low-Dose and Few-View Cardiac SPECT

    Authors: Huidong Xie, Weijie Gan, Wei Ji, Xiongchao Chen, Alaa Alashi, Stephanie L. Thorn, Bo Zhou, Qiong Liu, Menghua Xia, Xueqi Guo, Yi-Hwa Liu, Hongyu An, Ulugbek S. Kamilov, Ge Wang, Albert J. Sinusas, Chi Liu

    Abstract: Myocardial perfusion imaging using SPECT is widely utilized to diagnose coronary artery diseases, but image quality can be negatively affected in low-dose and few-view acquisition settings. Although various deep learning methods have been introduced to improve image quality from low-dose or few-view SPECT data, previous approaches often fail to generalize across different acquisition settings, lim… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

    Comments: 13 pages, 6 figures, 2 tables. Paper under review. Oral presentation at IEEE MIC 2024

  5. arXiv:2411.04844  [pdf, other

    eess.IV cs.CV

    Discretized Gaussian Representation for Tomographic Reconstruction

    Authors: Shaokai Wu, Yuxiang Lu, Wei Ji, Suizhi Huang, Fengyu Yang, Shalayiding Sirejiding, Qichen He, Jing Tong, Yanbiao Ji, Yue Ding, Hongtao Lu

    Abstract: Computed Tomography (CT) is a widely used imaging technique that provides detailed cross-sectional views of objects. Over the past decade, Deep Learning-based Reconstruction (DLR) methods have led efforts to enhance image quality and reduce noise, yet they often require large amounts of data and are computationally intensive. Inspired by recent advancements in scene reconstruction, some approaches… ▽ More

    Submitted 27 March, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

  6. arXiv:2405.12996  [pdf, ps, other

    eess.IV

    Dose-aware Diffusion Model for 3D PET Image Denoising: Multi-institutional Validation with Reader Study and Real Low-dose Data

    Authors: Huidong Xie, Weijie Gan, Reimund Bayerlein, Bo Zhou, Ming-Kai Chen, Michal Kulon, Annemarie Boustani, Kuan-Yin Ko, Der-Shiun Wang, Benjamin A. Spencer, Wei Ji, Xiongchao Chen, Qiong Liu, Xueqi Guo, Menghua Xia, Yinchi Zhou, Hui Liu, Liang Guo, Hongyu An, Ulugbek S. Kamilov, Hanzhong Wang, Biao Li, Axel Rominger, Kuangyu Shi, Ge Wang , et al. (2 additional authors not shown)

    Abstract: Reducing scan times, radiation dose, and enhancing image quality for lower-performance scanners, are critical in low-dose PET imaging. Deep learning techniques have been investigated for PET image denoising. However, existing models have often resulted in compromised image quality when achieving low-count/low-dose PET and have limited generalizability to different image noise-levels, acquisition p… ▽ More

    Submitted 16 June, 2025; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 18 Pages, 16 Figures, 5 Tables. Paper under review. First-place Freek J. Beekman Young Investigator Award at SNMMI 2024. Code available after paper publication. arXiv admin note: substantial text overlap with arXiv:2311.04248

  7. arXiv:2310.20151  [pdf, other

    cs.CL cs.RO eess.SY

    Multi-Agent Consensus Seeking via Large Language Models

    Authors: Huaben Chen, Wenkang Ji, Lufeng Xu, Shiyu Zhao

    Abstract: Multi-agent systems driven by large language models (LLMs) have shown promising abilities for solving complex tasks in a collaborative manner. This work considers a fundamental problem in multi-agent collaboration: consensus seeking. When multiple agents work together, we are interested in how they can reach a consensus through inter-agent negotiation. To that end, this work studies a consensus-se… ▽ More

    Submitted 21 January, 2025; v1 submitted 30 October, 2023; originally announced October 2023.

  8. arXiv:2306.08329  [pdf

    cs.SD cs.CL eess.AS

    Research on an improved Conformer end-to-end Speech Recognition Model with R-Drop Structure

    Authors: Weidong Ji, Shijie Zan, Guohui Zhou, Xu Wang

    Abstract: To address the issue of poor generalization ability in end-to-end speech recognition models within deep learning, this study proposes a new Conformer-based speech recognition model called "Conformer-R" that incorporates the R-drop structure. This model combines the Conformer model, which has shown promising results in speech recognition, with the R-drop structure. By doing so, the model is able to… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 15 pages, 9 figures

  9. arXiv:2302.05298  [pdf

    cs.NI eess.SY

    Optical Switching Data Center Networks: Understanding Techniques and Challenges

    Authors: Xuwei Xue, Shaojuan Zhang, Bingli Guo, Wei Ji, Rui Yin, Bin Chen, Shanguo Huang

    Abstract: Relying on the flexible-access interconnects to the scalable storage and compute resources, data centers deliver critical communications connectivity among numerous servers to support the housed applications and services. To provide the high-speeds and long-distance communications, the data centers have turned to fiber interconnections. With the stringently increased traffic volume, the data cente… ▽ More

    Submitted 13 January, 2023; originally announced February 2023.

  10. arXiv:2301.11798  [pdf, other

    eess.IV cs.CV

    MedSegDiff-V2: Diffusion based Medical Image Segmentation with Transformer

    Authors: Junde Wu, Wei Ji, Huazhu Fu, Min Xu, Yueming Jin, Yanwu Xu

    Abstract: The Diffusion Probabilistic Model (DPM) has recently gained popularity in the field of computer vision, thanks to its image generation applications, such as Imagen, Latent Diffusion Models, and Stable Diffusion, which have demonstrated impressive capabilities and sparked much discussion within the community. Recent investigations have further unveiled the utility of DPM in the domain of medical im… ▽ More

    Submitted 23 December, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Code will be released at https://github.com/KidsWithTokens/MedSegDiff

  11. Driving Style Recognition at First Impression for Online Trajectory Prediction

    Authors: Tu Xu, Kan Wu, Yongdong Zhu, Wei Ji

    Abstract: This paper proposes a new driving style recognition approach that allows autonomous vehicles (AVs) to perform trajectory predictions for surrounding vehicles with minimal data. Toward that end, we use a hybrid of offline and online methods in the proposed approach. We first learn typical driving styles with PCA and K-means algorithms in the offline part. After that, local Maximum-Likelihood techni… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  12. arXiv:2202.13123  [pdf, other

    cs.CV eess.IV

    Content-Variant Reference Image Quality Assessment via Knowledge Distillation

    Authors: Guanghao Yin, Wei Wang, Zehuan Yuan, Chuchu Han, Wei Ji, Shouqian Sun, Changhu Wang

    Abstract: Generally, humans are more skilled at perceiving differences between high-quality (HQ) and low-quality (LQ) images than directly judging the quality of a single LQ image. This situation also applies to image quality assessment (IQA). Although recent no-reference (NR-IQA) methods have made great progress to predict image quality free from the reference image, they still have the potential to achiev… ▽ More

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: AAAI2022 oral accepted

  13. arXiv:2107.06172  [pdf, other

    physics.chem-ph eess.SP

    Arrhenius.jl: A Differentiable Combustion SimulationPackage

    Authors: Weiqi Ji, Xingyu Su, Bin Pang, Sean Joseph Cassady, Alison M. Ferris, Yujuan Li, Zhuyin Ren, Ronald Hanson, Sili Deng

    Abstract: Combustion kinetic modeling is an integral part of combustion simulation, and extensive studies have been devoted to developing both high fidelity and computationally affordable models. Despite these efforts, modeling combustion kinetics is still challenging due to the demand for expert knowledge and optimization against experiments, as well as the lack of understanding of the associated uncertain… ▽ More

    Submitted 19 June, 2021; originally announced July 2021.

  14. arXiv:2106.13064  [pdf

    physics.bio-ph cs.CV eess.IV

    Advancing biological super-resolution microscopy through deep learning: a brief review

    Authors: Tianjie Yang, Yaoru Luo, Wei Ji, Ge Yang

    Abstract: Super-resolution microscopy overcomes the diffraction limit of conventional light microscopy in spatial resolution. By providing novel spatial or spatio-temporal information on biological processes at nanometer resolution with molecular specificity, it plays an increasingly important role in life sciences. However, its technical limitations require trade-offs to balance its spatial resolution, tem… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Journal ref: Biophysics Reports, 7(4): 253-266, 2021

  15. arXiv:2005.03832  [pdf, other

    eess.IV cs.CV cs.LG

    Synergistic Learning of Lung Lobe Segmentation and Hierarchical Multi-Instance Classification for Automated Severity Assessment of COVID-19 in CT Images

    Authors: Kelei He, Wei Zhao, Xingzhi Xie, Wen Ji, Mingxia Liu, Zhenyu Tang, Feng Shi, Yang Gao, Jun Liu, Junfeng Zhang, Dinggang Shen

    Abstract: Understanding chest CT imaging of the coronavirus disease 2019 (COVID-19) will help detect infections early and assess the disease progression. Especially, automated severity assessment of COVID-19 in CT images plays an essential role in identifying cases that are in great need of intensive clinical care. However, it is often challenging to accurately assess the severity of this disease in CT imag… ▽ More

    Submitted 24 May, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

  16. arXiv:2005.00096  [pdf, other

    eess.AS cs.CL cs.SD

    An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety

    Authors: Jing Han, Kun Qian, Meishu Song, Zijiang Yang, Zhao Ren, Shuo Liu, Juan Liu, Huaiyuan Zheng, Wei Ji, Tomoya Koike, Xiao Li, Zixing Zhang, Yoshiharu Yamamoto, Björn W. Schuller

    Abstract: The COVID-19 outbreak was announced as a global pandemic by the World Health Organisation in March 2020 and has affected a growing number of people in the past few weeks. In this context, advanced artificial intelligence techniques are brought to the fore in responding to fight against and reduce the impact of this global health crisis. In this study, we focus on developing some potential use-case… ▽ More

    Submitted 14 May, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

  17. arXiv:2004.02640  [pdf, other

    eess.IV cs.CV

    Coronavirus Detection and Analysis on Chest CT with Deep Learning

    Authors: Ophir Gozes, Maayan Frid-Adar, Nimrod Sagie, Huangqi Zhang, Wenbin Ji, Hayit Greenspan

    Abstract: The outbreak of the novel coronavirus, officially declared a global pandemic, has a severe impact on our daily lives. As of this writing there are approximately 197,188 confirmed cases of which 80,881 are in "Mainland China" with 7,949 deaths, a mortality rate of 3.4%. In order to support radiologists in this overwhelming challenge, we develop a deep learning based algorithm that can detect, local… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

  18. arXiv:2003.05037  [pdf

    eess.IV cs.CV cs.LG

    Rapid AI Development Cycle for the Coronavirus (COVID-19) Pandemic: Initial Results for Automated Detection & Patient Monitoring using Deep Learning CT Image Analysis

    Authors: Ophir Gozes, Maayan Frid-Adar, Hayit Greenspan, Patrick D. Browning, Huangqi Zhang, Wenbin Ji, Adam Bernheim, Eliot Siegel

    Abstract: Purpose: Develop AI-based automated CT image analysis tools for detection, quantification, and tracking of Coronavirus; demonstrate they can differentiate coronavirus patients from non-patients. Materials and Methods: Multiple international datasets, including from Chinese disease-infected areas were included. We present a system that utilizes robust 2D and 3D deep learning models, modifying and a… ▽ More

    Submitted 24 March, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: 19 pages, 6 figures