Skip to main content

Showing 1–11 of 11 results for author: Tie, X

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.00528  [pdf, other

    cs.CV cs.CL

    Vision-Language Modeling in PET/CT for Visual Grounding of Positive Findings

    Authors: Zachary Huemann, Samuel Church, Joshua D. Warner, Daniel Tran, Xin Tie, Alan B McMillan, Junjie Hu, Steve Y. Cho, Meghan Lubner, Tyler J. Bradshaw

    Abstract: Vision-language models can connect the text description of an object to its specific location in an image through visual grounding. This has potential applications in enhanced radiology reporting. However, these models require large annotated image-text datasets, which are lacking for PET/CT. We developed an automated pipeline to generate weak labels linking PET/CT report descriptions to their ima… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

  2. arXiv:2412.07489  [pdf, ps, other

    cs.IT eess.SP

    DFT-s-OFDM-based On-Off Keying for Low-Power Wake-Up Signal

    Authors: Renaud-Alexandre Pitaval, Xiaolei Tie

    Abstract: 5G-Advanced and likely 6G will support a new low-power wake-up signal (LP-WUS) enabling low-power devices, equipped with a complementary ultra low-power receiver to monitor wireless traffic, to completely switch off their main radio. This orthogonal frequency-division multiplexed (OFDM) signal will emulate an on-off keying (OOK) modulation to enable very low-energy envelope detection at the receiv… ▽ More

    Submitted 17 June, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted for publication in IEEE Transactions on Communications

  3. arXiv:2412.00663  [pdf

    eess.IV cs.AI cs.CV physics.med-ph

    Deep Learning for Longitudinal Gross Tumor Volume Segmentation in MRI-Guided Adaptive Radiotherapy for Head and Neck Cancer

    Authors: Xin Tie, Weijie Chen, Zachary Huemann, Brayden Schott, Nuohao Liu, Tyler J. Bradshaw

    Abstract: Accurate segmentation of gross tumor volume (GTV) is essential for effective MRI-guided adaptive radiotherapy (MRgART) in head and neck cancer. However, manual segmentation of the GTV over the course of therapy is time-consuming and prone to interobserver variability. Deep learning (DL) has the potential to overcome these challenges by automatically delineating GTVs. In this study, our team,… ▽ More

    Submitted 30 November, 2024; originally announced December 2024.

    Comments: 12 pages, 4 figures, 4 tables

  4. arXiv:2404.08611  [pdf, other

    cs.CV cs.AI physics.med-ph

    Automatic Quantification of Serial PET/CT Images for Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-Aware Segmentation Network

    Authors: Xin Tie, Muheon Shin, Changhee Lee, Scott B. Perlman, Zachary Huemann, Amy J. Weisman, Sharon M. Castellino, Kara M. Kelly, Kathleen M. McCarten, Adina L. Alazraki, Junjie Hu, Steve Y. Cho, Tyler J. Bradshaw

    Abstract: $\textbf{Purpose}$: Automatic quantification of longitudinal changes in PET scans for lymphoma patients has proven challenging, as residual disease in interim-therapy scans is often subtle and difficult to detect. Our goal was to develop a longitudinally-aware segmentation network (LAS-Net) that can quantify serial PET/CT images for pediatric Hodgkin lymphoma patients. $\textbf{Materials and Metho… ▽ More

    Submitted 30 September, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: There are 6 figures and 4 tables in the main text. The supplementary material is appended to the main text

  5. arXiv:2309.10066  [pdf, other

    cs.AI cs.CL physics.med-ph

    Automatic Personalized Impression Generation for PET Reports Using Large Language Models

    Authors: Xin Tie, Muheon Shin, Ali Pirasteh, Nevein Ibrahim, Zachary Huemann, Sharon M. Castellino, Kara M. Kelly, John Garrett, Junjie Hu, Steve Y. Cho, Tyler J. Bradshaw

    Abstract: In this study, we aimed to determine if fine-tuned large language models (LLMs) can generate accurate, personalized impressions for whole-body PET reports. Twelve language models were trained on a corpus of PET reports using the teacher-forcing algorithm, with the report findings as input and the clinical impressions as reference. An extra input token encodes the reading physician's identity, allo… ▽ More

    Submitted 17 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 25 pages in total. 6 figures and 3 tables in the main body. The manuscript has been submitted to a journal for potential publication

    Journal ref: J Digit Imaging. Inform. Med. (2024)

  6. arXiv:2303.01615  [pdf, other

    cs.CV

    ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax

    Authors: Zachary Huemann, Xin Tie, Junjie Hu, Tyler J. Bradshaw

    Abstract: Radiology narrative reports often describe characteristics of a patient's disease, including its location, size, and shape. Motivated by the recent success of multimodal learning, we hypothesized that this descriptive text could guide medical image analysis algorithms. We proposed a novel vision-language model, ConTEXTual Net, for the task of pneumothorax segmentation on chest radiographs. ConTEXT… ▽ More

    Submitted 15 September, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  7. arXiv:2210.02189  [pdf

    eess.IV cs.CV cs.LG

    A Generalizable Artificial Intelligence Model for COVID-19 Classification Task Using Chest X-ray Radiographs: Evaluated Over Four Clinical Datasets with 15,097 Patients

    Authors: Ran Zhang, Xin Tie, John W. Garrett, Dalton Griner, Zhihua Qi, Nicholas B. Bevins, Scott B. Reeder, Guang-Hong Chen

    Abstract: Purpose: To answer the long-standing question of whether a model trained from a single clinical site can be generalized to external sites. Materials and Methods: 17,537 chest x-ray radiographs (CXRs) from 3,264 COVID-19-positive patients and 4,802 COVID-19-negative patients were collected from a single site for AI model development. The generalizability of the trained model was retrospectively e… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

  8. arXiv:2104.09752  [pdf, other

    cs.CV

    Flow-based Video Segmentation for Human Head and Shoulders

    Authors: Zijian Kuang, Xinran Tie

    Abstract: Video segmentation for the human head and shoulders is essential in creating elegant media for videoconferencing and virtual reality applications. The main challenge is to process high-quality background subtraction in a real-time manner and address the segmentation issues under motion blurs, e.g., shaking the head or waving hands during conference video. To overcome the motion blur problem in vid… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

  9. arXiv:2103.13477  [pdf, ps, other

    cs.MM cs.CV

    A Survey of Multimedia Technologies and Robust Algorithms

    Authors: Zijian Kuang, Xinran Tie

    Abstract: Multimedia technologies are now more practical and deployable in real life, and the algorithms are widely used in various researching areas such as deep learning, signal processing, haptics, computer vision, robotics, and medical multimedia processing. This survey provides an overview of multimedia technologies and robust algorithms in multimedia data processing, medical multimedia processing, hum… ▽ More

    Submitted 25 March, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: arXiv admin note: text overlap with arXiv:2010.12968

  10. arXiv:2012.06737  [pdf, other

    cs.CV

    Computer Vision and Normalizing Flow-Based Defect Detection

    Authors: Zijian Kuang, Xinran Tie, Lihang Ying, Shi Jin

    Abstract: Visual defect detection is critical to ensure the quality of most products. However, the majority of small and medium-sized manufacturing enterprises still rely on tedious and error-prone human manual inspection. The main reasons include: 1) the existing automated visual defect detection systems require altering production assembly lines, which is time consuming and expensive 2) the existing syste… ▽ More

    Submitted 13 February, 2022; v1 submitted 12 December, 2020; originally announced December 2020.

  11. Improved Actor Relation Graph based Group Activity Recognition

    Authors: Zijian Kuang, Xinran Tie

    Abstract: Video understanding is to recognize and classify different actions or activities appearing in the video. A lot of previous work, such as video captioning, has shown promising performance in producing general video understanding. However, it is still challenging to generate a fine-grained description of human actions and their interactions using state-of-the-art video captioning techniques. The det… ▽ More

    Submitted 29 December, 2020; v1 submitted 24 October, 2020; originally announced October 2020.

    Journal ref: ICSM 2022