Search | arXiv e-print repository

Robust Phantom-Assisted Framework for Multi-Person Localization and Vital Signs Monitoring Using MIMO FMCW Radar

Authors: Yonathan Eder, Emma Zagoury, Shlomi Savariego, Moshe Namer, Oded Cohen, Yonina C. Eldar

Abstract: With the rising prevalence of cardiovascular and respiratory disorders and an aging global population, healthcare systems face increasing pressure to adopt efficient, non-contact vital sign monitoring (NCVSM) solutions. This study introduces a robust framework for multi-person localization and vital signs monitoring, using multiple-input-multiple-output frequency-modulated continuous wave radar, addressing challenges in real-world, cluttered environments. Two key contributions are presented. First, a custom hardware phantom was developed to simulate multi-person NCVSM scenarios, utilizing recorded thoracic impedance signals to replicate realistic cardiopulmonary dynamics. The phantom's design facilitates repeatable and rapid validation of radar systems and algorithms under diverse conditions to accelerate deployment in human monitoring. Second, aided by the phantom, we designed a robust algorithm for multi-person localization utilizing joint sparsity and cardiopulmonary properties, alongside harmonics-resilient dictionary-based vital signs estimation, to mitigate interfering respiration harmonics. Additionally, an adaptive signal refinement procedure is introduced to enhance the accuracy of continuous NCVSM by leveraging the continuity of the estimates. Performance was validated and compared to existing techniques through 12 phantom trials and 12 human trials, including both single- and multi-person scenarios, demonstrating superior localization and NCVSM performance. For example, in multi-person human trials, our method achieved average respiration rate estimation accuracies of 94.14%, 98.12%, and 98.69% within error thresholds of 2, 3, and 4 breaths per minute, respectively, and heart rate accuracies of 87.10%, 94.12%, and 95.54% within the same thresholds. These results highlight the potential of this framework for reliable multi-person NCVSM in healthcare and IoT applications.

Submitted 12 January, 2025; originally announced January 2025.
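The accuracy figures quoted in the abstract above are reported "within error thresholds" of 2, 3, and 4 breaths (or beats) per minute. A minimal sketch of how such a threshold accuracy can be computed is given below. It assumes that "within" means an absolute error less than or equal to the threshold, which the listing does not state; the function name `accuracy_within` and all sample values are made up for illustration and are not taken from the paper.

```python
def accuracy_within(estimates, references, threshold):
    """Percentage of estimates whose absolute error is <= threshold.

    Assumption: "within a threshold" is inclusive (<=); the abstract
    does not say whether the comparison is strict.
    """
    if len(estimates) != len(references) or not estimates:
        raise ValueError("need two non-empty sequences of equal length")
    hits = sum(abs(e - r) <= threshold for e, r in zip(estimates, references))
    return 100.0 * hits / len(estimates)


# Hypothetical respiration-rate estimates and reference values (breaths/min).
est = [14.0, 16.5, 12.0, 19.0, 15.0]
ref = [15.0, 16.0, 15.5, 18.0, 10.0]

for thr in (2, 3, 4):
    print(f"accuracy within {thr} bpm: {accuracy_within(est, ref, thr):.1f}%")
```

Because each larger threshold admits every estimate the smaller one did, the accuracy can only stay equal or rise as the threshold grows, which matches the increasing pattern of the reported figures.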

arXiv:2410.19197 [pdf, other]

doi 10.1364/OL.545836

Single-shot X-ray ptychography as a structured illumination method

Authors: Abraham Levitan, Klaus Wakonig, Zirui Gao, Adam Kubec, Bing Kuan Chen, Oren Cohen, Manuel Guizar-Sicairos

Abstract: Single-shot ptychography is a quantitative phase imaging method wherein overlapping beams of light arranged in a grid pattern simultaneously illuminate a sample, allowing a full ptychographic dataset to be collected in a single shot. It is primarily used at optical wavelengths, but there is interest in using it for X-ray imaging. However, the constraints imposed by X-ray optics have limited the resolution achievable to date. In this work, we reinterpret single-shot ptychography as a structured illumination method by viewing the grid of beams as a single, highly structured illumination function. Pre-calibrating this illumination and reconstructing single-shot data using the randomized probe imaging algorithm allows us to account for the overlap and coherent interference between the diffraction arising from each beam. We achieve a resolution 3.5 times finer than the numerical aperture-based limit imposed by traditional algorithms for single-shot ptychography. We argue that this reconstruction method will work better for most single-shot ptychography experiments and discuss the implications for the design of future single-shot X-ray microscopes.

Submitted 24 October, 2024; originally announced October 2024.

Comments: 4 pages, 3 figures

Journal ref: Opt. Lett. 50 (2025) 443-446

arXiv:2409.09545 [pdf, other]

Multi-Microphone and Multi-Modal Emotion Recognition in Reverberant Environment

Authors: Ohad Cohen, Gershon Hazan, Sharon Gannot

Abstract: This paper presents a Multi-modal Emotion Recognition (MER) system designed to enhance emotion recognition accuracy in challenging acoustic conditions. Our approach combines a modified and extended Hierarchical Token-semantic Audio Transformer (HTS-AT) for multi-channel audio processing with an R(2+1)D Convolutional Neural Network (CNN) model for video analysis. We evaluate our proposed method on a reverberated version of the Ryerson audio-visual database of emotional speech and song (RAVDESS) dataset using synthetic and real-world Room Impulse Responses (RIRs). Our results demonstrate that integrating audio and video modalities yields superior performance compared to uni-modal approaches, especially in challenging acoustic conditions. Moreover, we show that the multimodal (audiovisual) approach that utilizes multiple microphones outperforms its single-microphone counterpart.

Submitted 17 September, 2024; v1 submitted 14 September, 2024; originally announced September 2024.

arXiv:2406.03272 [pdf, other]

Multi-Microphone Speech Emotion Recognition using the Hierarchical Token-semantic Audio Transformer Architecture

Authors: Ohad Cohen, Gershon Hazan, Sharon Gannot

Abstract: The performance of most emotion recognition systems degrades in real-life situations ('in the wild' scenarios) where the audio is contaminated by reverberation. Our study explores new methods to alleviate the performance degradation of SER algorithms and develop a more robust system for adverse conditions. We propose processing multi-microphone signals to address these challenges and improve emotion classification accuracy. We adopt a state-of-the-art transformer model, the HTS-AT, to handle multi-channel audio inputs. We evaluate two strategies: averaging mel-spectrograms across channels and summing patch-embedded representations. Our multi-microphone model achieves superior performance compared to single-channel baselines when tested on real-world reverberant environments.

Submitted 14 September, 2024; v1 submitted 5 June, 2024; originally announced June 2024.
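The two multi-channel strategies named in the abstract above (averaging mel-spectrograms across channels, and summing patch-embedded representations) differ in where the microphone channels are fused. The toy sketch below illustrates that difference only. It is not the HTS-AT implementation: the helper names, the flattened one-dimensional "patches", and the tiny linear projection `weights` are all hypothetical stand-ins chosen to keep the example self-contained.

```python
def average_channels(specs):
    """Fuse before the model: specs[c][f][t] -> mean over channels c."""
    n = len(specs)
    return [[sum(spec[f][t] for spec in specs) / n
             for t in range(len(specs[0][0]))]
            for f in range(len(specs[0]))]


def patch_embed(spec, weights, patch_len):
    """Toy patch embedding (assumption): flatten the spectrogram, cut it
    into patches of patch_len values, and project each patch linearly."""
    flat = [v for row in spec for v in row]
    patches = [flat[i:i + patch_len] for i in range(0, len(flat), patch_len)]
    return [[sum(w * v for w, v in zip(w_row, p)) for w_row in weights]
            for p in patches]


def sum_patch_embeddings(specs, weights, patch_len):
    """Fuse after embedding: embed each channel, then sum token-wise."""
    embedded = [patch_embed(s, weights, patch_len) for s in specs]
    return [[sum(vals) for vals in zip(*tokens)] for tokens in zip(*embedded)]


# Two hypothetical microphone channels, each a 2x2 "mel-spectrogram".
specs = [[[1.0, 2.0], [3.0, 4.0]],
         [[3.0, 4.0], [5.0, 6.0]]]
weights = [[1.0, 0.0], [0.0, 1.0]]  # identity projection, for readability

averaged = average_channels(specs)
summed_tokens = sum_patch_embeddings(specs, weights, patch_len=2)
print(averaged)
print(summed_tokens)
```

In the first strategy the network sees a single fused spectrogram; in the second, each channel keeps its own embedding until the tokens are summed. How the paper shares or separates embedding weights across channels is not stated in the listing.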

arXiv:2301.11640 [pdf, other]

Hardware Implementation of Task-based Quantization in Multi-user Signal Recovery

Authors: Xing Zhang, Haiyang Zhang, Nimrod Glazer, Oded Cohen, Eliya Reznitskiy, Shlomi Savariego, Moshe Namer, Yonina C. Eldar

Abstract: Quantization plays a critical role in digital signal processing systems, allowing the representation of continuous amplitude signals with a finite number of bits. However, accurately representing signals requires a large number of quantization bits, which causes severe cost, power consumption, and memory burden. A promising way to address this issue is task-based quantization. By exploiting the task information for the overall system design, task-based quantization can achieve satisfying performance with low quantization costs. In this work, we apply task-based quantization to multi-user signal recovery and present a hardware prototype implementation. The prototype consists of a tailored configurable combining board, and a software-based processing and demonstration system. Through experiments, we verify that with proper design, the task-based quantization achieves a 25-fold reduction in memory by reducing from 16 receivers with 16 bits each to 2 receivers with 5 bits each, without compromising signal recovery performance.

Submitted 27 January, 2023; originally announced January 2023.
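The memory figure in the abstract above can be checked with simple arithmetic. The sketch below assumes that memory scales with the number of receivers times the bits per receiver for each stored sample, which the listing implies but does not spell out; the function name `bits_per_snapshot` is hypothetical.

```python
def bits_per_snapshot(num_receivers, bits_per_receiver):
    """Bits stored per sampling instant, assuming one quantized value
    per receiver (an assumption, not stated in the abstract)."""
    return num_receivers * bits_per_receiver


full_resolution = bits_per_snapshot(16, 16)  # 16 receivers x 16 bits
task_based = bits_per_snapshot(2, 5)         # 2 receivers x 5 bits
ratio = full_resolution / task_based

print(full_resolution, task_based, ratio)
```

Under this assumption the ratio is 256 / 10 = 25.6, consistent with the roughly 25-fold reduction quoted in the abstract.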

arXiv:2108.08333 [pdf]

doi 10.1002/mrm.29448

CEST MR fingerprinting (CEST-MRF) for Brain Tumor Quantification Using EPI Readout and Deep Learning Reconstruction

Authors: Ouri Cohen, Victoria Y. Yu, Kathryn R. Tringale, Robert J. Young, Or Perlman, Christian T. Farrar, Ricardo Otazo

Abstract: $\textbf{Purpose}$: To develop a clinical CEST MR fingerprinting (CEST-MRF) method for brain tumor quantification using EPI acquisition and deep learning reconstruction. $\textbf{Methods}$: A CEST-MRF pulse sequence originally designed for animal imaging was modified to conform to hardware limits on clinical scanners while keeping scan time $\leq$ 2 minutes. Quantitative MRF reconstruction was performed using a deep reconstruction network (DRONE) to yield the water relaxation and chemical exchange parameters. The feasibility of the 6 parameter DRONE reconstruction was tested in simulations in a digital brain phantom. A healthy subject was scanned with the CEST-MRF sequence, conventional MRF and CEST sequences for comparison. Reproducibility was assessed via test-retest experiments and the concordance correlation coefficient (CCC) calculated for white matter (WM) and grey matter (GM). The clinical utility of CEST-MRF was demonstrated in 4 patients with brain metastases in comparison to standard clinical imaging sequences. Tumors were segmented into edema, solid core and necrotic core regions and the CEST-MRF values compared to the contra-lateral side. $\textbf{Results}$: The DRONE reconstruction of the digital phantom yielded a normalized RMS error of $\leq$ 7% for all parameters. The CEST-MRF parameters were in good agreement with those from conventional MRF and CEST sequences and previous studies. The mean CCC for all 6 parameters was 0.98$\pm$0.01 in WM and 0.98$\pm$0.02 in GM. The CEST-MRF values in nearly all tumor regions were significantly different (P=0.05) from each other and the contra-lateral side. $\textbf{Conclusion}$: Combination of EPI readout and deep learning reconstruction enabled fast, accurate and reproducible CEST-MRF in brain tumors.

Submitted 11 April, 2022; v1 submitted 18 August, 2021; originally announced August 2021.

Comments: 9 figures, 1 table
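The abstract above assesses test-retest reproducibility with the concordance correlation coefficient (CCC). A minimal sketch of the standard definition (Lin's CCC, using population variances) follows. The listing does not say which exact variant the authors computed, so this is an assumption; the function name and the sample values are illustrative only.

```python
def concordance_ccc(x, y):
    """Lin's concordance correlation coefficient between paired samples.

    CCC = 2*cov(x, y) / (var(x) + var(y) + (mean(x) - mean(y))**2),
    with variances and covariance normalized by n (population form).
    The denominator is zero only if both inputs are constant and equal.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    return 2 * cov / (var_x + var_y + (mean_x - mean_y) ** 2)


# Hypothetical test and retest measurements of one parameter.
test_scan = [1.0, 2.0, 3.0]
retest_scan = [2.0, 3.0, 4.0]

print(concordance_ccc(test_scan, test_scan))
print(concordance_ccc(test_scan, retest_scan))
```

Unlike the Pearson correlation, the CCC penalizes a constant offset between the two scans: the shifted retest values above are perfectly correlated with the test values yet score well below 1.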

Showing 1–6 of 6 results for author: Cohen, O