Search | arXiv e-print repository

Ultrasound Report Generation with Multimodal Large Language Models for Standardized Texts

Authors: Peixuan Ge, Tongkun Su, Faqin Lv, Baoliang Zhao, Peng Zhang, Chi Hong Wong, Liang Yao, Yu Sun, Zenan Wang, Pak Kin Wong, Ying Hu

Abstract: Ultrasound (US) report generation is a challenging task due to the variability of US images, operator dependence, and the need for standardized text. Unlike X-ray and CT, US imaging lacks consistent datasets, making automation difficult. In this study, we propose a unified framework for multi-organ and multilingual US report generation, integrating fragment-based multilingual training and leveraging the standardized nature of US reports. By aligning modular text fragments with diverse imaging data and curating a bilingual English-Chinese dataset, the method achieves consistent and clinically accurate text generation across organ sites and languages. Fine-tuning with selective unfreezing of the vision transformer (ViT) further improves text-image alignment. Compared to the previous state-of-the-art KMVE method, our approach achieves relative gains of about 2% in BLEU scores, approximately 3% in ROUGE-L, and about 15% in CIDEr, while significantly reducing errors such as missing or incorrect content. By unifying multi-organ and multi-language report generation into a single, scalable framework, this work demonstrates strong potential for real-world clinical workflows.

Submitted 19 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

arXiv:2411.15806 [pdf]

doi 10.1109/TNNLS.2025.3554082

Broad Critic Deep Actor Reinforcement Learning for Continuous Control

Authors: Shiron Thalagala, Pak Kin Wong, Xiaozheng Wang, Tianang Sun

Abstract: In the domain of continuous control, deep reinforcement learning (DRL) demonstrates promising results. However, the dependence of DRL on deep neural networks (DNNs) results in the demand for extensive data and increased computational cost. To address this issue, a novel hybrid actor-critic reinforcement learning (RL) framework is introduced. The proposed framework integrates the broad learning system (BLS) with DNN, aiming to merge the strengths of both distinct architectural paradigms. Specifically, the critic network employs BLS for rapid value estimation via ridge regression, while the actor network retains the DNN structure to optimize policy gradients. This hybrid design is generalizable and can enhance existing actor-critic algorithms. To demonstrate its versatility, the proposed framework is integrated into three widely used actor-critic algorithms -- deep deterministic policy gradient (DDPG), soft actor-critic (SAC), and twin delayed DDPG (TD3), resulting in BLS-augmented variants. Experimental results reveal that all BLS-enhanced versions surpass their original counterparts in terms of training efficiency and accuracy. These improvements highlight the suitability of the proposed framework for real-time control scenarios, where computational efficiency and rapid adaptation are critical.

Submitted 12 April, 2025; v1 submitted 24 November, 2024; originally announced November 2024.

Comments: 11 pages, The final published version is available at: https://ieeexplore.ieee.org/document/10957827 (DOI: 10.1109/TNNLS.2025.3554082)

Journal ref: IEEE Transactions on Neural Networks and Learning Systems, pp. 1-8, 2025
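
The abstract above describes a critic that estimates values by ridge regression over broad-learning-system features. The sketch below is only a simplified illustration of that idea, not the paper's critic: the names `make_features`, `ridge_fit`, and `predict` are hypothetical, the raw inputs stand in for feature nodes, randomly weighted tanh units stand in for enhancement nodes, and the regression targets are assumed to be given.

```python
import math
import random


def solve_linear(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        # Move the row with the largest entry in this column to the pivot position.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def ridge_fit(A, y, lam):
    """Closed-form ridge regression: w = (A^T A + lam * I)^(-1) A^T y."""
    n_feat = len(A[0])
    gram = [[sum(row[i] * row[j] for row in A) + (lam if i == j else 0.0)
             for j in range(n_feat)] for i in range(n_feat)]
    moment = [sum(row[i] * t for row, t in zip(A, y)) for i in range(n_feat)]
    return solve_linear(gram, moment)


def make_features(n_inputs, n_nodes, seed=0):
    """Return a feature map: raw inputs, fixed random tanh nodes, and a bias term."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-1, 1) for _ in range(n_inputs)] for _ in range(n_nodes)]
    biases = [rng.uniform(-1, 1) for _ in range(n_nodes)]

    def features(x):
        hidden = [math.tanh(sum(w * xi for w, xi in zip(ws, x)) + b)
                  for ws, b in zip(weights, biases)]
        return list(x) + hidden + [1.0]

    return features


def predict(w, feats):
    """Linear readout of the fitted output weights."""
    return sum(wi * fi for wi, fi in zip(w, feats))
```

Because the random hidden weights stay fixed, only the output weights are fitted, and that fit is a single linear solve rather than many gradient steps. This is the property the abstract credits for rapid value estimation.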

arXiv:2010.04542 [pdf, other]

Black-Box Optimization Revisited: Improving Algorithm Selection Wizards through Massive Benchmarking

Authors: Laurent Meunier, Herilalaina Rakotoarison, Pak Kan Wong, Baptiste Roziere, Jeremy Rapin, Olivier Teytaud, Antoine Moreau, Carola Doerr

Abstract: Existing studies in black-box optimization for machine learning suffer from low generalizability, caused by a typically selective choice of problem instances used for training and testing different optimization algorithms. Among other issues, this practice promotes overfitting and poor-performing user guidelines. To address this shortcoming, we propose in this work a benchmark suite, OptimSuite, which covers a broad range of black-box optimization problems, ranging from academic benchmarks to real-world applications, from discrete over numerical to mixed-integer problems, from small to very large-scale problems, from noisy over dynamic to static problems, etc. We demonstrate the advantages of such a broad collection by deriving from it Automated Black Box Optimizer (ABBO), a general-purpose algorithm selection wizard. Using three different types of algorithm selection techniques, ABBO achieves competitive performance on all benchmark suites. It significantly outperforms previous state of the art on some of them, including YABBOB and LSGO. ABBO relies on many high-quality base components. Its excellent performance is obtained without any task-specific parametrization. The OptimSuite benchmark collection, the ABBO wizard and its base solvers have all been merged into the open-source Nevergrad platform, where they are available for reproducible research.

Submitted 23 February, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

arXiv:1103.2212 [pdf]

Stability and Queueing Analysis of IEEE 802.11 Distributed Coordination Function

Authors: Dongjie Yin, Pui King Wong, Tony T. Lee

Abstract: A widely adopted two-dimensional Markov chain model of the IEEE 802.11 DCF was introduced by Bianchi to characterize the backoff behavior of a single node under a saturated traffic condition. Using this approach, we propose a queuing model for the 802.11 DCF under a non-saturated traffic environment. The input buffer of each node is modeled as a Geo/G/1 queue, and the packet service time distribution is derived from Markov state space of 802.11 DCF with the underlying scheduling algorithm. The DCF defines two access mechanisms, namely the Basic access mechanism and the request-to-send/clear-to-send (RTS/CTS) access mechanism. Based on our model, performance analyses of both schemes are studied with probabilistic exponential backoff scheduling. We obtain the characteristic equation of network throughput and expressions of packet queueing delay. Specifically, we obtain the stable throughput and bounded delay regions with respect to the retransmission factor according to the basic queueing analysis. For both access schemes, the bounded delay region is a subset of the stable throughput region. Our results show that the RTS/CTS access mechanism is more stable and performs better than the Basic access mechanism. The analysis in this paper is verified by simulation results.

Submitted 10 January, 2012; v1 submitted 11 March, 2011; originally announced March 2011.

Comments: 26 pages, 7 figures, and 2 tables

arXiv:1008.1628 [pdf]

Performance Analysis of Markov Modulated 1-Persistent CSMA/CA Protocols with Exponential Backoff Scheduling

Authors: Pui King Wong, Dongjie Yin, Tony T. Lee

Abstract: This paper proposes a Markovian model of 1-persistent CSMA/CA protocols with K-Exponential Backoff scheduling algorithms. The input buffer of each access node is modeled as a Geo/G/1 queue, and the service time distribution of each individual head-of-line packet is derived from the Markov chain of the underlying scheduling algorithm. From the queuing model, we derive the characteristic equation of network throughput and obtain the stable throughput and bounded delay regions with respect to the retransmission factor. Our results show that the stable throughput region of the exponential backoff scheme exists even for an infinite population. Moreover, we find that the bounded delay region of exponential backoff is only a sub-set of its stable throughput region due to the large variance of the service time of input packets caused by the capture effect. All analytical results presented in this paper are verified by simulations.

Submitted 7 March, 2011; v1 submitted 10 August, 2010; originally announced August 2010.

Comments: 24 pages including 11 figures

arXiv:1005.0178 [pdf]

Analysis of Non-Persistent CSMA Protocols with Exponential Backoff Scheduling

Authors: Pui King Wong, Dongjie Yin, Tony T. Lee

Abstract: This paper studies the performance of Non-persistent CSMA/CA protocols with K-Exponential Backoff scheduling algorithms. A multi-queue single-server system is proposed to model multiple access networks. The input buffer of each access node is modeled as a Geo/G/1 queue, and the service time distribution of head-of-line packets is derived from the Markov chain of underlying scheduling algorithm. The main results include the complete analysis of the throughput and delay distribution, from which we obtained stable regions with respect to the throughput and bounded mean delay of the Geometric Retransmission and Exponential Backoff schemes. We show that the throughput stable region of Geometric Retransmission will vanish as the number of nodes n → ∞; thus, it is inherently unstable for large n. In contrast to Geometric Retransmission, the throughput stable region of Exponential Backoff can be obtained for an infinite population. We found that the bounded mean delay region of Geometric Retransmission remains the same as its throughput stable region. Besides, the variance of service time of Exponential Backoff can be unbounded due to the capture effect; thus, its bounded delay region is only a sub-set of its throughput stable region. Analytical results presented in this paper are all verified by simulation.

Submitted 6 December, 2010; v1 submitted 2 May, 2010; originally announced May 2010.
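
The abstract above contrasts Geometric Retransmission with Exponential Backoff but does not state the scheduling rule. The sketch below assumes one common formulation: a head-of-line packet that has suffered i collisions is retransmitted in a slot with probability q^i, capped at a cutoff phase K, where q is the retransmission factor. The function names are hypothetical, and treating Geometric Retransmission as the cutoff K = 1 case is likewise an assumption made here for illustration.

```python
def retransmission_probability(q, collisions, cutoff):
    """Per-slot attempt probability q**min(collisions, cutoff) (assumed rule)."""
    if not 0.0 < q < 1.0:
        raise ValueError("retransmission factor q must lie strictly between 0 and 1")
    if collisions < 0 or cutoff < 0:
        raise ValueError("collisions and cutoff must be non-negative")
    phase = min(collisions, cutoff)  # the backoff phase stops growing at the cutoff
    return q ** phase


def expected_slots_until_attempt(q, collisions, cutoff):
    """Mean of the geometric waiting time until the next attempt, in slots."""
    return 1.0 / retransmission_probability(q, collisions, cutoff)
```

Under these assumptions, a cutoff of 1 keeps the attempt probability at q however many collisions occur, whereas a large cutoff lets the expected wait grow geometrically with each collision. That growing wait is consistent with the large service-time variance the abstract mentions.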

Showing 1–6 of 6 results for author: Wong, P K