Showing 1–50 of 238 results for author: Wong, W

  1. arXiv:2507.04315  [pdf, ps, other]

    cs.AR

    HLStrans: Dataset for LLM-Driven C-to-HLS Hardware Code Synthesis

    Authors: Qingyun Zou, Nuo Chen, Yao Chen, Bingsheng He, WengFei Wong

    Abstract: High-level synthesis (HLS) enables software developers to describe and implement hardware at a higher level of abstraction by using C/C++ instead of traditional hardware description languages to automatically generate FPGA-ready designs. However, generating HLS code significantly differs from standard C/C++: it disallows certain coding idioms, relies on specialized libraries, and critically requir… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

  2. arXiv:2506.20048  [pdf, ps, other]

    stat.ML cs.LG

    A Principled Path to Fitted Distributional Evaluation

    Authors: Sungee Hong, Jiayi Wang, Zhengling Qi, Raymond Ka Wai Wong

    Abstract: In reinforcement learning, distributional off-policy evaluation (OPE) focuses on estimating the return distribution of a target policy using offline data collected under a different policy. This work focuses on extending the widely used fitted-Q evaluation -- developed for expectation-based reinforcement learning -- to the distributional OPE setting. We refer to this extension as fitted distributi… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  3. arXiv:2506.04467  [pdf]

    physics.med-ph cs.AI

    Diffusion Transformer-based Universal Dose Denoising for Pencil Beam Scanning Proton Therapy

    Authors: Yuzhen Ding, Jason Holmes, Hongying Feng, Martin Bues, Lisa A. McGee, Jean-Claude M. Rwigema, Nathan Y. Yu, Terence S. Sio, Sameer R. Keole, William W. Wong, Steven E. Schild, Jonathan B. Ashman, Sujay A. Vora, Daniel J. Ma, Samir H. Patel, Wei Liu

    Abstract: Purpose: Intensity-modulated proton therapy (IMPT) offers precise tumor coverage while sparing organs at risk (OARs) in head and neck (H&N) cancer. However, its sensitivity to anatomical changes requires frequent adaptation through online adaptive radiation therapy (oART), which depends on fast, accurate dose calculation via Monte Carlo (MC) simulations. Reducing particle count accelerates MC but… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  4. Data Mining-Based Techniques for Software Fault Localization

    Authors: Peggy Cellier, Mireille Ducassé, Sébastien Ferré, Olivier Ridoux, W. Eric Wong

    Abstract: This chapter illustrates the basic concepts of fault localization using a data mining technique. It utilizes the Trityp program to illustrate the general method. Formal concept analysis and association rules are two well-known methods for symbolic data mining. In their original inception, they both consider data in the form of an object-attribute table. … ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Journal ref: Handbook of Software Fault Localization, 1, Wiley, Chapter 7, 2023, Handbook of Software Fault Localization: Foundations and Advances, 9781119291824

  5. arXiv:2505.04603  [pdf, other]

    stat.ME cs.LG stat.CO stat.ML

    Likelihood-Free Adaptive Bayesian Inference via Nonparametric Distribution Matching

    Authors: Wenhui Sophia Lu, Wing Hung Wong

    Abstract: When the likelihood is analytically unavailable and computationally intractable, approximate Bayesian computation (ABC) has emerged as a widely used methodology for approximate posterior inference; however, it suffers from severe computational inefficiency in high-dimensional settings or under diffuse priors. To overcome these limitations, we propose Adaptive Bayesian Inference (ABI), a framework… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  6. arXiv:2504.17826  [pdf, other]

    cs.CV cs.AI

    FashionM3: Multimodal, Multitask, and Multiround Fashion Assistant based on Unified Vision-Language Model

    Authors: Kaicheng Pang, Xingxing Zou, Waikeung Wong

    Abstract: Fashion styling and personalized recommendations are pivotal in modern retail, contributing substantial economic value in the fashion industry. With the advent of vision-language models (VLM), new opportunities have emerged to enhance retailing through natural language and visual interactions. This work proposes FashionM3, a multimodal, multitask, and multiround fashion assistant, built upon a VLM… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  7. arXiv:2503.13357  [pdf, other]

    cs.DM

    The Power of Amortization on Scheduling with Explorable Uncertainty

    Authors: Alison Hsiang-Hsuan Liu, Fu-Hong Liu, Prudence W. H. Wong, Xiao-Ou Zhang

    Abstract: In this work, we study a scheduling problem with explorable uncertainty. Each job comes with an upper limit of its processing time, which could be potentially reduced by testing the job, which also takes time. The objective is to schedule all jobs on a single machine with a minimum total completion time. The challenge lies in deciding which jobs to test and the order of testing/processing jobs.… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  8. arXiv:2503.11948  [pdf]

    cs.CL cs.AI

    Integration of Explainable AI Techniques with Large Language Models for Enhanced Interpretability for Sentiment Analysis

    Authors: Thivya Thogesan, Anupiya Nugaliyadde, Kok Wai Wong

    Abstract: Interpretability remains a key difficulty in sentiment analysis with Large Language Models (LLMs), particularly in high-stakes applications where it is crucial to comprehend the rationale behind forecasts. This research addressed this by introducing a technique that applies SHAP (Shapley Additive Explanations) by breaking down LLMs into components such as embedding layer, encoder, decoder and attent… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  9. arXiv:2502.08514  [pdf, other]

    cs.CL

    Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation

    Authors: Mahnaz Koupaee, Jake W. Vincent, Saab Mansour, Igor Shalyminov, Han He, Hwanjun Song, Raphael Shu, Jianfeng He, Yi Nian, Amy Wing-mei Wong, Kyu J. Han, Hang Su

    Abstract: Faithfulness evaluators based on large language models (LLMs) are often fooled by the fluency of the text and struggle with identifying errors in the summaries. We propose an approach to summary faithfulness evaluation in which multiple LLM-based agents are assigned initial stances (regardless of what their belief might be) and forced to come up with a reason to justify the imposed belief, thus en… ▽ More

    Submitted 13 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

  10. arXiv:2501.14305  [pdf, other]

    cs.CY cs.AI

    A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education

    Authors: Calvin Yeung, Jeff Yu, King Chau Cheung, Tat Wing Wong, Chun Man Chan, Kin Chi Wong, Keisuke Fujii

    Abstract: Automated grading has become an essential tool in education technology due to its ability to efficiently assess large volumes of student work, provide consistent and unbiased evaluations, and deliver immediate feedback to enhance learning. However, current systems face significant limitations, including the need for large datasets in few-shot learning methods, a lack of personalized and actionable… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

  11. arXiv:2501.14249  [pdf, other]

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes, et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  12. arXiv:2501.10753  [pdf, ps, other]

    cs.IT eess.SP

    Pinching Antennas: Principles, Applications and Challenges

    Authors: Zheng Yang, Ning Wang, Yanshi Sun, Zhiguo Ding, Robert Schober, George K. Karagiannidis, Vincent W. S. Wong, Octavia A. Dobre

    Abstract: Flexible-antenna systems, such as fluid antennas and movable antennas, have been recognized as key enabling technologies for sixth-generation (6G) wireless networks, as they can intelligently reconfigure the effective channel gains of the users and hence significantly improve their data transmission capabilities. However, existing flexible-antenna systems have been designed to combat small-scale f… ▽ More

    Submitted 18 January, 2025; originally announced January 2025.

  13. arXiv:2501.02814  [pdf]

    physics.ao-ph cs.LG

    Analogue Forecast System for Daily Precipitation Prediction Using Autoencoder Feature Extraction: Application in Hong Kong

    Authors: Yee Chun Tsoi, Yu Ting Kwok, Ming Chun Lam, Wai Kin Wong

    Abstract: In the Hong Kong Observatory, the Analogue Forecast System (AFS) for precipitation has been providing useful reference in predicting possible daily rainfall scenarios for the next 9 days, by identifying historical cases with similar weather patterns to the latest output from the deterministic model of the European Centre for Medium-Range Weather Forecasts (ECMWF). Recent advances in machine learni… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: 16 pages, 10 figures

    Journal ref: Hong Kong Meteorological Society E-BULLETIN Vol. 28, 2 (2024)

  14. arXiv:2501.00755  [pdf, other]

    stat.ML cs.AI cs.LG stat.ME

    An AI-powered Bayesian generative modeling approach for causal inference in observational studies

    Authors: Qiao Liu, Wing Hung Wong

    Abstract: Causal inference in observational studies with high-dimensional covariates presents significant challenges. We introduce CausalBGM, an AI-powered Bayesian generative modeling approach that captures the causal relationship among covariates, treatment, and outcome variables. The core innovation of CausalBGM lies in its ability to estimate the individual treatment effect (ITE) by learning individual-… ▽ More

    Submitted 1 January, 2025; originally announced January 2025.

  15. arXiv:2412.16897  [pdf, other]

    cs.CV cs.AI

    MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context

    Authors: Shuai Lyu, Rongchen Zhang, Zeqi Ma, Fangjian Liao, Dongmei Mo, Waikeung Wong

    Abstract: Few-shot defect multi-classification (FSDMC) is an emerging trend in quality control within industrial manufacturing. However, current FSDMC research often lacks generalizability due to its focus on specific datasets. Additionally, defect classification heavily relies on contextual information within images, and existing methods fall short of effectively extracting this information. To address the… ▽ More

    Submitted 30 March, 2025; v1 submitted 22 December, 2024; originally announced December 2024.

    Comments: Accepted by AAAI 2025

  16. T-Edge: Trusted Heterogeneous Edge Computing

    Authors: Jiamin Shen, Yao Chen, Weng-Fai Wong, Ee-Chien Chang

    Abstract: Heterogeneous computing, which incorporates GPUs, NPUs, and FPGAs, is increasingly utilized to improve the efficiency of computer systems. However, this shift has given rise to significant security and privacy concerns, especially when the execution platform is remote. One way to tackle these challenges is to establish a trusted and isolated environment for remote program execution, while maintain… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: 13 pages, 6 figures

  17. arXiv:2411.17101  [pdf]

    cs.SE

    Software Fault Localization Based on Multi-objective Feature Fusion and Deep Learning

    Authors: Xiaolei Hu, Dongcheng Li, W. Eric Wong, Ya Zou

    Abstract: Software fault localization remains challenging due to limited feature diversity and low precision in traditional methods. This paper proposes a novel approach that integrates multi-objective optimization with deep learning models to improve both accuracy and efficiency in fault localization (FL). By framing feature selection as a multi-objective optimization problem (MOP), we extract and fuse thr… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

  18. arXiv:2411.14939  [pdf, other]

    cs.LG

    Many happy returns: machine learning to support platelet issuing and waste reduction in hospital blood banks

    Authors: Joseph Farrington, Samah Alimam, Martin Utley, Kezhi Li, Wai Keong Wong

    Abstract: Efforts to reduce platelet wastage in hospital blood banks have focused on ordering policies, but the predominant practice of issuing the oldest unit first may not be optimal when some units are returned unused. We propose a novel, machine learning (ML)-guided issuing policy to increase the likelihood of returned units being reissued before expiration. Our ML model trained to predict returns on 17… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

    MSC Class: 90B05 (Primary) 62P10; 68T05; 92C60 (Secondary) ACM Class: I.2.1; I.6.3; J.3; H.4.2

  19. arXiv:2411.10548  [pdf, ps, other]

    cs.LG q-bio.BM

    BioNeMo Framework: a modular, high-performance library for AI model development in drug discovery

    Authors: Peter St. John, Dejun Lin, Polina Binder, Malcolm Greaves, Vega Shah, John St. John, Adrian Lange, Patrick Hsu, Rajesh Illango, Arvind Ramanathan, Anima Anandkumar, David H Brookes, Akosua Busia, Abhishaike Mahajan, Stephen Malina, Neha Prasad, Sam Sinai, Lindsay Edwards, Thomas Gaudelet, Cristian Regep, Martin Steinegger, Burkhard Rost, Alexander Brace, Kyle Hippe, Luca Naef, et al. (68 additional authors not shown)

    Abstract: Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational bio… ▽ More

    Submitted 12 June, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

  20. arXiv:2411.09460  [pdf, other]

    cs.IT

    Analysis Methodology for Age of Information under Sequence Based Scheduling

    Authors: Fang Liu, Wing Shing Wong, Yuan-Hsun Lo, Yijin Zhang, Chung Shue Chen

    Abstract: We focus on the Age of Information (AoI) performance in a system where each user generates packets periodically to send to a common access point (AP) for status updating. To avoid heavy overhead, we assume that channel sensing, feedback information from the AP, and time synchronization are not available in the system. We adopt a multi-access scheme called the sequence scheme, where each user is as… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

  21. arXiv:2411.05797  [pdf, other]

    cs.NE stat.CO

    Metaheuristics is All You Need

    Authors: Eliuvish Cuicizion, Haowen Xu, Weng Kee Wong

    Abstract: Optimization plays an important role in tackling public health problems. Animal instincts can be used effectively to solve complex public health management issues by providing optimal or approximately optimal solutions to complicated optimization problems common in public health. BAT algorithm is an exemplary member of a class of nature-inspired metaheuristic optimization algorithms and designed t… ▽ More

    Submitted 21 March, 2025; v1 submitted 25 October, 2024; originally announced November 2024.

    Comments: 25 pages, many figures

  22. arXiv:2411.02453  [pdf, other]

    gr-qc cs.LG

    Super-Resolution without High-Resolution Labels for Black Hole Simulations

    Authors: Thomas Helfer, Thomas D. P. Edwards, Jessica Dafflon, Kaze W. K. Wong, Matthew Lyle Olson

    Abstract: Generating high-resolution simulations is key for advancing our understanding of one of the universe's most violent events: Black Hole mergers. However, generating Black Hole simulations is limited by prohibitive computational costs and scalability issues, reducing the simulation's fidelity and resolution achievable within reasonable time frames and resources. In this work, we introduce a novel me… ▽ More

    Submitted 3 November, 2024; originally announced November 2024.

    Comments: Code available at https://github.com/ThomasHelfer/TorchGRTL and data at https://huggingface.co/datasets/thelfer/BinaryBlackHole

  23. arXiv:2411.01033  [pdf]

    cs.SE

    Many-Objective Search-Based Coverage-Guided Automatic Test Generation for Deep Neural Networks

    Authors: Dongcheng Li, W. Eric Wong, Hu Liu, Man Zhao

    Abstract: To ensure the reliability of DNN systems and address the test generation problem for neural networks, this paper proposes a fuzzing test generation technique based on many-objective optimization algorithms. Traditional fuzz testing employs random search, which lowers testing efficiency and tends to generate numerous invalid test cases. By utilizing many-objective optimization techniques, effec… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

  24. arXiv:2410.23159  [pdf, other]

    cs.CV cs.AI cs.LG

    Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting

    Authors: Chiu-Wai Yan, Shi Quan Foo, Van Hoan Trinh, Dit-Yan Yeung, Ka-Hing Wong, Wai-Kin Wong

    Abstract: Deep learning approaches have been widely adopted for precipitation nowcasting in recent years. Previous studies mainly focus on proposing new model architectures to improve pixel-wise metrics. However, they frequently result in blurry predictions which provide limited utility to forecasting operations. In this work, we propose a new Fourier Amplitude and Correlation Loss (FACL) which consists of… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS 2024. Camera-ready submission

  25. arXiv:2410.21076  [pdf, other]

    astro-ph.IM astro-ph.HE cs.LG gr-qc

    Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows

    Authors: Alicja Polanska, Thibeau Wouters, Peter T. H. Pang, Kaze K. W. Wong, Jason D. McEwen

    Abstract: We present an accelerated pipeline, based on high-performance computing techniques and normalizing flows, for joint Bayesian parameter estimation and model selection and demonstrate its efficiency in gravitational wave astrophysics. We integrate the Jim inference toolkit, a normalizing flow-enhanced Markov chain Monte Carlo (MCMC) sampler, with the learned harmonic mean estimator. Our Bayesian evi… ▽ More

    Submitted 31 October, 2024; v1 submitted 28 October, 2024; originally announced October 2024.

    Comments: accepted to NeurIPS 2024 workshop on Machine Learning and the Physical Sciences

  26. Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small

    Authors: Zhehui Wang, Tao Luo, Cheng Liu, Weichen Liu, Rick Siow Mong Goh, Weng-Fai Wong

    Abstract: Large language models (LLMs) have garnered substantial attention due to their promising applications in diverse domains. Nevertheless, the increasing size of LLMs comes with a significant surge in the computational requirements for training and deployment. Memristor crossbars have emerged as a promising solution, which demonstrated a small footprint and remarkably high energy efficiency in compute… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (2024 early access)

  27. arXiv:2410.15428  [pdf, other]

    cs.IT

    Multiset Combinatorial Gray Codes with Application to Proximity Sensor Networks

    Authors: Chung Shue Chen, Wing Shing Wong, Yuan-Hsun Lo, Tsai-Lien Wong

    Abstract: We investigate coding schemes that map source symbols into multisets of an alphabet set. Such a formulation of source coding is an alternative approach to the traditional framework and is inspired by an object tracking problem over proximity sensor networks. We define a multiset combinatorial Gray code as a multiset code with fixed multiset cardinality that possesses combinatorial Gray co… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: 30 pages, 4 figures

  28. arXiv:2410.10046  [pdf]

    cs.SE

    A Hybrid Sampling and Multi-Objective Optimization Approach for Enhanced Software Defect Prediction

    Authors: Jie Zhang, Dongcheng Li, W. Eric Wong, Shengrong Wang

    Abstract: Accurate early prediction of software defects is essential to maintain software quality and reduce maintenance costs. However, the field of software defect prediction (SDP) faces challenges such as class imbalances, high-dimensional feature spaces, and suboptimal prediction accuracy. To mitigate these challenges, this paper introduces a novel SDP framework that integrates hybrid sampling technique… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

  29. arXiv:2410.00282  [pdf]

    cs.SE

    Smart Contract Vulnerability Detection based on Static Analysis and Multi-Objective Search

    Authors: Dongcheng Li, W. Eric Wong, Xiaodan Wang, Sean Pan, Liang-Seng Koh

    Abstract: This paper introduces a method for detecting vulnerabilities in smart contracts using static analysis and a multi-objective optimization algorithm. We focus on four types of vulnerabilities: reentrancy, call stack overflow, integer overflow, and timestamp dependencies. Initially, smart contracts are compiled into an abstract syntax tree to analyze relationships between contracts and functions, inc… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

  30. arXiv:2409.15298  [pdf, other]

    cs.NE cs.CL cs.LG

    Sorbet: A Neuromorphic Hardware-Compatible Transformer-Based Spiking Language Model

    Authors: Kaiwen Tang, Zhanglu Yan, Weng-Fai Wong

    Abstract: For reasons such as privacy, there are use cases for language models at the edge. This has given rise to small language models (SLMs) targeted for deployment in resource-constrained devices where energy efficiency is a significant concern. Spiking neural networks (SNNs) offer a promising solution due to their energy efficiency, and there are already works on realizing transformer-based models on S… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  31. arXiv:2409.13902  [pdf]

    cs.CL cs.AI

    Enhancing Large Language Models with Domain-specific Retrieval Augment Generation: A Case Study on Long-form Consumer Health Question Answering in Ophthalmology

    Authors: Aidan Gilson, Xuguang Ai, Thilaka Arunachalam, Ziyou Chen, Ki Xiong Cheong, Amisha Dave, Cameron Duic, Mercy Kibe, Annette Kaminaka, Minali Prasad, Fares Siddig, Maxwell Singer, Wendy Wong, Qiao Jin, Tiarnan D. L. Keenan, Xia Hu, Emily Y. Chew, Zhiyong Lu, Hua Xu, Ron A. Adelman, Yih-Chung Tham, Qingyu Chen

    Abstract: Despite the potential of Large Language Models (LLMs) in medicine, they may generate responses lacking supporting evidence or based on hallucinated evidence. While Retrieval Augment Generation (RAG) is popular to address this issue, few studies implemented and evaluated RAG in downstream domain-specific applications. We developed a RAG pipeline with 70,000 ophthalmology-specific documents that ret… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  32. arXiv:2409.08290  [pdf, ps, other]

    cs.NE cs.AI cs.LG

    Reconsidering the energy efficiency of spiking neural networks

    Authors: Zhanglu Yan, Zhenyu Bai, Weng-Fai Wong

    Abstract: Spiking Neural Networks (SNNs) promise higher energy efficiency over conventional Quantized Artificial Neural Networks (QNNs) due to their event-driven, spike-based computation. However, prevailing energy evaluations often oversimplify, focusing on computational aspects while neglecting critical overheads like comprehensive data movement and memory access. Such simplifications can lead to misleadi… ▽ More

    Submitted 3 July, 2025; v1 submitted 29 August, 2024; originally announced September 2024.

  33. arXiv:2409.07931  [pdf, other]

    cs.CV

    Task-Augmented Cross-View Imputation Network for Partial Multi-View Incomplete Multi-Label Classification

    Authors: Lian Zhao, Jie Wen, Xiaohuan Lu, Wai Keung Wong, Jiang Long, Wulin Xie

    Abstract: In real-world scenarios, multi-view multi-label learning often encounters the challenge of incomplete training data due to limitations in data collection and unreliable annotation processes. The absence of multi-view features impairs the comprehensive understanding of samples, omitting crucial details essential for classification. To address this issue, we present a task-augmented cross-view imput… ▽ More

    Submitted 24 March, 2025; v1 submitted 12 September, 2024; originally announced September 2024.

  34. arXiv:2407.01926  [pdf]

    physics.med-ph cs.CV

    Chemical Shift Encoding based Double Bonds Quantification in Triglycerides using Deep Image Prior

    Authors: Chaoxing Huang, Ziqiang Yu, Zijian Gao, Qiuyi Shen, Queenie Chan, Vincent Wai-Sun Wong, Winnie Chiu-Wing Chu, Weitian Chen

    Abstract: Fatty acids can potentially serve as biomarkers for evaluating metabolic disorders and inflammatory conditions, and quantifying the double bonds is the key to revealing fatty acid information. This study presents an assessment of a deep learning approach utilizing Deep Image Prior (DIP) for the quantification of double bonds and methylene-interrupted double bonds of triglyceride derived from chemical… ▽ More

    Submitted 29 October, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: This technical note is accepted by Quantitative Imaging in Medicine and Surgery as a brief report

  35. arXiv:2406.09317  [pdf, other]

    eess.IV cs.CV

    Enhancing Diagnostic Accuracy in Rare and Common Fundus Diseases with a Knowledge-Rich Vision-Language Model

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham, et al. (24 additional authors not shown)

    Abstract: Previous foundation models for fundus images were pre-trained with limited disease categories and knowledge base. Here we introduce a knowledge-rich vision-language model (RetiZero) that leverages knowledge from more than 400 fundus diseases. For RetiZero's pretraining, we compiled 341,896 fundus images paired with texts, sourced from public datasets, ophthalmic literature, and online resources, e… ▽ More

    Submitted 10 April, 2025; v1 submitted 13 June, 2024; originally announced June 2024.

  36. arXiv:2406.07574  [pdf, other]

    cs.SI cs.LG

    Biharmonic Distance of Graphs and its Higher-Order Variants: Theoretical Properties with Applications to Centrality and Clustering

    Authors: Mitchell Black, Lucy Lin, Amir Nayyeri, Weng-Keen Wong

    Abstract: Effective resistance is a distance between vertices of a graph that is both theoretically interesting and useful in applications. We study a variant of effective resistance called the biharmonic distance. While the effective resistance measures how well-connected two vertices are, we prove several theoretical results supporting the idea that the biharmonic distance measures how important an edge i… ▽ More

    Submitted 17 February, 2025; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024. In v2, we correct an error in the definition of electrical flows and, accordingly, the proofs of Lemma 2.2 and Theorem 4.1

  37. arXiv:2406.06543  [pdf, other]

    cs.AR cs.LG cs.NE eess.SP

    SparrowSNN: A Hardware/software Co-design for Energy Efficient ECG Classification

    Authors: Zhanglu Yan, Zhenyu Bai, Tulika Mitra, Weng-Fai Wong

    Abstract: Heart disease is one of the leading causes of death worldwide. Given its high risk and often asymptomatic nature, real-time continuous monitoring is essential. Unlike traditional artificial neural networks (ANNs), spiking neural networks (SNNs) are well-known for their energy efficiency, making them ideal for wearable devices and energy-constrained edge computing platforms. However, current energy… ▽ More

    Submitted 6 May, 2024; originally announced June 2024.

  38. arXiv:2405.17940  [pdf, other]

    cs.RO cs.AI

    World Models for General Surgical Grasping

    Authors: Hongbin Lin, Bin Li, Chun Wai Wong, Juan Rojas, Xiangyu Chu, Kwok Wai Samuel Au

    Abstract: Intelligent vision control systems for surgical robots should adapt to unknown and diverse objects while being robust to system disturbances. Previous methods did not meet these requirements due to mainly relying on pose estimation and feature tracking. We propose a world-model-based deep reinforcement learning framework "Grasp Anything for Surgery" (GAS), that learns a pixel-level visuomotor poli… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Journal ref: Robotics: Science and Systems 2024

  39. arXiv:2405.12386  [pdf, other]

    stat.ML cs.LG stat.AP stat.CO

    Particle swarm optimization with Applications to Maximum Likelihood Estimation and Penalized Negative Binomial Regression

    Authors: Sisi Shao, Junhyung Park, Weng Kee Wong

    Abstract: General purpose optimization routines such as nlminb, optim (R) or nlmixed (SAS) are frequently used to estimate model parameters in nonstandard distributions. This paper presents Particle Swarm Optimization (PSO), as an alternative to many of the current algorithms used in statistics. We find that PSO can not only reproduce the same results as the above routines, it can also produce results that… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  40. arXiv:2405.04206  [pdf, other]

    cs.AR cs.AI cs.LG

    NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN Accelerator

    Authors: Mohit Upadhyay, Rohan Juneja, Weng-Fai Wong, Li-Shiuan Peh

    Abstract: Attention mechanisms are becoming increasingly popular, being used in neural network models in multiple domains such as natural language processing (NLP) and vision applications, especially at the edge. However, attention layers are difficult to map onto existing neuro accelerators since they have a much higher density of non-linear operations, which lead to inefficient utilization of today's vect… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 6 pages, 8 figures

    ACM Class: B.2.4

  41. Table-Lookup MAC: Scalable Processing of Quantised Neural Networks in FPGA Soft Logic

    Authors: Daniel Gerlinghoff, Benjamin Chen Ming Choong, Rick Siow Mong Goh, Weng-Fai Wong, Tao Luo

    Abstract: Recent advancements in neural network quantisation have yielded remarkable outcomes, with three-bit networks reaching state-of-the-art full-precision accuracy in complex tasks. These achievements present valuable opportunities for accelerating neural networks by computing in reduced precision. Implementing it on FPGAs can take advantage of bit-level reconfigurability, which is not available on con… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  42. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  43. arXiv:2403.04036  [pdf, other]

    cs.LG cs.AI eess.SP

    Unsupervised Contrastive Learning for Robust RF Device Fingerprinting Under Time-Domain Shift

    Authors: Jun Chen, Weng-Keen Wong, Bechir Hamdaoui

    Abstract: Radio Frequency (RF) device fingerprinting has been recognized as a potential technology for enabling automated wireless device identification and classification. However, it faces a key challenge due to the domain shift that could arise from variations in the channel conditions and environmental settings, potentially degrading the accuracy of RF-based device classification when testing and traini…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 6 pages, 5 figures, accepted by 2024 IEEE International Conference on Communications (ICC)

  44. arXiv:2403.00192  [pdf, other]

    cs.IT

    Block-MDS QC-LDPC Codes for Information Reconciliation in Key Distribution

    Authors: Lev Tauz, Debarnab Mitra, Jayanth Shreekumar, Murat Can Sarihan, Chee Wei Wong, Lara Dolecek

    Abstract: Quantum key distribution (QKD) is a popular protocol that provides information-theoretically secure keys to multiple parties. Two important post-processing steps of QKD are 1) the information reconciliation (IR) step, where parties reconcile mismatches in generated keys through classical communication, and 2) the privacy amplification (PA) step, where parties distill their common key into a new se…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 7 pages, 1 figure, submitted to the International Symposium on Information Theory (ISIT) 2024

  45. arXiv:2402.15525  [pdf, other]

    cs.CL cs.CY

    Detecting misinformation through Framing Theory: the Frame Element-based Model

    Authors: Guan Wang, Rebecca Frederick, Jinglong Duan, William Wong, Verica Rupar, Weihua Li, Quan Bai

    Abstract: In this paper, we delve into the rapidly evolving challenge of misinformation detection, with a specific focus on the nuanced manipulation of narrative frames - an under-explored area within the AI community. The potential for Generative AI models to generate misleading narratives underscores the urgency of this problem. Drawing from communication and framing theories, we posit that the presentati…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 17 pages, 9 figures, 7 tables

  46. arXiv:2402.13297  [pdf, other]

    q-bio.QM cs.AI

    Integrating Deep Learning and Synthetic Biology: A Co-Design Approach for Enhancing Gene Expression via N-terminal Coding Sequences

    Authors: Zhanglu Yan, Weiran Chu, Yuhua Sheng, Kaiwen Tang, Shida Wang, Yanfeng Liu, Weng-Fai Wong

    Abstract: N-terminal coding sequence (NCS) influences gene expression by impacting the translation initiation rate. The NCS optimization problem is to find an NCS that maximizes gene expression. The problem is important in genetic engineering. However, current methods for NCS optimization such as rational design and statistics-guided approaches are labor-intensive and yield only relatively small improvements. T…

    Submitted 20 February, 2024; originally announced February 2024.

  47. arXiv:2402.13249  [pdf, other]

    cs.CL cs.AI

    TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

    Authors: Liyan Tang, Igor Shalyminov, Amy Wing-mei Wong, Jon Burnsky, Jake W. Vincent, Yu'an Yang, Siffi Singh, Song Feng, Hwanjun Song, Hang Su, Lijia Sun, Yi Zhang, Saab Mansour, Kathleen McKeown

    Abstract: Single-document news summarization has seen substantial progress on faithfulness in recent years, driven by research on the evaluation of factual consistency, or hallucinations. We ask whether these advances carry over to other text summarization domains. We propose a new evaluation benchmark on topic-focused dialogue summarization, generated by LLMs of varying sizes. We provide binary sentence-le…

    Submitted 31 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: NAACL 2024; Linguistic annotations available at https://github.com/amazon-science/tofueval

  48. arXiv:2402.10456  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    Efficient Generative Modeling via Penalized Optimal Transport Network

    Authors: Wenhui Sophia Lu, Chenyang Zhong, Wing Hung Wong

    Abstract: The generation of synthetic data with distributions that faithfully emulate the underlying data-generating mechanism holds paramount significance. Wasserstein Generative Adversarial Networks (WGANs) have emerged as a prominent tool for this task; however, due to the delicate equilibrium of the minimax formulation and the instability of Wasserstein distance in high dimensions, WGAN often manifests…

    Submitted 7 January, 2025; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 54 pages, 12 figures

  49. arXiv:2402.01900  [pdf, other]

    stat.ML cs.LG

    Distributional Off-policy Evaluation with Bellman Residual Minimization

    Authors: Sungee Hong, Zhengling Qi, Raymond K. W. Wong

    Abstract: We study distributional off-policy evaluation (OPE), whose goal is to learn the distribution of the return for a target policy using offline data generated by a different policy. The theoretical foundation of much existing work relies on supremum-extended statistical distances such as the supremum-Wasserstein distance, which are hard to estimate. In contrast, we study the more manageable ex…

    Submitted 12 March, 2025; v1 submitted 2 February, 2024; originally announced February 2024.

  50. arXiv:2401.16623  [pdf, other]

    cs.DS cs.IT

    Towards Optimal Grammars for RNA Structures

    Authors: Evarista Onokpasa, Sebastian Wild, Prudence W. H. Wong

    Abstract: In past work (Onokpasa, Wild, Wong, DCC 2023), we showed that (a) for joint compression of RNA sequence and structure, stochastic context-free grammars are the best known compressors and (b) grammars which have better compression ability also show better performance in ab initio structure prediction. Previous grammars were manually curated by human experts. In this work, we develop a framewor…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: to be presented at DCC 2024