Search | arXiv e-print repository

arXiv:2506.22313 [pdf, ps, other]

Manifold-Constrained Gaussian Processes for Inference of Mixed-effects Ordinary Differential Equations with Application to Pharmacokinetics

Authors: Yuxuan Zhao, Samuel W. K. Wong

Abstract: Pharmacokinetic modeling using ordinary differential equations (ODEs) has an important role in dose optimization studies, where dosing must balance sustained therapeutic efficacy with the risk of adverse side effects. Such ODE models characterize drug plasma concentration over time and allow pharmacokinetic parameters to be inferred, such as drug absorption and elimination rates. For time-course s… ▽ More Pharmacokinetic modeling using ordinary differential equations (ODEs) has an important role in dose optimization studies, where dosing must balance sustained therapeutic efficacy with the risk of adverse side effects. Such ODE models characterize drug plasma concentration over time and allow pharmacokinetic parameters to be inferred, such as drug absorption and elimination rates. For time-course studies involving treatment groups with multiple subjects, mixed-effects ODE models are commonly used. However, existing methods tend to lack uncertainty quantification on a subject-level, for key measures such as peak or trough concentration and for making predictions of drug concentration. To address such limitations, we propose an extension of manifold-constrained Gaussian processes for inference of general mixed-effects ODE models within a Bayesian statistical framework. We evaluate our method on simulated examples, demonstrating its ability to provide fast and accurate inference for parameters and trajectories using nested optimization. To illustrate the practical efficacy of the proposed method, we provide a real data analysis of a pharmacokinetic model used for an HIV combination therapy study. △ Less

Submitted 27 June, 2025; originally announced June 2025.

Comments: 34 pages, 4 figures

arXiv:2506.20048 [pdf, ps, other]

A Principled Path to Fitted Distributional Evaluation

Authors: Sungee Hong, Jiayi Wang, Zhengling Qi, Raymond Ka Wai Wong

Abstract: In reinforcement learning, distributional off-policy evaluation (OPE) focuses on estimating the return distribution of a target policy using offline data collected under a different policy. This work focuses on extending the widely used fitted-Q evaluation -- developed for expectation-based reinforcement learning -- to the distributional OPE setting. We refer to this extension as fitted distributi… ▽ More In reinforcement learning, distributional off-policy evaluation (OPE) focuses on estimating the return distribution of a target policy using offline data collected under a different policy. This work focuses on extending the widely used fitted-Q evaluation -- developed for expectation-based reinforcement learning -- to the distributional OPE setting. We refer to this extension as fitted distributional evaluation (FDE). While only a few related approaches exist, there remains no unified framework for designing FDE methods. To fill this gap, we present a set of guiding principles for constructing theoretically grounded FDE methods. Building on these principles, we develop several new FDE methods with convergence analysis and provide theoretical justification for existing methods, even in non-tabular environments. Extensive experiments, including simulations on linear quadratic regulators and Atari games, demonstrate the superior performance of the FDE methods. △ Less

Submitted 24 June, 2025; originally announced June 2025.

arXiv:2506.17687 [pdf]

Eigenmode-Guided Amplification via Spatiotemporal Active Acoustic Metamaterials

Authors: Wai Chun Wong, Greggory Chaplain, Jensen Li

Abstract: We present a spatiotemporal gain-loss framework for selective acoustic mode amplification illustrated using coupled Helmholtz resonators. In our scheme, the gain or loss in each resonator is determined by the amplitude in adjacent cavities, forming a non-Hermitian yet energy-conserving system. Our scheme enables the system to collapse toward the eigenstate of an effective linear Hamiltonian with t… ▽ More We present a spatiotemporal gain-loss framework for selective acoustic mode amplification illustrated using coupled Helmholtz resonators. In our scheme, the gain or loss in each resonator is determined by the amplitude in adjacent cavities, forming a non-Hermitian yet energy-conserving system. Our scheme enables the system to collapse toward the eigenstate of an effective linear Hamiltonian with the eigenvalue of largest imaginary part, serving as a fixed-point attractor. In dimer systems, we identify exceptional points that separate oscillatory and convergent regimes, allowing controllable switching between mode collapsing and Rabi-type oscillation. By modulating the gain-loss profile over time, we achieve programmable transitions between eigenmodes, with strategically introduced symmetry-breaking perturbations accelerating convergence. Full-wave simulations validate our approach and highlight its potential for acoustic switching, signal processing, and analog computation. Our results establish a new paradigm for sustainable wave control by bridging non-Hermitian physics and time-varying physics in resonant metamaterials. △ Less

Submitted 21 June, 2025; originally announced June 2025.

Comments: 13 pages, 4 figures

arXiv:2506.11859 [pdf]

Bistable random momentum transfer in a linear on-chip resonator

Authors: Tingyi Gu, Lorry Chang, Jiagui Wu, Lijun Wu, Hwaseob Lee, Young-Kai Chen, Masudur Rahim, Po Dong, Chee Wei Wong

Abstract: Optical switches and bifurcation rely on the nonlinear response of materials. Here, we demonstrate linear temporal bifurcation responses in a passive multimode microresonator, with strongly coupled chaotic and whispering gallery modes or WGMs. In microdisks, the chaotic modes exhibit broadband transfer within the deformed cavities, but their transient response is less explored and yields a random… ▽ More Optical switches and bifurcation rely on the nonlinear response of materials. Here, we demonstrate linear temporal bifurcation responses in a passive multimode microresonator, with strongly coupled chaotic and whispering gallery modes or WGMs. In microdisks, the chaotic modes exhibit broadband transfer within the deformed cavities, but their transient response is less explored and yields a random output of the analog signal distributed uniformly from 0 to 1. Here, we build chaotic states by perturbing the multi-mode microring resonators with densely packed silicon nanocrystals on the waveguide surface. In vivo measurements reveal random and digitized output that ONLY populates around 0 and 1 intensity levels. The bus waveguide mode couples firstly to chaotic modes, then either dissipates or tunnels into stable WGMs. This binary pathway generates high-contrast, digitized outputs. The fully passive device enables real-time conversion of periodic clock signals into binary outputs with contrasts exceeding 12.3 dB, data rates of up to 100 Mbits per second, and 20dB dynamic range. △ Less

Submitted 13 June, 2025; originally announced June 2025.

arXiv:2506.09722 [pdf, ps, other]

Fully Bayesian Sequential Design for Mean Response Surface Prediction of Heteroscedastic Stochastic Simulations

Authors: Yuying Huang, Samuel W. K. Wong

Abstract: We present a fully Bayesian sequential strategy for predicting the mean response surface of heteroscedastic stochastic simulation functions. Leveraging dual Gaussian processes as the surrogate model and a criterion based on empirical expected integrated mean-square prediction error, our approach sequentially selects informative design points while fully accounting for parameter uncertainty. Sequen… ▽ More We present a fully Bayesian sequential strategy for predicting the mean response surface of heteroscedastic stochastic simulation functions. Leveraging dual Gaussian processes as the surrogate model and a criterion based on empirical expected integrated mean-square prediction error, our approach sequentially selects informative design points while fully accounting for parameter uncertainty. Sequential importance sampling is employed to efficiently update the posterior distribution of the parameters. Our strategy is tailored for expensive simulation functions, where achieving robust predictive accuracy under a limited budget is critical. We illustrate its potential advantages compared to existing approaches through synthetic examples. We then implement the proposed strategy on a real motivating application in seismic design of wood-frame podium buildings. △ Less

Submitted 11 June, 2025; originally announced June 2025.

Comments: 37 pages, 8 figures

arXiv:2506.04467 [pdf]

Diffusion Transformer-based Universal Dose Denoising for Pencil Beam Scanning Proton Therapy

Authors: Yuzhen Ding, Jason Holmes, Hongying Feng, Martin Bues, Lisa A. McGee, Jean-Claude M. Rwigema, Nathan Y. Yu, Terence S. Sio, Sameer R. Keole, William W. Wong, Steven E. Schild, Jonathan B. Ashman, Sujay A. Vora, Daniel J. Ma, Samir H. Patel, Wei Liu

Abstract: Purpose: Intensity-modulated proton therapy (IMPT) offers precise tumor coverage while sparing organs at risk (OARs) in head and neck (H&N) cancer. However, its sensitivity to anatomical changes requires frequent adaptation through online adaptive radiation therapy (oART), which depends on fast, accurate dose calculation via Monte Carlo (MC) simulations. Reducing particle count accelerates MC but… ▽ More Purpose: Intensity-modulated proton therapy (IMPT) offers precise tumor coverage while sparing organs at risk (OARs) in head and neck (H&N) cancer. However, its sensitivity to anatomical changes requires frequent adaptation through online adaptive radiation therapy (oART), which depends on fast, accurate dose calculation via Monte Carlo (MC) simulations. Reducing particle count accelerates MC but degrades accuracy. To address this, denoising low-statistics MC dose maps is proposed to enable fast, high-quality dose generation. Methods: We developed a diffusion transformer-based denoising framework. IMPT plans and 3D CT images from 80 H&N patients were used to generate noisy and high-statistics dose maps using MCsquare (1 min and 10 min per plan, respectively). Data were standardized into uniform chunks with zero-padding, normalized, and transformed into quasi-Gaussian distributions. Testing was done on 10 H&N, 10 lung, 10 breast, and 10 prostate cancer cases, preprocessed identically. The model was trained with noisy dose maps and CT images as input and high-statistics dose maps as ground truth, using a combined loss of mean square error (MSE), residual loss, and regional MAE (focusing on top/bottom 10% dose voxels). Performance was assessed via MAE, 3D Gamma passing rate, and DVH indices. Results: The model achieved MAEs of 0.195 (H&N), 0.120 (lung), 0.172 (breast), and 0.376 Gy[RBE] (prostate). 3D Gamma passing rates exceeded 92% (3%/2mm) across all sites. DVH indices for clinical target volumes (CTVs) and OARs closely matched the ground truth. Conclusion: A diffusion transformer-based denoising framework was developed and, though trained only on H&N data, generalizes well across multiple disease sites. △ Less

Submitted 4 June, 2025; originally announced June 2025.

arXiv:2505.21853 [pdf, ps, other]

Quantitative Macromolecular Proton Fraction Imaging using Pulsed Spin-Lock

Authors: Qianxue Shan, Ziqiang Yu, Baiyan Jiang, Jian Hou, Qiuyi Shen, Winnie CW Chu, Vincent WS Wong, Weitian Chen

Abstract: Purpose: Recent studies have shown that spin-lock MRI can simplify quantitative magnetization transfer (MT) by eliminating its dependency on water pool parameters, removing the need for a T1 map in macromolecular proton fraction (MPF) quantification. However, its application is often limited by the requirement for long radiofrequency (RF) pulse durations, which are constrained by RF hardware capab… ▽ More Purpose: Recent studies have shown that spin-lock MRI can simplify quantitative magnetization transfer (MT) by eliminating its dependency on water pool parameters, removing the need for a T1 map in macromolecular proton fraction (MPF) quantification. However, its application is often limited by the requirement for long radiofrequency (RF) pulse durations, which are constrained by RF hardware capabilities despite remaining within specific absorption rate (SAR) safety limits. Methods: To address this challenge, we propose a novel method, MPF mapping using pulsed spin-lock (MPF-PSL). MPF-PSL employs a pulsed spin-lock train with intermittent free precession periods, enabling extended total spin-lock durations without exceeding hardware and specific absorption rate limits. A comprehensive analytical framework was developed to model the magnetization dynamics of the two-pool MT system under pulsed spin-lock, demonstrating that MPF-PSL achieves MT-specific quantification while minimizing confounding effects from the water pool. The proposed method is validated with Bloch-McConnell simulations, phantoms, and in vivo studies at 3T. Results: Both Bloch-McConnell simulations and phantom validation demonstrated that MPF-PSL exhibits robust insensitivity to water pool parameters while enabling high-SNR MPF quantification. In vivo validation studies confirmed the method's clinical utility in detecting collagen deposition in patients with liver fibrosis. Conclusion: MPF-PSL presents a practical solution for quantitative MT imaging, with strong potential for clinical applications. △ Less

Submitted 27 May, 2025; originally announced May 2025.

Comments: 15 pages, 10 figures; Qianxue Shan and Ziqiang Yu contributed equally to this work

arXiv:2505.18216 [pdf]

doi 10.1002/9781119880929.ch7

Data Mining-Based Techniques for Software Fault Localization

Authors: Peggy Cellier, Mireille Ducassé, Sébastien Ferré, Olivier Ridoux, W. Eric Wong

Abstract: This chapter illustrates the basic concepts of fault localization using a data mining technique. It utilizes the Trityp program to illustrate the general method. Formal concept analysis and association rule are two well-known methods for symbolic data mining. In their original inception, they both consider data in the form of an object-attribute table. In their original inception, they both consid… ▽ More This chapter illustrates the basic concepts of fault localization using a data mining technique. It utilizes the Trityp program to illustrate the general method. Formal concept analysis and association rule are two well-known methods for symbolic data mining. In their original inception, they both consider data in the form of an object-attribute table. In their original inception, they both consider data in the form of an object-attribute table. The chapter considers a debugging process in which a program is tested against different test cases. Two attributes, PASS and FAIL, represent the issue of the test case. The chapter extends the analysis of data mining for fault localization for the multiple fault situations. It addresses how data mining can be further applied to fault localization for GUI components. Unlike traditional software, GUI test cases are usually event sequences, and each individual event has a unique corresponding event handler. △ Less

Submitted 23 May, 2025; originally announced May 2025.

Journal ref: Handbook of Software Fault Localization, 1, Wiley, Chapitre 7, 2023, Handbook of Software Fault Localization: Foundations and Advances, 9781119291824

arXiv:2505.08973 [pdf, other]

Dynamic restrengthening and stress heterogeneity explain megathrust earthquake complexity

Authors: Jeremy Wing Ching Wong, Alice-Agnes Gabriel, Wenyuan Fan

Abstract: Megathrusts host Earth's largest earthquakes. Understanding the physical conditions controlling their rupture dynamics is critical for assessing seismic and tsunami hazards. These earthquakes often display complex rupture dynamics, exemplified by the 2011 Tohoku-Oki earthquake, which exhibited multiple rupture episodes, depth-dependent seismic radiation, and substantial tsunamigenic slip near the… ▽ More Megathrusts host Earth's largest earthquakes. Understanding the physical conditions controlling their rupture dynamics is critical for assessing seismic and tsunami hazards. These earthquakes often display complex rupture dynamics, exemplified by the 2011 Tohoku-Oki earthquake, which exhibited multiple rupture episodes, depth-dependent seismic radiation, and substantial tsunamigenic slip near the trench. However, whether such complexity arises from preexisting physical conditions remains uncertain. Here, we demonstrate that the observed rupture complexity of the Tohoku-Oki earthquake can spontaneously and self-consistently emerge, driven by rapid coseismic frictional restrengthening and data-informed initial stress heterogeneity, without prescribing frictional asperities. We use an ensemble of 3D dynamic rupture simulations to identify that mixed downdip pulse-like and updip crack-like rupture are driven by dynamic stress redistribution with episodic rupture reactivation. By featuring low fault strength compared to its dynamic stress drop, a preferred model can consistently reproduce the observed complex depth-dependent propagation speeds, multiple rupture fronts as imaged by back-projection, and large tsunamigenic slip at the trench. Our findings demonstrate that preexisting stress heterogeneity conjointly with dynamic frictional weakening and restrengthening drives seemingly unexpected megathrust rupture complexity, highlighting the need to include dynamic effects into physics-based seismic and tsunami hazard assessments of future earthquakes. △ Less

Submitted 13 May, 2025; originally announced May 2025.

arXiv:2505.04603 [pdf, other]

Likelihood-Free Adaptive Bayesian Inference via Nonparametric Distribution Matching

Authors: Wenhui Sophia Lu, Wing Hung Wong

Abstract: When the likelihood is analytically unavailable and computationally intractable, approximate Bayesian computation (ABC) has emerged as a widely used methodology for approximate posterior inference; however, it suffers from severe computational inefficiency in high-dimensional settings or under diffuse priors. To overcome these limitations, we propose Adaptive Bayesian Inference (ABI), a framework… ▽ More When the likelihood is analytically unavailable and computationally intractable, approximate Bayesian computation (ABC) has emerged as a widely used methodology for approximate posterior inference; however, it suffers from severe computational inefficiency in high-dimensional settings or under diffuse priors. To overcome these limitations, we propose Adaptive Bayesian Inference (ABI), a framework that bypasses traditional data-space discrepancies and instead compares distributions directly in posterior space through nonparametric distribution matching. By leveraging a novel Marginally-augmented Sliced Wasserstein (MSW) distance on posterior measures and exploiting its quantile representation, ABI transforms the challenging problem of measuring divergence between posterior distributions into a tractable sequence of one-dimensional conditional quantile regression tasks. Moreover, we introduce a new adaptive rejection sampling scheme that iteratively refines the posterior approximation by updating the proposal distribution via generative density estimation. Theoretically, we establish parametric convergence rates for the trimmed MSW distance and prove that the ABI posterior converges to the true posterior as the tolerance threshold vanishes. Through extensive empirical evaluation, we demonstrate that ABI significantly outperforms data-based Wasserstein ABC, summary-based ABC, and state-of-the-art likelihood-free simulators, especially in high-dimensional or dependent observation regimes. △ Less

Submitted 7 May, 2025; originally announced May 2025.

arXiv:2505.00778 [pdf, ps, other]

On Sierpiński and Riesel Repdigits and Repintegers

Authors: Chris Bispels, Matthew Cohen, Joshua Harrington, Kaelyn Pontes, Leif Schaumann, Tony W. H. Wong

Abstract: For positive integers $b\geq 2$, $k<b$, and $t$, we say that an integer $k_b^{(t)}$ is a $b$-repdigit if $k_b^{(t)}$ can be expressed as the digit $k$ repeated $t$ times in base-$b$ representation, i.e., $k_b^{(t)} =k(b^t-1)/(b-1)$. In the case of $k=1$, we say that $1_b^{(t)}$ is a $b$-repunit. In this article, we investigate the existsence of $b$-repdigits and $b$-repunits among the sets of Sier… ▽ More For positive integers $b\geq 2$, $k<b$, and $t$, we say that an integer $k_b^{(t)}$ is a $b$-repdigit if $k_b^{(t)}$ can be expressed as the digit $k$ repeated $t$ times in base-$b$ representation, i.e., $k_b^{(t)} =k(b^t-1)/(b-1)$. In the case of $k=1$, we say that $1_b^{(t)}$ is a $b$-repunit. In this article, we investigate the existsence of $b$-repdigits and $b$-repunits among the sets of Sierpiński numbers and Riesel numbers. A Sierpiński number is defined as an odd integer $k$ for which $k\cdot 2^n+1$ is composite for all positive integers $n$ and Riesel numbers are similarly defined for the expression $k\cdot 2^n-1$. △ Less

Submitted 1 May, 2025; originally announced May 2025.

MSC Class: 11A63; 11B25

arXiv:2504.17826 [pdf, other]

FashionM3: Multimodal, Multitask, and Multiround Fashion Assistant based on Unified Vision-Language Model

Authors: Kaicheng Pang, Xingxing Zou, Waikeung Wong

Abstract: Fashion styling and personalized recommendations are pivotal in modern retail, contributing substantial economic value in the fashion industry. With the advent of vision-language models (VLM), new opportunities have emerged to enhance retailing through natural language and visual interactions. This work proposes FashionM3, a multimodal, multitask, and multiround fashion assistant, built upon a VLM… ▽ More Fashion styling and personalized recommendations are pivotal in modern retail, contributing substantial economic value in the fashion industry. With the advent of vision-language models (VLM), new opportunities have emerged to enhance retailing through natural language and visual interactions. This work proposes FashionM3, a multimodal, multitask, and multiround fashion assistant, built upon a VLM fine-tuned for fashion-specific tasks. It helps users discover satisfying outfits by offering multiple capabilities including personalized recommendation, alternative suggestion, product image generation, and virtual try-on simulation. Fine-tuned on the novel FashionRec dataset, comprising 331,124 multimodal dialogue samples across basic, personalized, and alternative recommendation tasks, FashionM3 delivers contextually personalized suggestions with iterative refinement through multiround interactions. Quantitative and qualitative evaluations, alongside user studies, demonstrate FashionM3's superior performance in recommendation effectiveness and practical value as a fashion assistant. △ Less

Submitted 23 April, 2025; originally announced April 2025.

arXiv:2504.13723 [pdf, other]

QoS-Aware NOMA Design for Downlink Pinching-Antenna Systems

Authors: Yanqing Xu, Zhiguo Ding, Donghong Cai, Vincent W. S. Wong

Abstract: Pinching antennas, implemented by applying small dielectric particles on a waveguide, have emerged as a promising flexible-antenna technology ideal for next-generation wireless communications systems. Unlike conventional flexible-antenna systems, pinching antennas offer the advantage of creating line-of-sight links by enabling antennas to be activated on the waveguide at a location close to the us… ▽ More Pinching antennas, implemented by applying small dielectric particles on a waveguide, have emerged as a promising flexible-antenna technology ideal for next-generation wireless communications systems. Unlike conventional flexible-antenna systems, pinching antennas offer the advantage of creating line-of-sight links by enabling antennas to be activated on the waveguide at a location close to the user. This paper investigates a typical two-user non-orthogonal multiple access (NOMA) downlink scenario, where multiple pinching antennas are activated on a single dielectric waveguide to assist NOMA transmission. We formulate the problem of maximizing the data rate of one user subject to the quality-of-service requirement of the other user by jointly optimizing the antenna locations and power allocation coefficients. The formulated problem is nonconvex and difficult to solve due to the impact of antenna locations on large-scale path loss and two types of phase shifts, namely in-waveguide phase shifts and free space propagation phase shifts. To this end, we propose an iterative algorithm based on block coordinate descent and successive convex approximation techniques. Moreover, we consider the special case with a single pinching antenna, which is a simplified version of the multi-antenna case. Although the formulated problem is still nonconvex, by using the inherent features of the formulated problem, we derive the global optimal solution in closed-form, which offers important insights on the performance of pinching-antenna systems. Simulation results demonstrate that the pinching-antenna system significantly outperforms conventional fixed-position antenna systems, and the proposed algorithm achieves performance comparable to the computationally intensive exhaustive search based approach. △ Less

Submitted 18 April, 2025; originally announced April 2025.

Comments: This paper has been submitted for possible publication

arXiv:2504.13467 [pdf, ps, other]

Efficient Estimation under Multiple Missing Patterns via Balancing Weights

Authors: Jianing Dong, Raymond K. W. Wong, Kwun Chuen Gary Chan

Abstract: As one of the most commonly seen data challenges, missing data, in particular, multiple, non-monotone missing patterns, complicates estimation and inference due to the fact that missingness mechanisms are often not missing at random, and conventional methods cannot be applied. Pattern graphs have recently been proposed as a tool to systematically relate various observed patterns in the sample. We… ▽ More As one of the most commonly seen data challenges, missing data, in particular, multiple, non-monotone missing patterns, complicates estimation and inference due to the fact that missingness mechanisms are often not missing at random, and conventional methods cannot be applied. Pattern graphs have recently been proposed as a tool to systematically relate various observed patterns in the sample. We extend its scope to the estimation of parameters defined by moment equations, including common regression models, via solving weighted estimating equations with weights constructed using a sequential balancing approach. These novel weights are carefully crafted to address the instability issue of the straightforward approach based on local balancing. We derive the efficiency bound for the model parameters and show that our proposed method, albeit relatively simple, is asymptotically efficient. Simulation results demonstrate the superior performance of the proposed method, and real-data applications illustrate how the results are robust to the choice of identification assumptions. △ Less

Submitted 18 April, 2025; originally announced April 2025.

Comments: arXiv admin note: substantial text overlap with arXiv:2402.08873

arXiv:2504.01117 [pdf]

Near-energy-free Photonic Fourier Transformation for Convolution Operation Acceler

Authors: Hangbo Yang, Nicola Peserico, Shurui Li, Xiaoxuan Ma, Russell L. T. Schwartz, Mostafa Hosseini, Aydin Babakhani, Chee Wei Wong, Puneet Gupta, Volker J. Sorger

Abstract: Convolutional operations are computationally intensive in artificial intelligence services, and their overhead in electronic hardware limits machine learning scaling. Here, we introduce a photonic joint transform correlator (pJTC) using a near-energy-free on-chip Fourier transformation to accelerate convolution operations. The pJTC reduces computational complexity for both convolution and cross-co… ▽ More Convolutional operations are computationally intensive in artificial intelligence services, and their overhead in electronic hardware limits machine learning scaling. Here, we introduce a photonic joint transform correlator (pJTC) using a near-energy-free on-chip Fourier transformation to accelerate convolution operations. The pJTC reduces computational complexity for both convolution and cross-correlation from O(N4) to O(N2), where N2 is the input data size. Demonstrating functional Fourier transforms and convolution, this pJTC achieves 98.0% accuracy on an exemplary MNIST inference task. Furthermore, a wavelength-multiplexed pJTC architecture shows potential for high throughput and energy efficiency, reaching 305 TOPS/W and 40.2 TOPS/mm2, based on currently available foundry processes. An efficient, compact, and low-latency convolution accelerator promises to advance next-generation AI capabilities across edge demands, high-performance computing, and cloud services. △ Less

Submitted 1 April, 2025; originally announced April 2025.

Comments: 14 pages, 5 figures, Journal paper

arXiv:2503.15722 [pdf, ps, other]

Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication

Authors: Sin-Yu Huang, Renjie Liao, Vincent W. S. Wong

Abstract: Multi-task semantic communication (SC) can reduce the computational resources in wireless systems since retraining is not required when switching between tasks. However, existing approaches typically rely on task-specific embeddings to identify the intended task, necessitating retraining the entire model when given a new task. Consequently, this drives the need for a multi-task SC system that can… ▽ More Multi-task semantic communication (SC) can reduce the computational resources in wireless systems since retraining is not required when switching between tasks. However, existing approaches typically rely on task-specific embeddings to identify the intended task, necessitating retraining the entire model when given a new task. Consequently, this drives the need for a multi-task SC system that can handle new tasks without additional training, known as zero-shot learning. Inspired by the superior zero-shot capabilities of large language models (LLMs), we leverage pre-trained instruction-tuned LLMs, referred to as fine-tuned language net (FLAN), to improve the generalization capability. We incorporate a mixture-of-experts (MoE) architecture in the FLAN model and propose MoE-FLAN-SC architecture for multi-task SC systems. Our proposed MoE-FLAN-SC architecture can further improve the performance of FLAN-T5 model without increasing the computational cost. Moreover, we design a multi-task feature extraction module (FEM) which can adaptively extract relevant features across various tasks given the provided features and signal-to-noise ratio (SNR). Simulation results show that our proposed MoE-FLAN-SC architecture outperforms three state-of-the-art models in terms of the average accuracy on four different unseen tasks. △ Less

Submitted 21 March, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

Comments: Accepted by IEEE International Conference on Communications (ICC), June 2025, Montreal, Canada

arXiv:2503.13357 [pdf, other]

The Power of Amortization on Scheduling with Explorable Uncertainty

Authors: Alison Hsiang-Hsuan Liu, Fu-Hong Liu, Prudence W. H. Wong, Xiao-Ou Zhang

Abstract: In this work, we study a scheduling problem with explorable uncertainty. Each job comes with an upper limit of its processing time, which could be potentially reduced by testing the job, which also takes time. The objective is to schedule all jobs on a single machine with a minimum total completion time. The challenge lies in deciding which jobs to test and the order of testing/processing jobs.… ▽ More In this work, we study a scheduling problem with explorable uncertainty. Each job comes with an upper limit of its processing time, which could be potentially reduced by testing the job, which also takes time. The objective is to schedule all jobs on a single machine with a minimum total completion time. The challenge lies in deciding which jobs to test and the order of testing/processing jobs. The online problem was first introduced with unit testing time and later generalized to variable testing times. For this general setting, the upper bounds of the competitive ratio are shown to be $4$ and $3.3794$ for deterministic and randomized online algorithms; while the lower bounds for unit testing time stands, which are $1.8546$ (deterministic) and $1.6257$ (randomized). We continue the study on variable testing times setting. We first enhance the analysis framework and improve the competitive ratio of the deterministic algorithm from $4$ to $1+\sqrt{2} \approx 2.4143$. Using the new analysis framework, we propose a new deterministic algorithm that further improves the competitive ratio to $2.316513$. The new framework also enables us to develop a randomized algorithm improving the expected competitive ratio from $3.3794$ to $2.152271$. △ Less

Submitted 17 March, 2025; originally announced March 2025.

arXiv:2503.11948 [pdf]

Integration of Explainable AI Techniques with Large Language Models for Enhanced Interpretability for Sentiment Analysis

Authors: Thivya Thogesan, Anupiya Nugaliyadde, Kok Wai Wong

Abstract: Interpretability remains a key difficulty in sentiment analysis with Large Language Models (LLMs), particularly in high-stakes applications where it is crucial to comprehend the rationale behind forecasts. This research addressed this by introducing a technique that applies SHAP (Shapley Additive Explanations) by breaking down LLMs into components such as embedding layer,encoder,decoder and attent… ▽ More Interpretability remains a key difficulty in sentiment analysis with Large Language Models (LLMs), particularly in high-stakes applications where it is crucial to comprehend the rationale behind forecasts. This research addressed this by introducing a technique that applies SHAP (Shapley Additive Explanations) by breaking down LLMs into components such as embedding layer,encoder,decoder and attention layer to provide a layer-by-layer knowledge of sentiment prediction. The approach offers a clearer overview of how model interpret and categorise sentiment by breaking down LLMs into these parts. The method is evaluated using the Stanford Sentiment Treebank (SST-2) dataset, which shows how different sentences affect different layers. The effectiveness of layer-wise SHAP analysis in clarifying sentiment-specific token attributions is demonstrated by experimental evaluations, which provide a notable enhancement over current whole-model explainability techniques. These results highlight how the suggested approach could improve the reliability and transparency of LLM-based sentiment analysis in crucial applications. △ Less

Submitted 14 March, 2025; originally announced March 2025.

arXiv:2503.05907 [pdf, other]

Real-time Bus Travel Time Prediction and Reliability Quantification: A Hybrid Markov Model

Authors: Yuran Sun, James Spall, Wai Wong, Xilei Zhao

Abstract: Accurate and reliable bus travel time prediction in real-time is essential for improving the operational efficiency of public transportation systems. However, this remains a challenging task due to the limitations of existing models and data sources. This study proposed a hybrid Markovian framework for real-time bus travel time prediction, incorporating uncertainty quantification. Firstly, the bus… ▽ More Accurate and reliable bus travel time prediction in real-time is essential for improving the operational efficiency of public transportation systems. However, this remains a challenging task due to the limitations of existing models and data sources. This study proposed a hybrid Markovian framework for real-time bus travel time prediction, incorporating uncertainty quantification. Firstly, the bus link travel time distributions were modeled by integrating various influential factors while explicitly accounting for heteroscedasticity. Particularly, the parameters of the distributions were estimated using Maximum Likelihood Estimation, and the Fisher Information Matrix was then employed to calculate the 95\% uncertainty bounds for the estimated parameters, ensuring a robust and reliable quantification of prediction uncertainty of bus link travel times. Secondly, a Markovian framework with transition probabilities based on previously predicted bus link travel times was developed to predict travel times and their uncertainties from a current location to any future stop along the route. The framework was evaluated using the General Transit Feed Specification (GTFS) Static and Realtime data collected in 2023 from Gainesville, Florida. The results showed that the proposed model consistently achieved better prediction performance compared to the selected baseline approaches (including historical mean, statistical and AI-based models) while providing narrower uncertainty bounds. The model also demonstrated high interpretability, as the estimated coefficients provided insights into how different factors influencing bus travel times across links with varying characteristics. These findings suggest that the model could serve as a valuable tool for transit system performance evaluation and real-time trip planning. △ Less

Submitted 7 March, 2025; originally announced March 2025.

arXiv:2503.00002 [pdf, other]

Failure of Optimal Design Theory? A Case Study in Toxicology Using Sequential Robust Optimal Design Framework

Authors: Elvis Han Cui, Michael Collins, Jessica Munson, Weng Kee Wong

Abstract: This paper presents a quasi-sequential optimal design framework for toxicology experiments, specifically applied to sea urchin embryos. The authors propose a novel approach combining robust optimal design with adaptive, stage-based testing to improve efficiency in toxicological studies, particularly where traditional uniform designs fall short. The methodology uses statistical models to refine dos… ▽ More This paper presents a quasi-sequential optimal design framework for toxicology experiments, specifically applied to sea urchin embryos. The authors propose a novel approach combining robust optimal design with adaptive, stage-based testing to improve efficiency in toxicological studies, particularly where traditional uniform designs fall short. The methodology uses statistical models to refine dose levels across experimental phases, aiming for increased precision while reducing costs and complexity. Key components include selecting an initial design, iterative dose optimization based on preliminary results, and assessing various model fits to ensure robust, data-driven adjustments. Through case studies, we demonstrate improved statistical efficiency and adaptability in toxicology, with potential applications in other experimental domains. △ Less

Submitted 10 February, 2025; originally announced March 2025.

arXiv:2502.20916 [pdf, other]

doi 10.1016/j.astropartphys.2025.103135

COCOA: a compact Compton camera for astrophysical observation of MeV-scale gamma rays

Authors: LiquidO Collaboration, S. R. Soleti, J. J. Gómez-Cadenas, J. Apilluelo, L. Asquith, E. F. Bannister, N. P. Barradas, C. L. Baylis, J. L. Beney, M. Berberan e Santos, X. de la Bernardie, T. J. C. Bezerra, M. Bongrand, C. Bourgeois, D. Breton, J. Busto, K. Burns, A. Cabrera, A. Cadiou, E. Calvo, M. de Carlos Generowicz, E. Chauveau, B. J. Cattermole, M. Chen, P. Chimenti , et al. (67 additional authors not shown)

Abstract: COCOA (COmpact COmpton cAmera) is a next-generation gamma-ray telescope designed for astrophysical observations in the MeV energy range. The detector comprises a scatterer volume employing the LiquidO detection technology and an array of scintillating crystals acting as absorber. Surrounding plastic scintillator panels serve as a veto system for charged particles. The detector's compact, scalable… ▽ More COCOA (COmpact COmpton cAmera) is a next-generation gamma-ray telescope designed for astrophysical observations in the MeV energy range. The detector comprises a scatterer volume employing the LiquidO detection technology and an array of scintillating crystals acting as absorber. Surrounding plastic scintillator panels serve as a veto system for charged particles. The detector's compact, scalable design enables flexible deployment on microsatellites or high-altitude balloons. Gamma rays at MeV energies have not been well explored historically (the so-called "MeV gap") and COCOA has the potential to improve the sensitivity in this energy band. △ Less

Submitted 12 May, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

Comments: 14 pages, 17 figures

Journal ref: Astropart. Phys. 172 (2025) 103135

arXiv:2502.08879 [pdf]

doi 10.1016/j.newton.2025.100024

Recent advances in high-dimensional mode-locked quantum frequency combs

Authors: Kai-Chi Chang, Xiang Cheng, Murat Can Sarihan, Chee Wei Wong

Abstract: High-dimensional entanglement in qudit states offers a promising pathway towards the realization of practical, large-scale quantum systems that are highly controllable. These systems can be leveraged for various applications, including advanced quantum information processing, secure communications, computation, and metrology. In this context, quantum frequency combs have a crucial role as they inh… ▽ More High-dimensional entanglement in qudit states offers a promising pathway towards the realization of practical, large-scale quantum systems that are highly controllable. These systems can be leveraged for various applications, including advanced quantum information processing, secure communications, computation, and metrology. In this context, quantum frequency combs have a crucial role as they inherently support multiple modes in both temporal and frequency domains, while preserving a single spatial mode. The multiple temporal and frequency modes of quantum frequency combs facilitate the generation, characterization, and control of high-dimensional time-frequency entanglement in extensive quantum systems. In this review article, we provide an overview of recent technological advancements in high-dimensional energy-time entangled quantum frequency combs. We explore how these time-frequency qudits, achieved using scalable telecommunications-wavelength components, can empower the creation of large-scale quantum states. Advances in quantum frequency combs can unlock new capabilities and versatility for promising developments in quantum science and technology. △ Less

Submitted 27 March, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

Comments: 45 pages, 8 figures

Journal ref: Newton 1, 100024 (2025)

arXiv:2502.08770 [pdf]

Metalens array for complex-valued optical discrete Fourier transform

Authors: Randy Stefan Tanuwijaya, So Lap, Wai Chun Wong, Tailin An, Wing Yim Tam, Jensen Li

Abstract: Photonic computing has emerged as a promising platform for accelerating computational tasks with high degrees of parallelism, such as image processing and neural network. We present meta-DFT (discrete Fourier transform), a single layer metasurface device, designed to perform optical complex-to-complex DFT with O(N) time complexity. One critical challenge in free-space analog optical computing is t… ▽ More Photonic computing has emerged as a promising platform for accelerating computational tasks with high degrees of parallelism, such as image processing and neural network. We present meta-DFT (discrete Fourier transform), a single layer metasurface device, designed to perform optical complex-to-complex DFT with O(N) time complexity. One critical challenge in free-space analog optical computing is to control the measurement error. Our scheme addresses this issue by focusing light on spatially separated focal points and reconstructing the complex phase, which enable error correction. We systematically evaluate the device's performance using input vectors with random complex amplitudes and phases, to demonstrate its robust accuracy. Our findings pave the way towards advancement of metasurface-based computation, offering a robust framework that is readily extensible to an arbitrary complex-valued matrix-vector multiplication (MVM). △ Less

Submitted 12 February, 2025; originally announced February 2025.

Comments: 17 pages, 4 figures

arXiv:2502.08514 [pdf, other]

Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation

Authors: Mahnaz Koupaee, Jake W. Vincent, Saab Mansour, Igor Shalyminov, Han He, Hwanjun Song, Raphael Shu, Jianfeng He, Yi Nian, Amy Wing-mei Wong, Kyu J. Han, Hang Su

Abstract: Faithfulness evaluators based on large language models (LLMs) are often fooled by the fluency of the text and struggle with identifying errors in the summaries. We propose an approach to summary faithfulness evaluation in which multiple LLM-based agents are assigned initial stances (regardless of what their belief might be) and forced to come up with a reason to justify the imposed belief, thus en… ▽ More Faithfulness evaluators based on large language models (LLMs) are often fooled by the fluency of the text and struggle with identifying errors in the summaries. We propose an approach to summary faithfulness evaluation in which multiple LLM-based agents are assigned initial stances (regardless of what their belief might be) and forced to come up with a reason to justify the imposed belief, thus engaging in a multi-round debate to reach an agreement. The uniformly distributed initial assignments result in a greater diversity of stances leading to more meaningful debates and ultimately more errors identified. Furthermore, by analyzing the recent faithfulness evaluation datasets, we observe that naturally, it is not always the case for a summary to be either faithful to the source document or not. We therefore introduce a new dimension, ambiguity, and a detailed taxonomy to identify such special cases. Experiments demonstrate our approach can help identify ambiguities, and have even a stronger performance on non-ambiguous summaries. △ Less

Submitted 13 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

arXiv:2502.03042 [pdf, other]

Energy Diffusion and Advection Coefficients in Kinetic Simulations of Relativistic Plasma Turbulence

Authors: Kai W. Wong, Vladimir Zhdankin, Dmitri A. Uzdensky, Gregory R. Werner, Mitchell C. Begelman

Abstract: Turbulent, relativistic nonthermal plasmas are ubiquitous in high-energy astrophysical systems, as inferred from broadband nonthermal emission spectra. The underlying turbulent nonthermal particle acceleration (NTPA) processes have traditionally been modelled with a Fokker-Planck (FP) diffusion-advection equation for the particle energy distribution. We test FP-type NTPA theories by performing and… ▽ More Turbulent, relativistic nonthermal plasmas are ubiquitous in high-energy astrophysical systems, as inferred from broadband nonthermal emission spectra. The underlying turbulent nonthermal particle acceleration (NTPA) processes have traditionally been modelled with a Fokker-Planck (FP) diffusion-advection equation for the particle energy distribution. We test FP-type NTPA theories by performing and analysing particle-in-cell (PIC) simulations of turbulence in collisionless relativistic pair plasma. By tracking large numbers of particles in simulations with different initial magnetisation and system size, we first test and confirm the applicability of the FP framework. We then measure the FP energy diffusion ($D$) and advection ($A$) coefficients as functions of particle energy $γm c^2$, and compare their dependence to theoretical predictions. At high energies, we robustly find $D \sim γ^2$ for all cases. Hence, we fit $D = D_0 γ^2$ and find a scaling consistent with $D_0 \sim σ^{3/2}$ at low instantaneous magnetisation $σ(t)$, flattening to $D_0 \sim σ$ at higher $σ\sim 1$. We also find that the power-law index $α(t)$ of the particle energy distribution converges exponentially in time. We build and test an analytic model connecting the FP coefficients and $α(t)$, predicting $A(γ) \sim γ\log γ$. We confirm this functional form in our measurements of $A(γ,t)$, which allows us to predict $α(t)$ through the model relations. Our results suggest that the basic second-order Fermi acceleration model, which predicts $D_0 \sim σ$, may not be a complete description of NTPA in turbulent plasmas. These findings encourage further application of tracked particles and FP coefficients as a diagnostic in kinetic simulations of various astrophysically relevant plasma processes like collisionless shocks and magnetic reconnection. △ Less

Submitted 5 February, 2025; originally announced February 2025.

Comments: 22 pages, 24 figures, submitted for publication. Comments are welcome!

arXiv:2501.17509 [pdf, other]

doi 10.1051/0004-6361/202453433

The current cratering rate on the regular satellites of Jupiter, Saturn, and Uranus

Authors: R. Brasser, E. W. Wong, S. C. Werner

Abstract: We aim to compute the impact rates for objects with a diameter of 1 km onto the regular satellites of Jupiter, Saturn and Uranus using our latest dynamical simulations of the evolution of outer solar system coupled with the best estimates of the current population of objects beyond Neptune and their size-frequency distribution. We use the outcome of the last 3.5~Gyr of evolution of the outer solar… ▽ More We aim to compute the impact rates for objects with a diameter of 1 km onto the regular satellites of Jupiter, Saturn and Uranus using our latest dynamical simulations of the evolution of outer solar system coupled with the best estimates of the current population of objects beyond Neptune and their size-frequency distribution. We use the outcome of the last 3.5~Gyr of evolution of the outer solar system from our database of simulations and combine this with observational constraints of the population beyond Neptune to compute the flux of objects entering the Centaur region, with uncertainties. The initial conditions resemble the current population rather than a near-circular, near-planar disc usually assumed just before the onset of giant planet migration. We obtain a better estimate of the impact probability of a Centaur with the satellites from enacting simulations of planetesimals flying past the satellites on hyperbolic orbits, which agree with literature precedents. We find that our impact rate of objects greater than 1 km in diameter with Jupiter is 0.0012/yr, which is a factor of 3--6 lower than previous estimates of 0.0044/yr from Nesvorny et al. (2023) and 0.0075/yr from Zahnle et al. (2003). On the other hand our impact probabilities with the satellites scaled to the giant planets are consistent with these earlier literature estimates, as is the leakage rate of objects from beyond Neptune into the Centaur region. However, our absolute impact probabilities with the giant planets are lower. We attribute this to our choice of initial conditions. △ Less

Submitted 29 January, 2025; originally announced January 2025.

Comments: In revision with Astronomy and Astrophysics

Journal ref: A&A 695, A276 (2025)

arXiv:2501.14305 [pdf, other]

A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education

Authors: Calvin Yeung, Jeff Yu, King Chau Cheung, Tat Wing Wong, Chun Man Chan, Kin Chi Wong, Keisuke Fujii

Abstract: Automated grading has become an essential tool in education technology due to its ability to efficiently assess large volumes of student work, provide consistent and unbiased evaluations, and deliver immediate feedback to enhance learning. However, current systems face significant limitations, including the need for large datasets in few-shot learning methods, a lack of personalized and actionable… ▽ More Automated grading has become an essential tool in education technology due to its ability to efficiently assess large volumes of student work, provide consistent and unbiased evaluations, and deliver immediate feedback to enhance learning. However, current systems face significant limitations, including the need for large datasets in few-shot learning methods, a lack of personalized and actionable feedback, and an overemphasis on benchmark performance rather than student experience. To address these challenges, we propose a Zero-Shot Large Language Model (LLM)-Based Automated Assignment Grading (AAG) system. This framework leverages prompt engineering to evaluate both computational and explanatory student responses without requiring additional training or fine-tuning. The AAG system delivers tailored feedback that highlights individual strengths and areas for improvement, thereby enhancing student learning outcomes. Our study demonstrates the system's effectiveness through comprehensive evaluations, including survey responses from higher education students that indicate significant improvements in motivation, understanding, and preparedness compared to traditional grading methods. The results validate the AAG system's potential to transform educational assessment by prioritizing learning experiences and providing scalable, high-quality feedback. △ Less

Submitted 24 January, 2025; originally announced January 2025.

arXiv:2501.14249 [pdf, other]

Humanity's Last Exam

Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage. HLE consists of 2,500 questions across dozens of subjects, including mathematics, humanities, and the natural sciences. HLE is developed globally by subject-matter experts and consists of multiple-choice and short-answer questions suitable for automated grading. Each question has a known solution that is unambiguous and easily verifiable, but cannot be quickly answered via internet retrieval. State-of-the-art LLMs demonstrate low accuracy and calibration on HLE, highlighting a significant gap between current LLM capabilities and the expert human frontier on closed-ended academic questions. To inform research and policymaking upon a clear understanding of model capabilities, we publicly release HLE at https://lastexam.ai. △ Less

Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

Comments: 29 pages, 6 figures

arXiv:2501.10753 [pdf, ps, other]

Pinching Antennas: Principles, Applications and Challenges

Authors: Zheng Yang, Ning Wang, Yanshi Sun, Zhiguo Ding, Robert Schober, George K. Karagiannidis, Vincent W. S. Wong, Octavia A. Dobre

Abstract: Flexible-antenna systems, such as fluid antennas and movable antennas, have been recognized as key enabling technologies for sixth-generation (6G) wireless networks, as they can intelligently reconfigure the effective channel gains of the users and hence significantly improve their data transmission capabilities. However, existing flexible-antenna systems have been designed to combat small-scale f… ▽ More Flexible-antenna systems, such as fluid antennas and movable antennas, have been recognized as key enabling technologies for sixth-generation (6G) wireless networks, as they can intelligently reconfigure the effective channel gains of the users and hence significantly improve their data transmission capabilities. However, existing flexible-antenna systems have been designed to combat small-scale fading in non-line-of-sight (NLoS) conditions. As a result, they lack the ability to establish line-of-sight links, which are typically 100 times stronger than NLoS links. In addition, existing flexible-antenna systems have limited flexibility, where adding/removing an antenna is not straightforward. This article introduces an innovative flexible-antenna system called pinching antennas, which are realized by applying small dielectric particles to waveguides. We first describe the basics of pinching-antenna systems and their ability to provide strong LoS links by deploying pinching antennas close to the users as well as their capability to scale up/down the antenna system. We then focus on communication scenarios with different numbers of waveguides and pinching antennas, where innovative approaches to implement multiple-input multiple-output and non-orthogonal multiple access are discussed. In addition, promising 6G-related applications of pinching antennas, including integrated sensing and communication and next-generation multiple access, are presented. Finally, important directions for future research, such as waveguide deployment and channel estimation, are highlighted. △ Less

Submitted 18 January, 2025; originally announced January 2025.

arXiv:2501.07559 [pdf, other]

Euclid: Optimising tomographic redshift binning for 3$\times$2pt power spectrum constraints on dark energy

Authors: J. H. W. Wong, M. L. Brown, C. A. J. Duncan, A. Amara, S. Andreon, C. Baccigalupi, M. Baldi, S. Bardelli, D. Bonino, E. Branchini, M. Brescia, J. Brinchmann, A. Caillat, S. Camera, V. Capobianco, C. Carbone, J. Carretero, S. Casas, M. Castellano, G. Castignani, S. Cavuoti, A. Cimatti, C. Colodro-Conde, G. Congedo, C. J. Conselice , et al. (114 additional authors not shown)

Abstract: We present a simulation-based method to explore the optimum tomographic redshift binning strategy for 3x2pt analyses with Euclid, focusing on the expected configuration of its first major data release (DR1). To do this, we 1) simulate a Euclid-like observation and generate mock shear catalogues from multiple realisations of the 3x2pt fields on the sky, and 2) measure the 3x2pt Pseudo-Cl power spec… ▽ More We present a simulation-based method to explore the optimum tomographic redshift binning strategy for 3x2pt analyses with Euclid, focusing on the expected configuration of its first major data release (DR1). To do this, we 1) simulate a Euclid-like observation and generate mock shear catalogues from multiple realisations of the 3x2pt fields on the sky, and 2) measure the 3x2pt Pseudo-Cl power spectra for a given tomographic configuration and derive the constraints that they place on the standard dark energy equation of state parameters (w0, wa). For a simulation including Gaussian-distributed photometric redshift uncertainty and shape noise under a LambdaCDM cosmology, we find that bins equipopulated with galaxies yield the best constraints on (w0, wa) for an analysis of the full 3x2pt signal, or the angular clustering component only. For the cosmic shear component, the optimum (w0, wa) constraints are achieved by bins equally spaced in fiducial comoving distance. However, the advantage with respect to alternative binning choices is only a few percent in the size of the $1\,σ\,$(w0, wa) contour, and we conclude that the cosmic shear is relatively insensitive to the binning methodology. We find that the information gain extracted on (w0, wa) for any 3x2pt component starts to saturate at $\gtrsim$ 7-8 bins. Any marginal gains resulting from a greater number of bins is likely to be limited by additional uncertainties present in a real measurement, and the increasing demand for accuracy of the covariance matrix. Finally, we consider a 5% contamination from catastrophic photometric redshift outliers and find that, if these errors are not mitigated in the analysis, the bias induced in the 3x2pt signal for 10 equipopulated bins results in dark energy constraints that are inconsistent with the fiducial LambdaCDM cosmology at $>5\,σ$. △ Less

Submitted 13 January, 2025; originally announced January 2025.

Comments: Euclid Consortium paper. 28 pages, 17 figures. For submission to A&A

arXiv:2501.03472 [pdf, ps, other]

Sharp bounds for product and sum throttling numbers

Authors: Ryan Blair, Gabriel Elvin, Veronika Furst, Leslie Hogben, Nandita Sahajpal, Tony W. H. Wong

Abstract: Throttling in graphs optimizes a sum or product of resources used, such as the number of vertices in an initial set, and time required, such as the propagation time, to complete a given task. We introduce a new technique to establish sharp upper bounds in terms of graph order for sum throttling and initial cost product throttling for power domination. Furthermore, we establish sharp bounds on poss… ▽ More Throttling in graphs optimizes a sum or product of resources used, such as the number of vertices in an initial set, and time required, such as the propagation time, to complete a given task. We introduce a new technique to establish sharp upper bounds in terms of graph order for sum throttling and initial cost product throttling for power domination. Furthermore, we establish sharp bounds on possible changes of the product throttling number, both with and without initial cost, caused by certain graph operations for standard zero forcing, positive semidefinite forcing, and power domination. △ Less

Submitted 13 January, 2025; v1 submitted 6 January, 2025; originally announced January 2025.

MSC Class: 05C57; 05C69; 68R10

arXiv:2501.02814 [pdf]

Analogue Forecast System for Daily Precipitation Prediction Using Autoencoder Feature Extraction: Application in Hong Kong

Authors: Yee Chun Tsoi, Yu Ting Kwok, Ming Chun Lam, Wai Kin Wong

Abstract: In the Hong Kong Observatory, the Analogue Forecast System (AFS) for precipitation has been providing useful reference in predicting possible daily rainfall scenarios for the next 9 days, by identifying historical cases with similar weather patterns to the latest output from the deterministic model of the European Centre for Medium-Range Weather Forecasts (ECMWF). Recent advances in machine learni… ▽ More In the Hong Kong Observatory, the Analogue Forecast System (AFS) for precipitation has been providing useful reference in predicting possible daily rainfall scenarios for the next 9 days, by identifying historical cases with similar weather patterns to the latest output from the deterministic model of the European Centre for Medium-Range Weather Forecasts (ECMWF). Recent advances in machine learning allow more sophisticated models to be trained using historical data and the patterns of high-impact weather events to be represented more effectively. As such, an enhanced AFS has been developed using the deep learning technique autoencoder. The datasets of the fifth generation of the ECMWF Reanalysis (ERA5) are utilised where more meteorological elements in higher horizontal, vertical and temporal resolutions are available as compared to the previous ECMWF reanalysis products used in the existing AFS. The enhanced AFS features four major steps in generating the daily rain class forecasts: (1) preprocessing of gridded ERA5 and ECMWF model forecast, (2) feature extraction by the pretrained autoencoder, (3) application of optimised feature weightings based on historical cases, and (4) calculation of the final rain class from a weighted ensemble of top analogues. The enhanced AFS demonstrates a consistent and superior performance over the existing AFS, especially in capturing heavy rain cases, during the verification period from 2019 to 2022. This paper presents the detailed formulation of the enhanced AFS and discusses its advantages and limitations in supporting precipitation forecasting in Hong Kong. △ Less

Submitted 6 January, 2025; originally announced January 2025.

Comments: 16 pages, 10 figures

Journal ref: Hong Kong Meteorological Society E-BULLETIN Vol. 28, 2 (2024)

arXiv:2501.01495 [pdf, other]

doi 10.3847/1538-4357/adb3a0

Search for continuous gravitational waves from known pulsars in the first part of the fourth LIGO-Virgo-KAGRA observing run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné , et al. (1794 additional authors not shown)

Abstract: Continuous gravitational waves (CWs) emission from neutron stars carries information about their internal structure and equation of state, and it can provide tests of General Relativity. We present a search for CWs from a set of 45 known pulsars in the first part of the fourth LIGO--Virgo--KAGRA observing run, known as O4a. We conducted a targeted search for each pulsar using three independent ana… ▽ More Continuous gravitational waves (CWs) emission from neutron stars carries information about their internal structure and equation of state, and it can provide tests of General Relativity. We present a search for CWs from a set of 45 known pulsars in the first part of the fourth LIGO--Virgo--KAGRA observing run, known as O4a. We conducted a targeted search for each pulsar using three independent analysis methods considering the single-harmonic and the dual-harmonic emission models. We find no evidence of a CW signal in O4a data for both models and set upper limits on the signal amplitude and on the ellipticity, which quantifies the asymmetry in the neutron star mass distribution. For the single-harmonic emission model, 29 targets have the upper limit on the amplitude below the theoretical spin-down limit. The lowest upper limit on the amplitude is $6.4\!\times\!10^{-27}$ for the young energetic pulsar J0537-6910, while the lowest constraint on the ellipticity is $8.8\!\times\!10^{-9}$ for the bright nearby millisecond pulsar J0437-4715. Additionally, for a subset of 16 targets we performed a narrowband search that is more robust regarding the emission model, with no evidence of a signal. We also found no evidence of non-standard polarizations as predicted by the Brans-Dicke theory. △ Less

Submitted 2 January, 2025; originally announced January 2025.

Comments: main paper: 12 pages, 6 figures, 4 tables

Report number: LIGO-P2400315

Journal ref: Astrophys.J. 983 (2025) 2, 99

arXiv:2501.00755 [pdf, other]

An AI-powered Bayesian generative modeling approach for causal inference in observational studies

Authors: Qiao Liu, Wing Hung Wong

Abstract: Causal inference in observational studies with high-dimensional covariates presents significant challenges. We introduce CausalBGM, an AI-powered Bayesian generative modeling approach that captures the causal relationship among covariates, treatment, and outcome variables. The core innovation of CausalBGM lies in its ability to estimate the individual treatment effect (ITE) by learning individual-… ▽ More Causal inference in observational studies with high-dimensional covariates presents significant challenges. We introduce CausalBGM, an AI-powered Bayesian generative modeling approach that captures the causal relationship among covariates, treatment, and outcome variables. The core innovation of CausalBGM lies in its ability to estimate the individual treatment effect (ITE) by learning individual-specific distributions of a low-dimensional latent feature set (e.g., latent confounders) that drives changes in both treatment and outcome. This approach not only effectively mitigates confounding effects but also provides comprehensive uncertainty quantification, offering reliable and interpretable causal effect estimates at the individual level. CausalBGM adopts a Bayesian model and uses a novel iterative algorithm to update the model parameters and the posterior distribution of latent features until convergence. This framework leverages the power of AI to capture complex dependencies among variables while adhering to the Bayesian principles. Extensive experiments demonstrate that CausalBGM consistently outperforms state-of-the-art methods, particularly in scenarios with high-dimensional covariates and large-scale datasets. Its Bayesian foundation ensures statistical rigor, providing robust and well-calibrated posterior intervals. By addressing key limitations of existing methods, CausalBGM emerges as a robust and promising framework for advancing causal inference in modern applications in fields such as genomics, healthcare, and social sciences. CausalBGM is maintained at the website https://causalbgm.readthedocs.io/. △ Less

Submitted 1 January, 2025; originally announced January 2025.

arXiv:2412.16897 [pdf, other]

MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context

Authors: Shuai Lyu, Rongchen Zhang, Zeqi Ma, Fangjian Liao, Dongmei Mo, Waikeung Wong

Abstract: Few-shot defect multi-classification (FSDMC) is an emerging trend in quality control within industrial manufacturing. However, current FSDMC research often lacks generalizability due to its focus on specific datasets. Additionally, defect classification heavily relies on contextual information within images, and existing methods fall short of effectively extracting this information. To address the… ▽ More Few-shot defect multi-classification (FSDMC) is an emerging trend in quality control within industrial manufacturing. However, current FSDMC research often lacks generalizability due to its focus on specific datasets. Additionally, defect classification heavily relies on contextual information within images, and existing methods fall short of effectively extracting this information. To address these challenges, we propose a general FSDMC framework called MVREC, which offers two primary advantages: (1) MVREC extracts general features for defect instances by incorporating the pre-trained AlphaCLIP model. (2) It utilizes a region-context framework to enhance defect features by leveraging mask region input and multi-view context augmentation. Furthermore, Few-shot Zip-Adapter(-F) classifiers within the model are introduced to cache the visual features of the support set and perform few-shot classification. We also introduce MVTec-FS, a new FSDMC benchmark based on MVTec AD, which includes 1228 defect images with instance-level mask annotations and 46 defect types. Extensive experiments conducted on MVTec-FS and four additional datasets demonstrate its effectiveness in general defect classification and its ability to incorporate contextual information to improve classification performance. Code: https://github.com/ShuaiLYU/MVREC △ Less

Submitted 30 March, 2025; v1 submitted 22 December, 2024; originally announced December 2024.

Comments: Accepted by AAAI 2025

arXiv:2412.13905 [pdf, other]

doi 10.1109/ACSAC63791.2024.00027

T-Edge: Trusted Heterogeneous Edge Computing

Authors: Jiamin Shen, Yao Chen, Weng-Fai Wong, Ee-Chien Chang

Abstract: Heterogeneous computing, which incorporates GPUs, NPUs, and FPGAs, is increasingly utilized to improve the efficiency of computer systems. However, this shift has given rise to significant security and privacy concerns, especially when the execution platform is remote. One way to tackle these challenges is to establish a trusted and isolated environment for remote program execution, while maintain… ▽ More Heterogeneous computing, which incorporates GPUs, NPUs, and FPGAs, is increasingly utilized to improve the efficiency of computer systems. However, this shift has given rise to significant security and privacy concerns, especially when the execution platform is remote. One way to tackle these challenges is to establish a trusted and isolated environment for remote program execution, while maintaining minimal overhead and flexibility. While CPU-based trusted execution has been extensively explored and found commercial success, extension to heterogeneous computing systems remains a challenge. This paper proposes a practical trusted execution environment design for ARM/FPGA System-on-Chip platforms, leveraging TrustZone's unique characteristics. The design features a dedicated security controller within the ARM TrustZone, overseeing FPGA reconfiguration and managing communication between CPU cores and FPGA fabrics. This design involves a provisioning service that enables application users to establish trust in the FPGA fabric within cloud-based computing resources provided by the platform owner, running applications developed by third-party developers and hardware manufactured by the device manufacturer. To ensure the security of our proposed system, we employ an automated protocol verifier, ProVerif, to validate its compliance with essential security requirements. Furthermore, we demonstrate the practicality of our system model by implementing a prototype application on the Xilinx MPSoC development board. △ Less

Submitted 18 December, 2024; originally announced December 2024.

Comments: 13 pages, 6 figures

arXiv:2412.03005 [pdf]

gghic: A Versatile R Package for Exploring and Visualizing 3D Genome Organization

Authors: Minghao Jiang, Duohui Jing, Jason W. H. Wong

Abstract: Motivation: The three-dimensional (3D) organization of the genome plays a critical role in regulating gene expression and maintaining cellular homeostasis. Disruptions in this spatial organization can result in abnormal chromatin interactions, contributing to the development of various diseases including cancer. Advances in chromosome conformation capture technologies, such as Hi-C, have enabled r… ▽ More Motivation: The three-dimensional (3D) organization of the genome plays a critical role in regulating gene expression and maintaining cellular homeostasis. Disruptions in this spatial organization can result in abnormal chromatin interactions, contributing to the development of various diseases including cancer. Advances in chromosome conformation capture technologies, such as Hi-C, have enabled researchers to study genome architecture at high resolution. However, the efficient visualization and interpretation of these complex datasets remain a major challenge, particularly when integrating genomic annotations and inter-chromosomal interactions. Results: We present gghic, an R package that extends the ggplot2 framework to enable intuitive and customizable visualization of genomic interaction data. gghic introduces novel layers for generating triangular heatmaps of chromatin interactions and annotating them with features such as chromatin loops, topologically associated domains (TADs), gene/transcript models, and data tracks (e.g., ChIP-seq signals). The package supports data from multiple chromosomes, facilitating the exploration of inter-chromosomal interactions. Built to integrate seamlessly with the R/Bioconductor ecosystem, gghic is compatible with widely used genomic data formats, including HiCExperiment and GInteractions objects. We demonstrate the utility of gghic by replicating a published figure showing a translocation event in T-cell acute lymphoblastic leukemia (T-ALL), highlighting its ability to integrate genomic annotations and generate publication-quality figures. Availability and implementation: The R package can be accessed at https://github.com/jasonwong-lab/gghic and is distributed under the GNU General Public License version 3.0. △ Less

Submitted 3 December, 2024; originally announced December 2024.

arXiv:2411.17101 [pdf]

Software Fault Localization Based on Multi-objective Feature Fusion and Deep Learning

Authors: Xiaolei Hu, Dongcheng Li, W. Eric Wong, Ya Zou

Abstract: Software fault localization remains challenging due to limited feature diversity and low precision in traditional methods. This paper proposes a novel approach that integrates multi-objective optimization with deep learning models to improve both accuracy and efficiency in fault localization (FL). By framing feature selection as a multi-objective optimization problem (MOP), we extract and fuse thr… ▽ More Software fault localization remains challenging due to limited feature diversity and low precision in traditional methods. This paper proposes a novel approach that integrates multi-objective optimization with deep learning models to improve both accuracy and efficiency in fault localization (FL). By framing feature selection as a multi-objective optimization problem (MOP), we extract and fuse three critical fault-related feature sets: spectrum-based, mutation-based, and text-based features, into a comprehensive feature fusion model. These features are then embedded within a deep learning architecture, comprising a multilayer perceptron (MLP) and gated recurrent network (GRN), which together enhance localization accuracy and generalizability. Experiments on the Defects4J benchmark dataset with 434 faults show that the proposed algorithm reduces processing time by 78.2% compared to single-objective methods. Additionally, our MLP and GRN models achieve a 94.2% improvement in localization accuracy compared to traditional FL methods, outperforming state-of-the-art deep learning-based FL method by 7.67%. Further validation using the PROMISE dataset demonstrates the generalizability of the proposed model, showing a 4.6% accuracy improvement in cross-project tests over state-of-the-art deep learning-based FL method. △ Less

Submitted 25 November, 2024; originally announced November 2024.

arXiv:2411.14939 [pdf, other]

Many happy returns: machine learning to support platelet issuing and waste reduction in hospital blood banks

Authors: Joseph Farrington, Samah Alimam, Martin Utley, Kezhi Li, Wai Keong Wong

Abstract: Efforts to reduce platelet wastage in hospital blood banks have focused on ordering policies, but the predominant practice of issuing the oldest unit first may not be optimal when some units are returned unused. We propose a novel, machine learning (ML)-guided issuing policy to increase the likelihood of returned units being reissued before expiration. Our ML model trained to predict returns on 17… ▽ More Efforts to reduce platelet wastage in hospital blood banks have focused on ordering policies, but the predominant practice of issuing the oldest unit first may not be optimal when some units are returned unused. We propose a novel, machine learning (ML)-guided issuing policy to increase the likelihood of returned units being reissued before expiration. Our ML model trained to predict returns on 17,297 requests for platelets gave AUROC 0.74 on 9,353 held-out requests. Prior to ML model development we built a simulation of the blood bank operation that incorporated returns to understand the scale of benefits of such a model. Using our trained model in the simulation gave an estimated reduction in wastage of 14%. Our partner hospital is considering adopting our approach, which would be particularly beneficial for hospitals with higher return rates and where units have a shorter remaining useful life on arrival. △ Less

Submitted 22 November, 2024; originally announced November 2024.

MSC Class: 90B05 (Primary) 62P10; 68T05; 92C60 (Secondary) ACM Class: I.2.1; I.6.3; J.3; H.4.2

arXiv:2411.10548 [pdf, ps, other]

BioNeMo Framework: a modular, high-performance library for AI model development in drug discovery

Authors: Peter St. John, Dejun Lin, Polina Binder, Malcolm Greaves, Vega Shah, John St. John, Adrian Lange, Patrick Hsu, Rajesh Illango, Arvind Ramanathan, Anima Anandkumar, David H Brookes, Akosua Busia, Abhishaike Mahajan, Stephen Malina, Neha Prasad, Sam Sinai, Lindsay Edwards, Thomas Gaudelet, Cristian Regep, Martin Steinegger, Burkhard Rost, Alexander Brace, Kyle Hippe, Luca Naef , et al. (68 additional authors not shown)

Abstract: Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational bio… ▽ More Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational biology and chemistry AI models across hundreds of GPUs. Its modular design allows the integration of individual components, such as data loaders, into existing workflows and is open to community contributions. We detail technical features of the BioNeMo Framework through use cases such as pLM pre-training and fine-tuning. On 256 NVIDIA A100s, BioNeMo Framework trains a three billion parameter BERT-based pLM on over one trillion tokens in 4.2 days. The BioNeMo Framework is open-source and free for everyone to use. △ Less

Submitted 12 June, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

arXiv:2411.09460 [pdf, other]

Analysis Methodology for Age of Information under Sequence Based Scheduling

Authors: Fang Liu, Wing Shing Wong, Yuan-Hsun Lo, Yijin Zhang, Chung Shue Chen

Abstract: We focus on the Age of Information (AoI) performance in a system where each user generates packets periodically to send to a common access point (AP) for status updating. To avoid heavy overhead, we assume that channel sensing, feedback information from the AP, and time synchronization are not available in the system. We adopt a multi-access scheme called the sequence scheme, where each user is as… ▽ More We focus on the Age of Information (AoI) performance in a system where each user generates packets periodically to send to a common access point (AP) for status updating. To avoid heavy overhead, we assume that channel sensing, feedback information from the AP, and time synchronization are not available in the system. We adopt a multi-access scheme called the sequence scheme, where each user is assigned a periodic binary sequence to schedule their transmissions. In our previous work [18], we have thoroughly studied the AoI performance under sequence scheme when the period of schedule sequences, $L$, is equal to the status generating period, $T$. The results can be extended to the case where $T>L$. However, the case of $T<L$ is not covered by [18]. Therefore, in this paper, we concentrate on analyzing the AoI performance in the case of $T<L$, which is more challenging and requires different approaches. We conduct in-depth analysis on this case and develop a mathematical tool based on integer partitions to facilitate the analysis. We derive low-complexity closed-form expressions for two scenarios under $T<L$. Based on the obtained analytical results, we propose an algorithm to optimize the construction parameters of the sequence scheme. Finally, we compare our proposed sequence scheme with two commonly used baselines, and show that our proposed scheme outperforms the baselines in terms of AoI performance while consuming less energy. △ Less

Submitted 14 November, 2024; originally announced November 2024.

arXiv:2411.09019 [pdf]

Quantum Nanophotonics with Energetic Particles:X-rays and Free Electrons

Authors: Xihang Shi, Wen Wei Lee, Aviv Karnieli, Leon Merten Lohse, Alexey Gorlach, Lee Wei Wesley Wong, Tim Saldit, Shanhui Fan, Ido Kaminer, Liang Jie Wong

Abstract: Rapid progress in precision nanofabrication and atomic design over the past 50 years has ushered in a succession of transformative eras for molding the generation and flow of light. The use of nanoscale and atomic features to design light sources and optical elements-encapsulated by the term nanophotonics-has led to new fundamental science and innovative technologies across the entire electromagne… ▽ More Rapid progress in precision nanofabrication and atomic design over the past 50 years has ushered in a succession of transformative eras for molding the generation and flow of light. The use of nanoscale and atomic features to design light sources and optical elements-encapsulated by the term nanophotonics-has led to new fundamental science and innovative technologies across the entire electromagnetic spectrum, with substantial emphasis on the microwave to visible regimes. In this review, we pay special attention to the impact and potential of nanophotonics in a relatively exotic yet technologically disruptive regime: high-energy particles such as X-ray photons and free electrons-where nanostructures and atomic design open the doors to unprecedented technologies in quantum science and versatile X-ray sources and optics. As the practical generation of X-rays is intrinsically linked to the existence of energetic free or quasi-free-electrons, our review will also capture related phenomena and technologies that combine free electrons with nanophotonics, including free-electron-driven nanophotonics at other photon energies. In particular, we delve into the demonstration and study of quantum recoil in the X-ray regime, the study of nanomaterial design and free-electron wave shaping as means to enhance and control X-ray radiation, examine the free-electron generation enabled by nanophotonics, and analyze the high-harmonic generation by quasi-free electrons. We also discuss applications of quantum nanophotonics for X-rays and free electrons, including nanostructure waveguides for X-rays, photon pair enhanced X-ray imaging, mirrors, and lenses for X-rays, among others. △ Less

Submitted 13 November, 2024; originally announced November 2024.

arXiv:2411.07303 [pdf, other]

Peering into the black box: forward-modeling the uncertainty budget of high-resolution spectroscopy of exoplanet atmospheres

Authors: Arjun B. Savel, Megan Bedell, Eliza M. -R. Kempton, Peter Smith, Jacob L. Bean, Lily L. Zhao, Kaze W. K. Wong, Jorge A. Sanchez, Michael R. Line

Abstract: Ground-based high-resolution cross-correlation spectroscopy (HRCCS; R >~ 15,000) is a powerful complement to space-based studies of exoplanet atmospheres. By resolving individual spectral lines, HRCCS can precisely measure chemical abundance ratios, directly constrain atmospheric dynamics, and robustly probe multidimensional physics. But the subtleties of HRCCS datasets -- e.g., the lack of exopla… ▽ More Ground-based high-resolution cross-correlation spectroscopy (HRCCS; R >~ 15,000) is a powerful complement to space-based studies of exoplanet atmospheres. By resolving individual spectral lines, HRCCS can precisely measure chemical abundance ratios, directly constrain atmospheric dynamics, and robustly probe multidimensional physics. But the subtleties of HRCCS datasets -- e.g., the lack of exoplanetary spectra visible by eye and the statistically complex process of telluric removal -- can make interpreting them difficult. In this work, we seek to clarify the uncertainty budget of HRCCS with a forward-modeling approach. We present a HRCCS observation simulator, scope (https://github.com/arjunsavel/scope), that incorporates spectral contributions from the exoplanet, star, tellurics, and instrument. This tool allows us to control the underlying dataset, enabling controlled experimentation with complex HRCCS methods. Simulating a fiducial hot Jupiter dataset (WASP-77Ab emission with IGRINS), we first confirm via multiple tests that the commonly used principal components analysis does not bias the planetary signal when few components are used. Furthermore, we demonstrate that mildly varying tellurics and moderate wavelength solution errors induce only mild decreases in HRCCS detection significance. However, limiting-case, strongly varying tellurics can bias the retrieved velocities and gas abundances. Additionally, in the low-SNR limit, constraints on gas abundances become highly non-Gaussian. Our investigation of the uncertainties and potential biases inherent in HRCCS data analysis enables greater confidence in scientific results from this maturing method. △ Less

Submitted 6 January, 2025; v1 submitted 11 November, 2024; originally announced November 2024.

Comments: 21 pages, 11 figures. Accepted for publication in AJ

arXiv:2411.06658 [pdf]

Remote picometric acoustic sensing via ultrastable laser interferometry

Authors: Yoon-Soo Jang, Dong Il Lee, Jaime Flor Flores, Wenting Wang, Chee Wei Wong

Abstract: Acoustic detection has many applications across science and technology, from medical to imaging and communications. However, most acoustic sensors have a common limitation in that the detection must be near the acoustic source. Alternatively laser interferometry with picometer-scale motional displacement detection can rapidly and precisely measure sound induced minute vibrations on remote surfaces… ▽ More Acoustic detection has many applications across science and technology, from medical to imaging and communications. However, most acoustic sensors have a common limitation in that the detection must be near the acoustic source. Alternatively laser interferometry with picometer-scale motional displacement detection can rapidly and precisely measure sound induced minute vibrations on remote surfaces. Here we demonstrate the feasibility of sound detection up to 100 kHz at remote sites with ~ 60 m of optical path length via laser homodyne interferometry. Based on our ultrastable Hz-linewidth laser with 10-15 fractional stability, our laser interferometer achieves 0.5 pm/Hz1/2 displacement sensitivity near 10 kHz, bounded only by laser frequency noise over 10 kHz. Between 140 Hz to 15 kHz, we achieve a homodyne acoustic sensing sensitivity of sub-nm/Pa across our conversational frequency overtones. The minimal sound pressure detectable over 60 m of optical path length is ~ 2 mPa, with dynamic ranges over 100 dB. With the demonstrated standoff picometric distance metrology, we successfully detected and reconstructed musical scores of normal conversational volumes with high fidelity. The acoustic detection via this precision laser interferometer could be applied to selective area sound sensing for remote acoustic metrology, optomechanical vibrational motion sensing and ultrasensitive optical microphones at the laser frequency noise limits. △ Less

Submitted 10 November, 2024; originally announced November 2024.

Comments: 25 pages, 10 figures

arXiv:2411.05797 [pdf, other]

Metaheuristics is All You Need

Authors: Eliuvish Cuicizion, Haowen Xu, Weng Kee Wong

Abstract: Optimization plays an important role in tackling public health problems. Animal instincts can be used effectively to solve complex public health management issues by providing optimal or approximately optimal solutions to complicated optimization problems common in public health. BAT algorithm is an exemplary member of a class of nature-inspired metaheuristic optimization algorithms and designed t… ▽ More Optimization plays an important role in tackling public health problems. Animal instincts can be used effectively to solve complex public health management issues by providing optimal or approximately optimal solutions to complicated optimization problems common in public health. BAT algorithm is an exemplary member of a class of nature-inspired metaheuristic optimization algorithms and designed to outperform existing metaheuristic algorithms in terms of efficiency and accuracy. It's inspiration comes from the foraging behavior of group of microbats that use echolocation to find their target in the surrounding environment. In recent years, BAT algorithm has been extensively used by researchers in the area of optimization, and various variants of BAT algorithm have been developed to improve its performance and extend its application to diverse disciplines. This paper first reviews the basic BAT algorithm and its variants, including their applications in various fields. As a specific application, we apply the BAT algorithm to a biostatistical estimation problem and show it has some clear advantages over existing algorithms. △ Less

Submitted 21 March, 2025; v1 submitted 25 October, 2024; originally announced November 2024.

Comments: 25 pages, many figures

arXiv:2411.04487 [pdf, other]

Accelerated Design of Microring Lasers with Multi-Objective Bayesian Optimization

Authors: Mihir R. Athavale, Ruqaiya Al-Abri, Stephen Church, Wei Wen Wong, Andre KY Low, Hark Hoe Tan, Kedar Hippalgaonkar, Patrick Parkinson

Abstract: On-chip coherent laser sources are crucial for the future of photonic integrated circuits, yet progress has been hindered by the complex interplay between material quality, device geometry, and performance metrics. We combine high-throughput characterization, statistical analysis, experimental design, and multi-objective Bayesian optimization to accelerate the design process for low-threshold, hig… ▽ More On-chip coherent laser sources are crucial for the future of photonic integrated circuits, yet progress has been hindered by the complex interplay between material quality, device geometry, and performance metrics. We combine high-throughput characterization, statistical analysis, experimental design, and multi-objective Bayesian optimization to accelerate the design process for low-threshold, high-yield III-V microring lasers with room-temperature operation at communication wavelengths. We demonstrate a 1.6$\times$ reduction in threshold over expert-designed configurations, achieving a 100% lasing yield that emits within the O-band with a median threshold as low as 33$μ$J cm$^{-2}$ pulse$^{-1}$. △ Less

Submitted 7 November, 2024; originally announced November 2024.

arXiv:2411.02453 [pdf, other]

Super-Resolution without High-Resolution Labels for Black Hole Simulations

Authors: Thomas Helfer, Thomas D. P. Edwards, Jessica Dafflon, Kaze W. K. Wong, Matthew Lyle Olson

Abstract: Generating high-resolution simulations is key for advancing our understanding of one of the universe's most violent events: Black Hole mergers. However, generating Black Hole simulations is limited by prohibitive computational costs and scalability issues, reducing the simulation's fidelity and resolution achievable within reasonable time frames and resources. In this work, we introduce a novel me… ▽ More Generating high-resolution simulations is key for advancing our understanding of one of the universe's most violent events: Black Hole mergers. However, generating Black Hole simulations is limited by prohibitive computational costs and scalability issues, reducing the simulation's fidelity and resolution achievable within reasonable time frames and resources. In this work, we introduce a novel method that circumvents these limitations by applying a super-resolution technique without directly needing high-resolution labels, leveraging the Hamiltonian and momentum constraints-fundamental equations in general relativity that govern the dynamics of spacetime. We demonstrate that our method achieves a reduction in constraint violation by one to two orders of magnitude and generalizes effectively to out-of-distribution simulations. △ Less

Submitted 3 November, 2024; originally announced November 2024.

Comments: Code available at https://github.com/ThomasHelfer/TorchGRTL and data at https://huggingface.co/datasets/thelfer/BinaryBlackHole

arXiv:2411.01033 [pdf]

Many-Objective Search-Based Coverage-Guided Automatic Test Generation for Deep Neural Networks

Authors: Dongcheng Li, W. Eric Wong, Hu Liu, Man Zhao

Abstract: To ensure the reliability of DNN systems and address the test generation problem for neural networks, this paper proposes a fuzzing test generation technique based on many-objective optimization algorithms. Traditional fuzz testing employs random search, leading to lower testing efficiency and tends to generate numerous invalid test cases. By utilizing many-objective optimization techniques, effec… ▽ More To ensure the reliability of DNN systems and address the test generation problem for neural networks, this paper proposes a fuzzing test generation technique based on many-objective optimization algorithms. Traditional fuzz testing employs random search, leading to lower testing efficiency and tends to generate numerous invalid test cases. By utilizing many-objective optimization techniques, effective test cases can be generated. To achieve high test coverage, this paper proposes several improvement strategies. The frequency-based fuzz sampling strategy assigns priorities based on the frequency of selection of initial data, avoiding the repetitive selection of the same data and enhancing the quality of initial data better than random sampling strategies. To address the issue that global search may yield test not satisfying semantic constraints, a local search strategy based on the Monte Carlo tree search is proposed to enhance the algorithm's local search capabilities. Furthermore, we improve the diversity of the population and the algorithm's global search capability by updating SPEA2's external archive based on a decomposition-based archiving strategy. To validate the effectiveness of the proposed approach, experiments were conducted on several public datasets and various neural network models. The results reveal that, compared to random and clustering-based sampling, the frequency-based fuzz sampling strategy provides a greater improvement in coverage rate in the later stages of iterations. On complex networks like VGG16, the improved SPEA2 algorithm increased the coverage rate by about 12% across several coverage metrics, and by approximately 40% on LeNet series networks. The experimental results also indicates that the newly generated test cases not only exhibit higher coverage rates but also generate adversarial samples that reveal model errors. △ Less

Submitted 1 November, 2024; originally announced November 2024.

arXiv:2410.23159 [pdf, other]

Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting

Authors: Chiu-Wai Yan, Shi Quan Foo, Van Hoan Trinh, Dit-Yan Yeung, Ka-Hing Wong, Wai-Kin Wong

Abstract: Deep learning approaches have been widely adopted for precipitation nowcasting in recent years. Previous studies mainly focus on proposing new model architectures to improve pixel-wise metrics. However, they frequently result in blurry predictions which provide limited utility to forecasting operations. In this work, we propose a new Fourier Amplitude and Correlation Loss (FACL) which consists of… ▽ More Deep learning approaches have been widely adopted for precipitation nowcasting in recent years. Previous studies mainly focus on proposing new model architectures to improve pixel-wise metrics. However, they frequently result in blurry predictions which provide limited utility to forecasting operations. In this work, we propose a new Fourier Amplitude and Correlation Loss (FACL) which consists of two novel loss terms: Fourier Amplitude Loss (FAL) and Fourier Correlation Loss (FCL). FAL regularizes the Fourier amplitude of the model prediction and FCL complements the missing phase information. The two loss terms work together to replace the traditional $L_2$ losses such as MSE and weighted MSE for the spatiotemporal prediction problem on signal-based data. Our method is generic, parameter-free and efficient. Extensive experiments using one synthetic dataset and three radar echo datasets demonstrate that our method improves perceptual metrics and meteorology skill scores, with a small trade-off to pixel-wise accuracy and structural similarity. Moreover, to improve the error margin in meteorological skill scores such as Critical Success Index (CSI) and Fractions Skill Score (FSS), we propose and adopt the Regional Histogram Divergence (RHD), a distance metric that considers the patch-wise similarity between signal-based imagery patterns with tolerance to local transforms. Code is available at https://github.com/argenycw/FACL △ Less

Submitted 30 October, 2024; originally announced October 2024.

Comments: Accepted by NeurIPS 2024. Camera-ready submission

arXiv:2410.21076 [pdf, other]

Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows

Authors: Alicja Polanska, Thibeau Wouters, Peter T. H. Pang, Kaze K. W. Wong, Jason D. McEwen

Abstract: We present an accelerated pipeline, based on high-performance computing techniques and normalizing flows, for joint Bayesian parameter estimation and model selection and demonstrate its efficiency in gravitational wave astrophysics. We integrate the Jim inference toolkit, a normalizing flow-enhanced Markov chain Monte Carlo (MCMC) sampler, with the learned harmonic mean estimator. Our Bayesian evi… ▽ More We present an accelerated pipeline, based on high-performance computing techniques and normalizing flows, for joint Bayesian parameter estimation and model selection and demonstrate its efficiency in gravitational wave astrophysics. We integrate the Jim inference toolkit, a normalizing flow-enhanced Markov chain Monte Carlo (MCMC) sampler, with the learned harmonic mean estimator. Our Bayesian evidence estimates run on $1$ GPU are consistent with traditional nested sampling techniques run on $16$ CPU cores, while reducing the computation time by factors of $5\times$ and $15\times$ for $4$-dimensional and $11$-dimensional gravitational wave inference problems, respectively. Our code is available in well-tested and thoroughly documented open-source packages, ensuring accessibility and reproducibility for the wider research community. △ Less

Submitted 31 October, 2024; v1 submitted 28 October, 2024; originally announced October 2024.

Comments: accepted to NeurIPS 2024 workshop on Machine Learning and the Physical Sciences

Showing 1–50 of 723 results for author: Wong, W