Skip to main content

Showing 1–50 of 1,486 results for author: Ng

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.15808  [pdf, ps, other

    cs.IT eess.SP

    Hybrid Near-Far Field 6D Movable Antenna Design Exploiting Directional Sparsity and Deep Learning

    Authors: Xiaodan Shao, Limei Hu, Yulong Sun, Xing Li, Yixiao Zhang, Jingze Ding, Xiaoming Shi, Feng Chen, Derrick Wing Kwan Ng, Robert Schober

    Abstract: Six-dimensional movable antenna (6DMA) has been identified as a new disruptive technology for future wireless systems to support a large number of users with only a few antennas. However, the intricate relationships between the signal carrier wavelength and the transceiver region size lead to inaccuracies in traditional far-field 6DMA channel model, causing discrepancies between the model predicti… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 13 pages

  2. arXiv:2506.13044  [pdf, ps, other

    cs.CL cs.AI

    Just Go Parallel: Improving the Multilingual Capabilities of Large Language Models

    Authors: Muhammad Reza Qorib, Junyi Li, Hwee Tou Ng

    Abstract: Large language models (LLMs) have demonstrated impressive translation capabilities even without being explicitly trained on parallel data. This remarkable property has led some to believe that parallel data is no longer necessary for building multilingual language models. While some attribute this to the emergent abilities of LLMs due to scale, recent work suggests that it is actually caused by in… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

    Comments: ACL 2025

  3. arXiv:2506.11042  [pdf, ps, other

    cs.LG

    GenFT: A Generative Parameter-Efficient Fine-Tuning Method for Pretrained Foundation Models

    Authors: Baoquan Zhang, Guangning Xu, Michael. K. Ng

    Abstract: Pretrained Foundation Models (PFMs) have transformed numerous applications by enabling efficient adaptation to customized tasks. Parameter-Efficient Fine-Tuning (PEFT) has emerged as a resource-efficient alternative to full fine-tuning, especially leveraging reparameterized weights $ΔW$ to adapt models for downstream tasks. However, a critical yet underexplored question remains: can we utilize wel… ▽ More

    Submitted 21 May, 2025; originally announced June 2025.

  4. arXiv:2506.09091  [pdf, ps, other

    cs.LG cs.IT

    Variational Inference Optimized Using the Curved Geometry of Coupled Free Energy

    Authors: Kenric Nelson, Igor Oliveira, Amenah Al-Najafi, Fode Zhang, Hon Keung Tony Ng

    Abstract: We introduce an optimization framework for variational inference based on the coupled free energy, extending variational inference techniques to account for the curved geometry of the coupled exponential family. This family includes important heavy-tailed distributions such as the generalized Pareto and the Student's t. By leveraging the coupled free energy, which is equal to the coupled evidence… ▽ More

    Submitted 16 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: 13 pages, 2 figures, AGI-25

  5. arXiv:2506.07047  [pdf, ps, other

    cs.AI

    Mathesis: Towards Formal Theorem Proving from Natural Languages

    Authors: Yu Xuejun, Jianyuan Zhong, Zijin Feng, Pengyi Zhai, Roozbeh Yousefzadeh, Wei Chong Ng, Haoxiong Liu, Ziyi Shou, Jing Xiong, Yudong Zhou, Claudia Beth Ong, Austen Jeremy Sugiarto, Yaoxi Zhang, Wai Ming Tai, Huan Cao, Dongcai Lu, Jiacheng Sun, Qiang Xu, Shen Xin, Zhenguo Li

    Abstract: Recent advances in large language models show strong promise for formal reasoning. However, most LLM-based theorem provers have long been constrained by the need for expert-written formal statements as inputs, limiting their applicability to real-world problems expressed in natural language. We tackle this gap with Mathesis, the first end-to-end theorem proving pipeline processing informal problem… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

  6. arXiv:2506.06561  [pdf, ps, other

    cs.CL cs.AI cs.CV

    LaMP-Cap: Personalized Figure Caption Generation With Multimodal Figure Profiles

    Authors: Ho Yin 'Sam' Ng, Ting-Yao Hsu, Aashish Anantha Ramakrishnan, Branislav Kveton, Nedim Lipka, Franck Dernoncourt, Dongwon Lee, Tong Yu, Sungchul Kim, Ryan A. Rossi, Ting-Hao 'Kenneth' Huang

    Abstract: Figure captions are crucial for helping readers understand and remember a figure's key message. Many models have been developed to generate these captions, helping authors compose better quality captions more easily. Yet, authors almost always need to revise generic AI-generated captions to match their writing style and the domain's style, highlighting the need for personalization. Despite languag… ▽ More

    Submitted 17 June, 2025; v1 submitted 6 June, 2025; originally announced June 2025.

    Comments: The LaMP-CAP dataset is publicly available at: https://github.com/Crowd-AI-Lab/lamp-cap

  7. arXiv:2506.04471  [pdf, ps, other

    cs.IT eess.SP

    Polarized 6D Movable Antenna for Wireless Communication: Channel Modeling and Optimization

    Authors: Xiaodan Shao, Qijun Jiang, Derrick Wing Kwan Ng, Naofal Al-Dhahir

    Abstract: In this paper, we propose a novel polarized six-dimensional movable antenna (P-6DMA) to enhance the performance of wireless communication cost-effectively. Specifically, the P-6DMA enables polarforming by adaptively tuning the antenna's polarization electrically as well as controls the antenna's rotation mechanically, thereby exploiting both polarization and spatial diversity to reconfigure wirele… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2505.08070

  8. arXiv:2506.03214  [pdf, ps, other

    q-bio.NC cs.AI cs.CL

    A Pre-trained Framework for Multilingual Brain Decoding Using Non-invasive Recordings

    Authors: Yi Guo, Yihang Dong, Michael Kwok-Po Ng, Shuqiang Wang

    Abstract: Brain-computer interfaces (BCIs) with speech decoding from brain recordings have broad application potential in fields such as clinical rehabilitation and cognitive neuroscience. However, current decoding methods remain limited to single-language, single-subject, and single neuroimaging modality settings, restricting their clinical applicability and generalizability. Here we propose a joint multil… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  9. arXiv:2506.02961  [pdf, ps, other

    cs.CL

    FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models

    Authors: Yan Gao, Massimo Roberto Scamarcia, Javier Fernandez-Marques, Mohammad Naseri, Chong Shen Ng, Dimitris Stripelis, Zexi Li, Tao Shen, Jiamu Bai, Daoyuan Chen, Zikai Zhang, Rui Hu, InSeo Song, Lee KangYoon, Hong Jia, Ting Dang, Junyan Wang, Zheyuan Liu, Daniel Janes Beutel, Lingjuan Lyu, Nicholas D. Lane

    Abstract: Large Language Models (LLMs) have achieved state-of-the-art results across diverse domains, yet their development remains reliant on vast amounts of publicly available data, raising concerns about data scarcity and the lack of access to domain-specific, sensitive information. Federated Learning (FL) presents a compelling framework to address these challenges by enabling decentralized fine-tuning o… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  10. arXiv:2506.02761  [pdf, ps, other

    cs.AI cs.CL cs.CR cs.CV

    Rethinking Machine Unlearning in Image Generation Models

    Authors: Renyang Liu, Wenjie Feng, Tianwei Zhang, Wei Zhou, Xueqi Cheng, See-Kiong Ng

    Abstract: With the surge and widespread application of image generation models, data privacy and content safety have become major concerns and attracted great attention from users, service providers, and policymakers. Machine unlearning (MU) is recognized as a cost-effective and promising means to address these challenges. Despite some advancements, image generation model unlearning (IGMU) still faces remar… ▽ More

    Submitted 6 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

    Comments: Accepted by ACM CCS 2025

    Journal ref: ACM Conference on Computer and Communications Security (CCS 2025)

  11. arXiv:2506.02368  [pdf, ps, other

    cs.IR

    NextQuill: Causal Preference Modeling for Enhancing LLM Personalization

    Authors: Xiaoyan Zhao, Juntao You, Yang Zhang, Wenjie Wang, Hong Cheng, Fuli Feng, See-Kiong Ng, Tat-Seng Chua

    Abstract: Personalizing large language models (LLMs) for individual users has become increasingly important as they are progressively integrated into real-world applications to support users' daily lives. However, existing personalization approaches often fail to distinguish which components of model predictions and training data truly reflect user preferences, leading to superficial personalization alignme… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  12. arXiv:2506.01290  [pdf, other

    cs.LG cs.AI

    TSRating: Rating Quality of Diverse Time Series Data by Meta-learning from LLM Judgment

    Authors: Shunyu Wu, Dan Li, Haozheng Ye, Zhuomin Chen, Jiahui Zhou, Jian Lou, Zibin Zheng, See-Kiong Ng

    Abstract: High-quality time series (TS) data are essential for ensuring TS model performance, rendering research on rating TS data quality indispensable. Existing methods have shown promising rating accuracy within individual domains, primarily by extending data quality rating techniques such as influence functions and Shapley values to account for temporal characteristics. However, they neglect the fact th… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  13. arXiv:2506.00834  [pdf, ps, other

    cs.NI cs.OS

    Söze: One Network Telemetry Is All You Need for Per-flow Weighted Bandwidth Allocation at Scale

    Authors: Weitao Wang, T. S. Eugene Ng

    Abstract: Weighted bandwidth allocation is a powerful abstraction that has a wide range of use cases in modern data center networks. However, realizing highly agile and precise weighted bandwidth allocation for large-scale cloud environments is fundamentally challenging. In this paper, we propose Söze, a lightweight decentralized weighted bandwidth allocation system that leverages simple network telemetry f… ▽ More

    Submitted 5 June, 2025; v1 submitted 1 June, 2025; originally announced June 2025.

  14. arXiv:2506.00352  [pdf, ps, other

    cs.DC cs.AI cs.DB

    Enabling Secure and Ephemeral AI Workloads in Data Mesh Environments

    Authors: Chinkit Patel, Kee Siong Ng

    Abstract: Many large enterprises that operate highly governed and complex ICT environments have no efficient and effective way to support their Data and AI teams in rapidly spinning up and tearing down self-service data and compute infrastructure, to experiment with new data analytic tools, and deploy data products into operational use. This paper proposes a key piece of the solution to the overall problem,… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

    Comments: 52 pages

  15. arXiv:2505.24630  [pdf, ps, other

    cs.CL cs.AI

    The Hallucination Dilemma: Factuality-Aware Reinforcement Learning for Large Reasoning Models

    Authors: Junyi Li, Hwee Tou Ng

    Abstract: Large language models (LLMs) have significantly advanced in reasoning tasks through reinforcement learning (RL) optimization, achieving impressive capabilities across various challenging benchmarks. However, our empirical analysis reveals a critical drawback: reasoning-oriented RL fine-tuning significantly increases the prevalence of hallucinations. We theoretically analyze the RL training dynamic… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  16. arXiv:2505.23387  [pdf, ps, other

    cs.SE cs.AI

    Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

    Authors: Mingzhe Du, Luu Anh Tuan, Yue Liu, Yuhao Qing, Dong Huang, Xinyi He, Qian Liu, Zejun Ma, See-kiong Ng

    Abstract: Large Language Models (LLMs) generate functionally correct solutions but often fall short in code efficiency, a critical bottleneck for real-world deployment. In this paper, we introduce a novel test-time iterative optimization framework to address this, employing a closed-loop system where LLMs iteratively refine code based on empirical performance feedback from an execution sandbox. We explore t… ▽ More

    Submitted 3 June, 2025; v1 submitted 29 May, 2025; originally announced May 2025.

  17. arXiv:2505.23295  [pdf, other

    cs.CL cs.AI cs.LG

    How Does Response Length Affect Long-Form Factuality

    Authors: James Xu Zhao, Jimmy Z. J. Liu, Bryan Hooi, See-Kiong Ng

    Abstract: Large language models (LLMs) are widely used for long-form text generation. However, factual errors in the responses would undermine their reliability. Despite growing attention to LLM factuality, the effect of response length on factuality remains underexplored. In this work, we systematically investigate this relationship by first introducing an automatic and bi-level long-form factuality evalua… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: ACL 2025 Findings. 24 pages, 10 figures, 18 tables. Code available at https://github.com/XuZhao0/length-bias-factuality

  18. arXiv:2505.22967  [pdf, ps, other

    cs.LG cs.MA

    MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming

    Authors: Chengqi Zheng, Jianda Chen, Yueming Lyu, Wen Zheng Terence Ng, Haopeng Zhang, Yew-Soon Ong, Ivor Tsang, Haiyan Yin

    Abstract: Despite the promise of autonomous agentic reasoning, existing workflow generation methods frequently produce fragile, unexecutable plans due to unconstrained LLM-driven construction. We introduce MermaidFlow, a framework that redefines the agentic search space through safety-constrained graph evolution. At its core, MermaidFlow represent workflows as a verifiable intermediate representation using… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  19. arXiv:2505.22683  [pdf, ps, other

    q-bio.NC cs.AI cs.CV

    ConnectomeDiffuser: Generative AI Enables Brain Network Construction from Diffusion Tensor Imaging

    Authors: Xuhang Chen, Michael Kwok-Po Ng, Kim-Fung Tsang, Chi-Man Pun, Shuqiang Wang

    Abstract: Brain network analysis plays a crucial role in diagnosing and monitoring neurodegenerative disorders such as Alzheimer's disease (AD). Existing approaches for constructing structural brain networks from diffusion tensor imaging (DTI) often rely on specialized toolkits that suffer from inherent limitations: operator subjectivity, labor-intensive workflows, and restricted capacity to capture complex… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  20. arXiv:2505.19241  [pdf, ps, other

    cs.LG cs.AI

    ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment

    Authors: Xiaoqiang Lin, Arun Verma, Zhongxiang Dai, Daniela Rus, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The recent success of using human preferences to align large language models (LLMs) has significantly improved their performance in various downstream tasks like question answering, mathematical reasoning, and code generation. However,3 achieving effective LLM alignment depends on high-quality human preference datasets. Collecting these datasets requires human preference annotation, which is costl… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  21. arXiv:2505.17505  [pdf, ps, other

    cs.CL

    L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models

    Authors: Xiaohao Liu, Xiaobo Xia, Weixiang Zhao, Manyi Zhang, Xianzhi Yu, Xiu Su, Shuo Yang, See-Kiong Ng, Tat-Seng Chua

    Abstract: Large language models (LLMs) have achieved notable progress. Despite their success, next-token prediction (NTP), the dominant method for LLM training and inference, is constrained in both contextual coverage and inference efficiency due to its inherently sequential process. To overcome these challenges, we propose leap multi-token prediction~(L-MTP), an innovative token prediction method that exte… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  22. arXiv:2505.17265  [pdf, other

    cs.CL cs.AI

    CaseReportBench: An LLM Benchmark Dataset for Dense Information Extraction in Clinical Case Reports

    Authors: Xiao Yu Cindy Zhang, Carlos R. Ferreira, Francis Rossignol, Raymond T. Ng, Wyeth Wasserman, Jian Zhu

    Abstract: Rare diseases, including Inborn Errors of Metabolism (IEM), pose significant diagnostic challenges. Case reports serve as key but computationally underutilized resources to inform diagnosis. Clinical dense information extraction refers to organizing medical information into structured predefined categories. Large Language Models (LLMs) may enable scalable information extraction from case reports b… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  23. arXiv:2505.15206  [pdf, ps, other

    cs.RO cs.AI

    EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy

    Authors: Chi Kit Ng, Long Bai, Guankun Wang, Yupeng Wang, Huxin Gao, Kun Yuan, Chenhan Jin, Tieyong Zeng, Hongliang Ren

    Abstract: In endoscopic procedures, autonomous tracking of abnormal regions and following circumferential cutting markers can significantly reduce the cognitive burden on endoscopists. However, conventional model-based pipelines are fragile for each component (e.g., detection, motion planning) requires manual tuning and struggles to incorporate high-level endoscopic intent, leading to poor generalization ac… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  24. arXiv:2505.14680  [pdf, ps, other

    cs.IR cs.AI cs.CL cs.HC

    NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search

    Authors: Sunhao Dai, Wenjie Wang, Liang Pang, Jun Xu, See-Kiong Ng, Ji-Rong Wen, Tat-Seng Chua

    Abstract: Generative AI search is reshaping information retrieval by offering end-to-end answers to complex queries, reducing users' reliance on manually browsing and summarizing multiple web pages. However, while this paradigm enhances convenience, it disrupts the feedback-driven improvement loop that has historically powered the evolution of traditional Web search. Web search can continuously improve thei… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: SIGIR 2025 Perspective Paper

  25. arXiv:2505.14405  [pdf, ps, other

    cs.CV

    Investigating and Enhancing the Robustness of Large Multimodal Models Against Temporal Inconsistency

    Authors: Jiafeng Liang, Shixin Jiang, Xuan Dong, Ning Wang, Zheng Chu, Hui Su, Jinlan Fu, Ming Liu, See-Kiong Ng, Bing Qin

    Abstract: Large Multimodal Models (LMMs) have recently demonstrated impressive performance on general video comprehension benchmarks. Nevertheless, for broader applications, the robustness of their temporal analysis capability needs to be thoroughly investigated yet predominantly ignored. Motivated by this, we propose a novel temporal robustness benchmark (TemRobBench), which introduces temporal inconsisten… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  26. arXiv:2505.14163  [pdf, ps, other

    cs.AI

    DSMentor: Enhancing Data Science Agents with Curriculum Learning and Online Knowledge Accumulation

    Authors: He Wang, Alexander Hanbo Li, Yiqun Hu, Sheng Zhang, Hideo Kobayashi, Jiani Zhang, Henry Zhu, Chung-Wei Hang, Patrick Ng

    Abstract: Large language model (LLM) agents have shown promising performance in generating code for solving complex data science problems. Recent studies primarily focus on enhancing in-context learning through improved search, sampling, and planning techniques, while overlooking the importance of the order in which problems are tackled during inference. In this work, we develop a novel inference-time optim… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  27. 6G communications through sub-Terahertz CMOS power amplifiers: Design challenges and trends

    Authors: Jun Yan Lee, Duo Wu, Xuanrui Guo, Jian Ding Tan, Teh Jia Yew, Zi Neng Ng, Mohammad Arif Sobhan Bhuiyan, Mahdi H. Miraz

    Abstract: The fifth-generation (5G) network faces limitations in supporting emerging applications, such as artificial intelligence (AI), virtual reality (VR) and digital twins. To overcome these confines, sub-Terahertz (sub-THz) and Terahertz (THz) technologies are considered to be key enablers of effective 6G wireless communications, offering higher transmission speeds, longer range and wider bandwidth. Ac… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Journal ref: Heliyon, vol. 11, no. 11, May 2025

  28. arXiv:2505.13004  [pdf, ps, other

    cs.CL

    EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code

    Authors: Yuhao Qing, Boyu Zhu, Mingzhe Du, Zhijiang Guo, Terry Yue Zhuo, Qianru Zhang, Jie M. Zhang, Heming Cui, Siu-Ming Yiu, Dong Huang, See-Kiong Ng, Luu Anh Tuan

    Abstract: Existing code generation benchmarks primarily evaluate functional correctness, with limited focus on code efficiency and often restricted to a single language like Python. To address this gap, we introduce EffiBench-X, the first multi-language benchmark designed to measure the efficiency of LLM-generated code. EffiBench-X supports Python, C++, Java, JavaScript, Ruby, and Golang. It comprises compe… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Under Review

  29. arXiv:2505.12605  [pdf, ps, other

    cs.CV

    Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding

    Authors: Thong Nguyen, Zhiyuan Hu, Xu Lin, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Recent years have witnessed outstanding advances of large vision-language models (LVLMs). In order to tackle video understanding, most of them depend upon their implicit temporal understanding capacity. As such, they have not deciphered important components that contribute to temporal understanding ability, which might limit the potential of these LVLMs for video understanding. In this work, we co… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: In Progress

  30. arXiv:2505.11323  [pdf, other

    stat.ML cs.LG

    Convergence Rates of Constrained Expected Improvement

    Authors: Haowei Wang, Jingyi Wang, Zhongxiang Dai, Nai-Yuan Chiang, Szu Hui Ng, Cosmin G. Petra

    Abstract: Constrained Bayesian optimization (CBO) methods have seen significant success in black-box optimization with constraints, and one of the most commonly used CBO methods is the constrained expected improvement (CEI) algorithm. CEI is a natural extension of the expected improvement (EI) when constraints are incorporated. However, the theoretical convergence rate of CEI has not been established. In th… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  31. arXiv:2505.10160  [pdf, other

    stat.ML cs.LG

    One-Stage Top-$k$ Learning-to-Defer: Score-Based Surrogates with Theoretical Guarantees

    Authors: Yannis Montreuil, Axel Carlier, Lai Xing Ng, Wei Tsang Ooi

    Abstract: We introduce the first one-stage Top-$k$ Learning-to-Defer framework, which unifies prediction and deferral by learning a shared score-based model that selects the $k$ most cost-effective entities-labels or experts-per input. While existing one-stage L2D methods are limited to deferring to a single expert, our approach jointly optimizes prediction and deferral across multiple entities through a si… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

  32. arXiv:2505.09930  [pdf, other

    cs.CL

    Rethinking Prompt Optimizers: From Prompt Merits to Optimization

    Authors: Zixiao Zhu, Hanzhang Zhou, Zijian Feng, Tianjiao Li, Chua Jia Jim Deryl, Mak Lee Onn, Gee Wah Ng, Kezhi Mao

    Abstract: Prompt optimization (PO) provides a practical way to improve response quality when users lack the time or expertise to manually craft effective prompts. Existing methods typically rely on advanced, large-scale LLMs like GPT-4 to generate optimized prompts. However, due to limited downward compatibility, verbose, instruction-heavy prompts from advanced LLMs can overwhelm lightweight inference model… ▽ More

    Submitted 20 May, 2025; v1 submitted 14 May, 2025; originally announced May 2025.

    Comments: 21 pages, 14 figures

  33. arXiv:2505.09107  [pdf, other

    astro-ph.IM astro-ph.EP astro-ph.SR cs.DC

    Architecture of Tianyu Software: Relative Photometry as a Case Study

    Authors: Yicheng Rui, Yifan Xuan, Shuyue Zheng, Kexin Li, Kaiming Cui, Kai Xiao, Jie Zheng, Jun Kai Ng, Hongxuan Jiang, Fabo Feng, Qinghui Sun

    Abstract: Tianyu telescope, an one-meter robotic optical survey instrument to be constructed in Lenghu, Qinghai, China, is designed for detecting transiting exoplanets, variable stars and transients. It requires a highly automated, optimally distributed, easily extendable, and highly flexible software to enable the data processing for the raw data at rates exceeding 500MB/s. In this work, we introduce the a… ▽ More

    Submitted 14 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: 18 pages, 10 figures, 6 tables, accepted for publication in PASP

  34. arXiv:2505.07235  [pdf, other

    cs.SD eess.AS

    Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding

    Authors: Dianwen Ng, Kun Zhou, Yi-Wen Chao, Zhiwei Xiong, Bin Ma, Eng Siong Chng

    Abstract: Achieving high-fidelity audio compression while preserving perceptual quality across diverse content remains a key challenge in Neural Audio Coding (NAC). We introduce MUFFIN, a fully convolutional Neural Psychoacoustic Coding (NPC) framework that leverages psychoacoustically guided multi-band frequency reconstruction. At its core is a Multi-Band Spectral Residual Vector Quantization (MBS-RVQ) mod… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  35. arXiv:2505.06900  [pdf, other

    eess.SP cs.IT cs.LG

    Near-Field Channel Estimation for XL-MIMO: A Deep Generative Model Guided by Side Information

    Authors: Zhenzhou Jin, Li You, Derrick Wing Kwan Ng, Xiang-Gen Xia, Xiqi Gao

    Abstract: This paper investigates the near-field (NF) channel estimation (CE) for extremely large-scale multiple-input multiple-output (XL-MIMO) systems. Considering the pronounced NF effects in XL-MIMO communications, we first establish a joint angle-distance (AD) domain-based spherical-wavefront physical channel model that captures the inherent sparsity of XL-MIMO channels. Leveraging the channel's sparsi… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

    Comments: 15 pages, 11 figures, to appear on IEEE Transactions on Cognitive Communications and Networking

  36. arXiv:2505.06256  [pdf, other

    eess.SP cs.AI

    SpectrumFM: A Foundation Model for Intelligent Spectrum Management

    Authors: Fuhui Zhou, Chunyu Liu, Hao Zhang, Wei Wu, Qihui Wu, Derrick Wing Kwan Ng, Tony Q. S. Quek, Chan-Byoung Chae

    Abstract: Intelligent spectrum management is crucial for improving spectrum efficiency and achieving secure utilization of spectrum resources. However, existing intelligent spectrum management methods, typically based on small-scale models, suffer from notable limitations in recognition accuracy, convergence speed, and generalization, particularly in the complex and dynamic spectrum environments. To address… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

  37. arXiv:2505.05225  [pdf, ps, other

    cs.CL

    QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation

    Authors: Mengze Hong, Wailing Ng, Di Jiang, Chen Jason Zhang

    Abstract: The rapid advancement of Chinese large language models (LLMs) underscores the need for domain-specific evaluations to ensure reliable applications. However, existing benchmarks often lack coverage in vertical domains and offer limited insights into the Chinese working context. Leveraging qualification exams as a unified framework for human expertise evaluation, we introduce QualBench, the first mu… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  38. arXiv:2505.05064  [pdf, ps, other

    cs.LG

    WaterDrum: Watermarking for Data-centric Unlearning Metric

    Authors: Xinyang Lu, Xinyuan Niu, Gregory Kang Ruey Lau, Bui Thi Cam Nhung, Rachael Hwee Ling Sim, Fanyu Wen, Chuan-Sheng Foo, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: Large language model (LLM) unlearning is critical in real-world applications where it is necessary to efficiently remove the influence of private, copyrighted, or harmful data from some users. However, existing utility-centric unlearning metrics (based on model utility) may fail to accurately evaluate the extent of unlearning in realistic settings such as when (a) the forget and retain set have se… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  39. arXiv:2505.04968  [pdf, ps, other

    cs.IT eess.SP

    Dynamic Precoding for Near-Field Secure Communications: Implementation and Performance Analysis

    Authors: Zihao Teng, Jiancheng An, Christos Masouros, Hongbin Li, Lu Gan, Derrick Wing Kwan Ng

    Abstract: The increase in antenna apertures and transmission frequencies in next-generation wireless networks is catalyzing advancements in near-field communications (NFC). In this paper, we investigate secure transmission in near-field multi-user multiple-input single-output (MU-MISO) scenarios. Specifically, with the advent of extremely large-scale antenna arrays (ELAA) applied in the NFC regime, the spat… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: 15 pages, 10 figures, 2 tables, accepted by IEEE IoTJ

  40. arXiv:2505.04028  [pdf

    cs.SI physics.soc-ph

    Appeal and Scope of Misinformation Spread by AI Agents and Humans

    Authors: Lynnette Hui Xian Ng, Wenqi Zhou, Kathleen M. Carley

    Abstract: This work examines the influence of misinformation and the role of AI agents, called bots, on social network platforms. To quantify the impact of misinformation, it proposes two new metrics based on attributes of tweet engagement and user network position: Appeal, which measures the popularity of the tweet, and Scope, which measures the potential reach of the tweet. In addition, it analyzes 5.8 mi… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Accepted to AMCIS 2025

  41. arXiv:2505.03573  [pdf, other

    cs.SI cs.DS math.OC

    Troika algorithm: approximate optimization for accurate clique partitioning and clustering of weighted networks

    Authors: Samin Aref, Boris Ng

    Abstract: Clique partitioning is a fundamental network clustering task, with applications in a wide range of computational sciences. It involves identifying an optimal partition of the nodes for a real-valued weighted graph according to the edge weights. An optimal partition is one that maximizes the sum of within-cluster edge weights over all possible node partitions. This paper introduces a novel approxim… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 29 pages, 10 figures, 3 tables

    MSC Class: 90C90; 90C10; 90C57; 90C59; 90C35; 05C15; 65K05 ACM Class: I.2.6; G.2.2

  42. arXiv:2505.00551  [pdf, other

    cs.CL

    100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

    Authors: Chong Zhang, Yue Deng, Xiang Lin, Bin Wang, Dianwen Ng, Hai Ye, Xingxuan Li, Yao Xiao, Zhanfeng Mo, Qi Zhang, Lidong Bing

    Abstract: The recent development of reasoning language models (RLMs) represents a novel evolution in large language models. In particular, the recent release of DeepSeek-R1 has generated widespread social impact and sparked enthusiasm in the research community for exploring the explicit reasoning paradigm of language models. However, the implementation details of the released models have not been fully open… ▽ More

    Submitted 15 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  43. arXiv:2504.21501  [pdf, ps, other

    cs.LG

    Deep Learning Optimization Using Self-Adaptive Weighted Auxiliary Variables

    Authors: Yaru Liu, Yiqi Gu, Michael K. Ng

    Abstract: In this paper, we develop a new optimization framework for the least squares learning problem via fully connected neural networks or physics-informed neural networks. The gradient descent sometimes behaves inefficiently in deep learning because of the high non-convexity of loss functions and the vanishing gradient issue. Our idea is to introduce auxiliary variables to separate the layers of the de… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

    Comments: 32 pages, 11 figures

  44. arXiv:2504.21468  [pdf, other

    cs.CV

    Quaternion Nuclear Norms Over Frobenius Norms Minimization for Robust Matrix Completion

    Authors: Yu Guo, Guoqing Chen, Tieyong Zeng, Qiyu Jin, Michael Kwok-Po Ng

    Abstract: Recovering hidden structures from incomplete or noisy data remains a pervasive challenge across many fields, particularly where multi-dimensional data representation is essential. Quaternion matrices, with their ability to naturally model multi-dimensional data, offer a promising framework for this problem. This paper introduces the quaternion nuclear norm over the Frobenius norm (QNOF) as a novel… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

    MSC Class: 65F35; 90C30; 94A08; 68U10

  45. arXiv:2504.21444  [pdf, other

    cs.NI

    A Unified QoS-Aware Multiplexing Framework for Next Generation Immersive Communication with Legacy Wireless Applications

    Authors: Jihong Li, Shunqing Zhang, Tao Yu, Guangjin Pan, Kaixuan Huang, Xiaojing Chen, Yanzan Sun, Junyu Liu, Jiandong Li, Derrick Wing Kwan Ng

    Abstract: Immersive communication, including emerging augmented reality, virtual reality, and holographic telepresence, has been identified as a key service for enabling next-generation wireless applications. To align with legacy wireless applications, such as enhanced mobile broadband or ultra-reliable low-latency communication, network slicing has been widely adopted. However, attempting to statistically… ▽ More

    Submitted 2 May, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

  46. arXiv:2504.21191  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare

    Authors: Lovedeep Gondara, Jonathan Simkin, Graham Sayle, Shebnum Devji, Gregory Arbour, Raymond Ng

    Abstract: This study aims to guide language model selection by investigating: 1) the necessity of finetuning versus zero-shot usage, 2) the benefits of domain-adjacent versus generic pretrained models, 3) the value of further domain-specific pretraining, and 4) the continued relevance of Small Language Models (SLMs) compared to Large Language Models (LLMs) for specific tasks. Using electronic pathology repo… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  47. arXiv:2504.20754  [pdf, other

    cs.LG

    DDPS: Discrete Diffusion Posterior Sampling for Paths in Layered Graphs

    Authors: Hao Luan, See-Kiong Ng, Chun Kai Ling

    Abstract: Diffusion models form an important class of generative models today, accounting for much of the state of the art in cutting edge AI research. While numerous extensions beyond image and video generation exist, few of such approaches address the issue of explicit constraints in the samples generated. In this paper, we study the problem of generating paths in a layered graph (a variant of a directed… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: To appear at Frontiers in Probabilistic Inference: Sampling meets Learning (FPI) workshop at ICLR 2025. https://openreview.net/forum?id=DBdkU0Ikzy

  48. arXiv:2504.18057  [pdf, other

    cs.RO cs.AI

    Opportunistic Collaborative Planning with Large Vision Model Guided Control and Joint Query-Service Optimization

    Authors: Jiayi Chen, Shuai Wang, Guoliang Li, Wei Xu, Guangxu Zhu, Derrick Wing Kwan Ng, Chengzhong Xu

    Abstract: Navigating autonomous vehicles in open scenarios is a challenge due to the difficulties in handling unseen objects. Existing solutions either rely on small models that struggle with generalization or large models that are resource-intensive. While collaboration between the two offers a promising solution, the key challenge is deciding when and how to engage the large model. To address this issue,… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

  49. arXiv:2504.17539  [pdf, other

    cs.CR cs.AI

    Proof of Useful Intelligence (PoUI): Blockchain Consensus Beyond Energy Waste

    Authors: Zan-Kai Chong, Hiroyuki Ohsaki, Bryan Ng

    Abstract: Blockchain technology enables secure, transparent data management in decentralized systems, supporting applications from cryptocurrencies like Bitcoin to tokenizing real-world assets like property. Its scalability and sustainability hinge on consensus mechanisms balancing security and efficiency. Proof of Work (PoW), used by Bitcoin, ensures security through energy-intensive computations but deman… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  50. arXiv:2504.16073  [pdf, other

    cs.CL

    Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation

    Authors: Zhiyuan Hu, Shiyun Xiong, Yifan Zhang, See-Kiong Ng, Anh Tuan Luu, Bo An, Shuicheng Yan, Bryan Hooi

    Abstract: Recent advancements in visual language models (VLMs) have notably enhanced their capabilities in handling complex Graphical User Interface (GUI) interaction tasks. Despite these improvements, current frameworks often struggle to generate correct actions in challenging GUI environments. State-of-the-art commercial VLMs are black-boxes, and fine-tuning open-source VLMs for GUI tasks requires signifi… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.