Skip to main content

Showing 1–13 of 13 results for author: Kwok, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.05012  [pdf, ps, other

    cs.RO physics.comp-ph physics.flu-dyn

    A Unified Framework for Simulating Strongly-Coupled Fluid-Robot Multiphysics

    Authors: Jeong Hun Lee, Junzhe Hu, Sofia Kwok, Carmel Majidi, Zachary Manchester

    Abstract: We present a framework for simulating fluid-robot multiphysics as a single, unified optimization problem. The coupled manipulator and incompressible Navier-Stokes equations governing the robot and fluid dynamics are derived together from a single Lagrangian using the principal of least action. We then employ discrete variational mechanics to derive a stable, implicit time-integration scheme for jo… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2504.14493  [pdf, ps, other

    cs.IR cs.AI cs.LG

    FinSage: A Multi-aspect RAG System for Financial Filings Question Answering

    Authors: Xinyu Wang, Jijun Chi, Zhenghan Tai, Tung Sum Thomas Kwok, Muzhi Li, Zhuhong Li, Hailin He, Yuchen Hua, Peng Lu, Suyuchen Wang, Yihong Wu, Jerry Huang, Jingrui Tian, Fengran Mo, Yufei Cui, Ling Zhou

    Abstract: Leveraging large language models in real-world settings often entails a need to utilize domain-specific data and tools in order to follow the complex regulations that need to be followed for acceptable use. Within financial sectors, modern enterprises increasingly rely on Retrieval-Augmented Generation (RAG) systems to address complex compliance requirements in financial document workflows. Howeve… ▽ More

    Submitted 6 June, 2025; v1 submitted 20 April, 2025; originally announced April 2025.

  3. arXiv:2503.15564  [pdf, other

    cs.LG

    GReaTER: Generate Realistic Tabular data after data Enhancement and Reduction

    Authors: Tung Sum Thomas Kwok, Chi-Hua Wang, Guang Cheng

    Abstract: Tabular data synthesis involves not only multi-table synthesis but also generating multi-modal data (e.g., strings and categories), which enables diverse knowledge synthesis. However, separating numerical and categorical data has limited the effectiveness of tabular data generation. The GReaT (Generate Realistic Tabular Data) framework uses Large Language Models (LLMs) to encode entire rows, elimi… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: Accepted by Data Engineering Meets Large Language Models: Challenges and Opportunities Workshop@ICDE2025 Workshop at ICDE 2025

  4. arXiv:2502.20889  [pdf, other

    cs.DS

    A Faster Algorithm for Maximum Weight Matching on Unrestricted Bipartite Graphs

    Authors: Shawxing Kwok

    Abstract: Given a weighted bipartite graph $G = (L, R, E, w)$, the maximum weight matching (MWM) problem seeks to find a matching $M \subseteq E$ that maximizes the total weight $\sum_{e \in M} w(e)$. This paper presents a novel algorithm with a time complexity of $O(\min(X^3 + E, XE + X^2\log X))$, where $X = \min(|L|, |R|)$. Unlike many existing algorithms, our approach supports real-valued weights with… ▽ More

    Submitted 4 April, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

  5. arXiv:2411.00879  [pdf, other

    cs.DB cs.LG

    DEREC-SIMPRO: unlock Language Model benefits to advance Synthesis in Data Clean Room

    Authors: Tung Sum Thomas Kwok, Chi-hua Wang, Guang Cheng

    Abstract: Data collaboration via Data Clean Room offers value but raises privacy concerns, which can be addressed through synthetic data and multi-table synthesizers. Common multi-table synthesizers fail to perform when subjects occur repeatedly in both tables. This is an urgent yet unresolved problem, since having both tables with repeating subjects is common. To improve performance in this scenario, we pr… ▽ More

    Submitted 31 October, 2024; originally announced November 2024.

  6. arXiv:2409.10469  [pdf, other

    cs.RO

    Real-Time Whole-Body Control of Legged Robots with Model-Predictive Path Integral Control

    Authors: Juan Alvarez-Padilla, John Z. Zhang, Sofia Kwok, John M. Dolan, Zachary Manchester

    Abstract: This paper presents a system for enabling real-time synthesis of whole-body locomotion and manipulation policies for real-world legged robots. Motivated by recent advancements in robot simulation, we leverage the efficient parallelization capabilities of the MuJoCo simulator to achieve fast sampling over the robot state and action trajectories. Our results show surprisingly effective real-world lo… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: Under review. Code and videos are available on our website: https://whole-body-mppi.github.io/

  7. arXiv:2406.10173  [pdf, other

    cs.CL

    IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce

    Authors: Wenxuan Ding, Weiqi Wang, Sze Heng Douglas Kwok, Minghao Liu, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Junxian He, Yangqiu Song

    Abstract: Enhancing Language Models' (LMs) ability to understand purchase intentions in E-commerce scenarios is crucial for their effective assistance in various downstream tasks. However, previous approaches that distill intentions from LMs often fail to generate meaningful and human-centric intentions applicable in real-world E-commerce contexts. This raises concerns about the true comprehension and utili… ▽ More

    Submitted 29 September, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: Findings of EMNLP 2024

  8. arXiv:2310.17769  [pdf, other

    cs.CL cs.AI

    Social Contract AI: Aligning AI Assistants with Implicit Group Norms

    Authors: Jan-Philipp Fränken, Sam Kwok, Peixuan Ye, Kanishk Gandhi, Dilip Arumugam, Jared Moore, Alex Tamkin, Tobias Gerstenberg, Noah D. Goodman

    Abstract: We explore the idea of aligning an AI assistant by inverting a model of users' (unknown) preferences from observed interactions. To validate our proposal, we run proof-of-concept simulations in the economic ultimatum game, formalizing user preferences as policies that guide the actions of simulated players. We find that the AI assistant accurately aligns its behavior to match standard policies fro… ▽ More

    Submitted 3 December, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: SoLaR NeurIPS 2023 Workshop (https://solar-neurips.github.io/)

  9. arXiv:1908.07307  [pdf, other

    cs.LG eess.SP stat.ML

    Investigation of wind pressures on tall building under interference effects using machine learning techniques

    Authors: Gang Hu, Lingbo Liu, Dacheng Tao, Jie Song, K. C. S. Kwok

    Abstract: Interference effects of tall buildings have attracted numerous studies due to the boom of clusters of tall buildings in megacities. To fully understand the interference effects of buildings, it often requires a substantial amount of wind tunnel tests. Limited wind tunnel tests that only cover part of interference scenarios are unable to fully reveal the interference effects. This study used machin… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: 15 pages, 14 figures

  10. arXiv:1901.06752  [pdf

    cs.LG physics.flu-dyn stat.ML

    Predicting wind pressures around circular cylinders using machine learning techniques

    Authors: Gang Hu, K. C. S. Kwok

    Abstract: Numerous studies have been carried out to measure wind pressures around circular cylinders since the early 20th century due to its engineering significance. Consequently, a large amount of wind pressure data sets have accumulated, which presents an excellent opportunity for using machine learning (ML) techniques to train models to predict wind pressures around circular cylinders. Wind pressures ar… ▽ More

    Submitted 20 January, 2019; originally announced January 2019.

  11. BACH: Grand Challenge on Breast Cancer Histology Images

    Authors: Guilherme Aresta, Teresa Araújo, Scotty Kwok, Sai Saketh Chennamsetty, Mohammed Safwan, Varghese Alex, Bahram Marami, Marcel Prastawa, Monica Chan, Michael Donovan, Gerardo Fernandez, Jack Zeineh, Matthias Kohl, Christoph Walz, Florian Ludwig, Stefan Braunewell, Maximilian Baust, Quoc Dang Vu, Minh Nguyen Nhat To, Eal Kim, Jin Tae Kwak, Sameh Galal, Veronica Sanchez-Freire, Nadia Brancati, Maria Frucci , et al. (11 additional authors not shown)

    Abstract: Breast cancer is the most common invasive cancer in women, affecting more than 10% of women worldwide. Microscopic analysis of a biopsy remains one of the most important methods to diagnose the type of breast cancer. This requires specialized analysis by pathologists, in a task that i) is highly time- and cost-consuming and ii) often leads to nonconsensual results. The relevance and potential of a… ▽ More

    Submitted 17 June, 2019; v1 submitted 13 August, 2018; originally announced August 2018.

    Comments: Accepted for publication at Medical Image Analysis (Elsevier). Publication licensed under the Creative Commons CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/

    Journal ref: Medical Image Analysis, 2019

  12. arXiv:1304.2363  [pdf

    cs.LG cs.AI stat.ML

    Multiple decision trees

    Authors: Suk Wah Kwok, Chris Carter

    Abstract: This paper describes experiments, on two domains, to investigate the effect of averaging over predictions of multiple decision trees, instead of using a single tree. Other authors have pointed out theoretical and commonsense reasons for preferring the multiple tree approach. Ideally, we would like to consider predictions from all trees, weighted by their probability. However, there is a vast nu… ▽ More

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Fourth Conference on Uncertainty in Artificial Intelligence (UAI1988)

    Report number: UAI-P-1988-PG-213-220

  13. arXiv:cs/0609158  [pdf

    cs.CR cs.MM

    A Fast Image Encryption Scheme based on Chaotic Standard Map

    Authors: Kwok-Wo Wong, Bernie Sin-Hung Kwok, Wing-Shing Law

    Abstract: In recent years, a variety of effective chaos-based image encryption schemes have been proposed. The typical structure of these schemes has the permutation and the diffusion stages performed alternatively. The confusion and diffusion effect is solely contributed by the permutation and the diffusion stage, respectively. As a result, more overall rounds than necessary are required to achieve a cer… ▽ More

    Submitted 29 September, 2006; originally announced September 2006.

    Comments: 16 pages, 7 figures