Skip to main content

Showing 1–32 of 32 results for author: Yeo, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.15504  [pdf, ps, other

    cs.CV cs.AI

    Beyond Linearity: Squeeze-and-Recalibrate Blocks for Few-Shot Whole Slide Image Classification

    Authors: Conghao Xiong, Zhengrui Guo, Zhe Xu, Yifei Zhang, Raymond Kai-Yu Tong, Si Yong Yeo, Hao Chen, Joseph J. Y. Sung, Irwin King

    Abstract: Deep learning has advanced computational pathology but expert annotations remain scarce. Few-shot learning mitigates annotation burdens yet suffers from overfitting and discriminative feature mischaracterization. In addition, the current few-shot multiple instance learning (MIL) approaches leverage pretrained vision-language models to alleviate these issues, but at the cost of complex preprocessin… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2504.07603  [pdf, other

    cs.CV cs.AI

    RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions

    Authors: Youngwan Jin, Michal Kovac, Yagiz Nalcakan, Hyeongjin Ju, Hanbin Song, Sanghyeop Yeo, Shiho Kim

    Abstract: Current autonomous driving algorithms heavily rely on the visible spectrum, which is prone to performance degradation in adverse conditions like fog, rain, snow, glare, and high contrast. Although other spectral bands like near-infrared (NIR) and long-wave infrared (LWIR) can enhance vision perception in such situations, they have limitations and lack large-scale datasets and benchmarks. Short-wav… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

  3. arXiv:2504.06485  [pdf, other

    cs.SI

    Cooperative Dilemmas in Rational Debate

    Authors: Toby Handfield, Julián Garcia, Christian Hilbe, Shang Long Yeo

    Abstract: As an epistemic activity, rational debate and discussion requires cooperation, yet involves a tension between collective and individual interests. While all participants benefit from collective outcomes like reaching consensus on true beliefs, individuals face personal costs when changing their minds. This creates an incentive for each debater to let others bear the cognitive burden of exploring a… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 20 pages, plus 19 pages of Supplementary material. 9 Figures

  4. arXiv:2503.17069  [pdf, ps, other

    cs.CV cs.AI

    PVChat: Personalized Video Chat with One-Shot Learning

    Authors: Yufei Shi, Weilong Yan, Gang Xu, Yumeng Li, Yucheng Chen, Zhenxi Li, Fei Richard Yu, Ming Li, Si Yong Yeo

    Abstract: Video large language models (ViLLMs) excel in general video understanding, e.g., recognizing activities like talking and eating, but struggle with identity-aware comprehension, such as "Wilson is receiving chemotherapy" or "Tom is discussing with Sarah", limiting their applicability in smart healthcare and smart home environments. To address this limitation, we propose a one-shot learning framewor… ▽ More

    Submitted 8 July, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

  5. arXiv:2503.06565  [pdf, other

    cs.CV

    Future-Aware Interaction Network For Motion Forecasting

    Authors: Shijie Li, Xun Xu, Si Yong Yeo, Xulei Yang

    Abstract: Motion forecasting is a crucial component of autonomous driving systems, enabling the generation of accurate and smooth future trajectories to ensure safe navigation to the destination. In previous methods, potential future trajectories are often absent in the scene encoding stage, which may lead to suboptimal outcomes. Additionally, prior approaches typically employ transformer architectures for… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

  6. arXiv:2503.01019  [pdf, other

    cs.CV cs.AI

    MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations

    Authors: Ziyang Zhang, Yang Yu, Yucheng Chen, Xulei Yang, Si Yong Yeo

    Abstract: Despite significant progress in Vision-Language Pre-training (VLP), current approaches predominantly emphasize feature extraction and cross-modal comprehension, with limited attention to generating or transforming visual content. This gap hinders the model's ability to synthesize coherent and novel visual representations from textual prompts, thereby reducing the effectiveness of multi-modal learn… ▽ More

    Submitted 20 April, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

    Comments: To be pubilshed in CVPR 2025

  7. Enhancing Deliberativeness: Evaluating the Impact of Multimodal Reflection Nudges

    Authors: ShunYi Yeo, Zhuoqun Jiang, Anthony Tang, Simon Tangi Perrault

    Abstract: Nudging participants with text-based reflective nudges enhances deliberation quality on online deliberation platforms. The effectiveness of multimodal reflective nudges, however, remains largely unexplored. Given the multi-sensory nature of human perception, incorporating diverse modalities into self-reflection mechanisms has the potential to better support various reflective styles. This paper ex… ▽ More

    Submitted 7 February, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

    Comments: CHI 2025

  8. arXiv:2501.13978  [pdf, other

    cs.CL cs.AI cs.SE

    Chain of Grounded Objectives: Bridging Process and Goal-oriented Prompting for Code Generation

    Authors: Sangyeop Yeo, Seung-won Hwang, Yu-Seung Ma

    Abstract: The use of Large Language Models (LLMs) for code generation has gained significant attention in recent years. Existing methods often aim to improve the quality of generated code by incorporating additional contextual information or guidance into input prompts. Many of these approaches adopt sequential reasoning strategies, mimicking human-like step-by-step thinking. However, such strategies may co… ▽ More

    Submitted 28 May, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

    Comments: Accepted by ECOOP 2025 main conference

  9. arXiv:2501.00775  [pdf, other

    cs.HC

    MindCoder: Automated and Controllable Reasoning Chain in Qualitative Analysis

    Authors: Jie Gao, Zhiyao Shu, Shun Yi Yeo

    Abstract: Extracting insights from qualitative analysis involves a series of reasoning steps, such as open coding, grouping, and identifying themes. We introduce the MindCoder reasoning chain, built on Chain-of-Thought (CoT) prompting, to support the insight extraction process step by step-including topic clustering, code labeling, conceptualization, and reporting. We designed the MindCoder web application… ▽ More

    Submitted 16 April, 2025; v1 submitted 1 January, 2025; originally announced January 2025.

    Comments: 17 pages for main content, 3 pages for references, 10 pages for appendix

  10. arXiv:2410.15761  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Optimal Query Allocation in Extractive QA with LLMs: A Learning-to-Defer Framework with Theoretical Guarantees

    Authors: Yannis Montreuil, Shu Heng Yeo, Axel Carlier, Lai Xing Ng, Wei Tsang Ooi

    Abstract: Large Language Models excel in generative tasks but exhibit inefficiencies in structured text selection, particularly in extractive question answering. This challenge is magnified in resource-constrained environments, where deploying multiple specialized models for different tasks is impractical. We propose a Learning-to-Defer framework that allocates queries to specialized experts, ensuring high-… ▽ More

    Submitted 18 February, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: 25 pages, 17 main paper

  11. arXiv:2410.15729  [pdf, ps, other

    stat.ML cs.HC cs.LG

    A Two-Stage Learning-to-Defer Approach for Multi-Task Learning

    Authors: Yannis Montreuil, Shu Heng Yeo, Axel Carlier, Lai Xing Ng, Wei Tsang Ooi

    Abstract: The Two-Stage Learning-to-Defer (L2D) framework has been extensively studied for classification and, more recently, regression tasks. However, many real-world applications require solving both tasks jointly in a multi-task setting. We introduce a novel Two-Stage L2D framework for multi-task learning that integrates classification and regression through a unified deferral mechanism. Our method leve… ▽ More

    Submitted 23 May, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: 32 pages, 17 main paper

  12. arXiv:2409.11964  [pdf, other

    cs.SD cs.LG eess.AS

    Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction

    Authors: Jin Jie Sean Yeo, Ee-Leng Tan, Jisheng Bai, Santi Peksi, Woon-Seng Gan

    Abstract: In this technical report, we describe the SNTL-NTU team's submission for Task 1 Data-Efficient Low-Complexity Acoustic Scene Classification of the detection and classification of acoustic scenes and events (DCASE) 2024 challenge. Three systems are introduced to tackle training splits of different sizes. For small training splits, we explored reducing the complexity of the provided baseline model b… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

    Comments: 5 pages, 3 figures

  13. arXiv:2408.09084  [pdf, other

    cs.HC

    Not Too Long, Not Too Short: Goldilocks Principle of 'Optimal' Reflection Time on Online Deliberation Platforms

    Authors: ShunYi Yeo, Simon Tangi Perrault

    Abstract: The deliberative potential of online platforms has been widely examined but the impact of reflection time on the quality of deliberation remains under-explored. This paper presents two user studies involving 100 and 72 participants respectively, to investigate the impact of reflection time on the quality of deliberation in minute-scale deliberations. In the first study, we identified an optimal re… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  14. arXiv:2407.14230  [pdf, other

    cs.CV cs.LG

    ETSCL: An Evidence Theory-Based Supervised Contrastive Learning Framework for Multi-modal Glaucoma Grading

    Authors: Zhiyuan Yang, Bo Zhang, Yufei Shi, Ningze Zhong, Johnathan Loh, Huihui Fang, Yanwu Xu, Si Yong Yeo

    Abstract: Glaucoma is one of the leading causes of vision impairment. Digital imaging techniques, such as color fundus photography (CFP) and optical coherence tomography (OCT), provide quantitative and noninvasive methods for glaucoma diagnosis. Recently, in the field of computer-aided glaucoma diagnosis, multi-modality methods that integrate the CFP and OCT modalities have achieved greater diagnostic accur… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: Accepted by Ophthalmic Medical Image Analysis Workshop at MICCAI'24

  15. arXiv:2405.11614  [pdf, other

    cs.CV eess.IV

    Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation

    Authors: Sangyeop Yeo, Yoojin Jang, Jaejun Yoo

    Abstract: In this paper, we address the challenge of compressing generative adversarial networks (GANs) for deployment in resource-constrained environments by proposing two novel methodologies: Distribution Matching for Efficient compression (DiME) and Network Interactive Compression via Knowledge Exchange and Learning (NICKEL). DiME employs foundation models as embedding kernels for efficient distribution… ▽ More

    Submitted 4 September, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

  16. Help Me Reflect: Leveraging Self-Reflection Interface Nudges to Enhance Deliberativeness on Online Deliberation Platforms

    Authors: Shun Yi Yeo, Gionnieve Lim, Jie Gao, Weiyu Zhang, Simon Tangi Perrault

    Abstract: The deliberative potential of online platforms has been widely examined. However, little is known about how various interface-based reflection nudges impact the quality of deliberation. This paper presents two user studies with 12 and 120 participants, respectively, to investigate the impacts of different reflective nudges on the quality of deliberation. In the first study, we examined five distin… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  17. arXiv:2309.17143  [pdf, other

    cs.CV cs.AI

    Revisiting Cephalometric Landmark Detection from the view of Human Pose Estimation with Lightweight Super-Resolution Head

    Authors: Qian Wu, Si Yong Yeo, Yufei Chen, Jun Liu

    Abstract: Accurate localization of cephalometric landmarks holds great importance in the fields of orthodontics and orthognathics due to its potential for automating key point labeling. In the context of landmark detection, particularly in cephalometrics, it has been observed that existing methods often lack standardized pipelines and well-designed bias reduction processes, which significantly impact their… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  18. arXiv:2309.13858  [pdf, other

    cs.HC

    Impact of Human-AI Interaction on User Trust and Reliance in AI-Assisted Qualitative Coding

    Authors: Jie Gao, Junming Cao, ShunYi Yeo, Kenny Tsu Wei Choo, Zheng Zhang, Toby Jia-Jun Li, Shengdong Zhao, Simon Tangi Perrault

    Abstract: While AI shows promise for enhancing the efficiency of qualitative analysis, the unique human-AI interaction resulting from varied coding strategies makes it challenging to develop a trustworthy AI-assisted qualitative coding system (AIQCs) that supports coding tasks effectively. We bridge this gap by exploring the impact of varying coding strategies on user trust and reliance on AI. We conducted… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: 27 pages with references, 9 figures, 5 tables

  19. arXiv:2303.07657  [pdf, other

    cs.HC

    Code Will Tell: Visual Identification of Ponzi Schemes on Ethereum

    Authors: Xiaolin Wen, Kim Siang Yeo, Yong Wang, Ling Cheng, Feida Zhu, Min Zhu

    Abstract: Ethereum has become a popular blockchain with smart contracts for investors nowadays. Due to the decentralization and anonymity of Ethereum, Ponzi schemes have been easily deployed and caused significant losses to investors. However, there are still no explainable and effective methods to help investors easily identify Ponzi schemes and validate whether a smart contract is actually a Ponzi scheme.… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  20. arXiv:2212.08311  [pdf, other

    cs.CV cs.LG

    Can We Find Strong Lottery Tickets in Generative Models?

    Authors: Sangyeop Yeo, Yoojin Jang, Jy-yong Sohn, Dongyoon Han, Jaejun Yoo

    Abstract: Yes. In this paper, we investigate strong lottery tickets in generative models, the subnetworks that achieve good generative performance without any weight update. Neural network pruning is considered the main cornerstone of model compression for reducing the costs of computation and memory. Unfortunately, pruning a generative model has not been extensively explored, and all existing pruning algor… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  21. arXiv:2204.08129  [pdf, other

    cs.CV

    Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding

    Authors: Xun Long Ng, Kian Eng Ong, Qichen Zheng, Yun Ni, Si Yong Yeo, Jun Liu

    Abstract: Understanding animals' behaviors is significant for a wide range of applications. However, existing animal behavior datasets have limitations in multiple aspects, including limited numbers of animal classes, data samples and provided tasks, and also limited variations in environmental conditions and viewpoints. To address these limitations, we create a large and diverse dataset, Animal Kingdom, th… ▽ More

    Submitted 3 June, 2022; v1 submitted 17 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR2022 (Oral). Dataset: https://sutdcv.github.io/Animal-Kingdom

  22. Differentiable Simulation of Inertial Musculotendons

    Authors: Ying Wang, Jasper Verheul, Sang-Hoon Yeo, Nima Khademi Kalantari, Shinjiro Sueda

    Abstract: We propose a simple and practical approach for incorporating the effects of muscle inertia, which has been ignored by previous musculoskeletal simulators in both graphics and biomechanics. We approximate the inertia of the muscle by assuming that muscle mass is distributed along the centerline of the muscle. We express the motion of the musculotendons in terms of the motion of the skeletal joints… ▽ More

    Submitted 22 September, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Journal ref: ACM Transactions on Graphics (SIGGRAPH Asia), 41 (6) 272:1-272:11, 2022

  23. arXiv:2012.15198  [pdf, other

    cs.LG cs.DC

    Crossover-SGD: A gossip-based communication in distributed deep learning for alleviating large mini-batch problem and enhancing scalability

    Authors: Sangho Yeo, Minho Bae, Minjoong Jeong, Oh-kyoung Kwon, Sangyoon Oh

    Abstract: Distributed deep learning is an effective way to reduce the training time of deep learning for large datasets as well as complex models. However, the limited scalability caused by network overheads makes it difficult to synchronize the parameters of all workers. To resolve this problem, gossip-based methods that demonstrates stable scalability regardless of the number of workers have been proposed… ▽ More

    Submitted 17 October, 2022; v1 submitted 30 December, 2020; originally announced December 2020.

    Comments: Under review as a journal paper at CCPE

  24. arXiv:2002.07767  [pdf, other

    cs.CL

    Learning by Semantic Similarity Makes Abstractive Summarization Better

    Authors: Wonjin Yoon, Yoon Sun Yeo, Minbyul Jeong, Bong-Jun Yi, Jaewoo Kang

    Abstract: By harnessing pre-trained language models, summarization models had rapid progress recently. However, the models are mainly assessed by automatic evaluation metrics such as ROUGE. Although ROUGE is known for having a positive correlation with human evaluation scores, it has been criticized for its vulnerability and the gap between actual qualities. In this paper, we compare the generated summaries… ▽ More

    Submitted 2 June, 2021; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: The initial version of the manuscript includes a model design (semsim), experimental results, and discussions on the results. We found that our model has flaws in its implementation and design. This final version of the manuscript is from the rest of the initial paper; we included our findings on the benchmark dataset, BART generated results and human evaluations, and we excluded our model semsim

  25. arXiv:1908.05267  [pdf, other

    cs.CL

    Towards Debiasing Fact Verification Models

    Authors: Tal Schuster, Darsh J Shah, Yun Jie Serene Yeo, Daniel Filizzola, Enrico Santus, Regina Barzilay

    Abstract: Fact verification requires validating a claim in the context of evidence. We show, however, that in the popular FEVER dataset this might not necessarily be the case. Claim-only classifiers perform competitively with top evidence-aware models. In this paper, we investigate the cause of this phenomenon, identifying strong cues for predicting labels solely based on the claim, without considering any… ▽ More

    Submitted 30 August, 2019; v1 submitted 14 August, 2019; originally announced August 2019.

    Comments: EMNLP IJCNLP 2019

  26. arXiv:1710.02265  [pdf, ps, other

    cs.CR

    On the Closest Vector Problem for Lattices Constructed from Polynomials and Their Cryptographic Applications

    Authors: Zhe Li, San Ling, Chaoping Xing, Sze Ling Yeo

    Abstract: In this paper, we propose new classes of trapdoor functions to solve the closest vector problem in lattices. Specifically, we construct lattices based on properties of polynomials for which the closest vector problem is hard to solve unless some trapdoor information is revealed. We thoroughly analyze the security of our proposed functions using state-of-the-art attacks and results on lattice reduc… ▽ More

    Submitted 5 October, 2017; originally announced October 2017.

    Comments: 20 pages

    MSC Class: 11T71 ACM Class: E.3

  27. arXiv:1703.01025  [pdf

    cs.CV

    A Novel Multi-task Deep Learning Model for Skin Lesion Segmentation and Classification

    Authors: Xulei Yang, Zeng Zeng, Si Yong Yeo, Colin Tan, Hong Liang Tey, Yi Su

    Abstract: In this study, a multi-task deep neural network is proposed for skin lesion analysis. The proposed multi-task learning model solves different tasks (e.g., lesion segmentation and two independent binary lesion classifications) at the same time by exploiting commonalities and differences across tasks. This results in improved learning efficiency and potential prediction accuracy for the task-specifi… ▽ More

    Submitted 2 March, 2017; originally announced March 2017.

    Comments: Submission to support ISIC 2017 challenge results

  28. arXiv:1505.02532  [pdf, ps, other

    math.AC cs.SC

    On the last fall degree of zero-dimensional Weil descent systems

    Authors: Ming-Deh A. Huang, Michiel Kosters, Yun Yang, Sze Ling Yeo

    Abstract: In this article we will discuss a new, mostly theoretical, method for solving (zero-dimensional) polynomial systems, which lies in between Gröbner basis computations and the heuristic first fall degree assumption and is not based on any heuristic. This method relies on the new concept of last fall degree. Let $k$ be a finite field of cardinality $q^n$ and let $k'$ be its subfield of cardinality… ▽ More

    Submitted 17 June, 2015; v1 submitted 11 May, 2015; originally announced May 2015.

    Comments: 16 pages, changed definition of tau and revised Section 5

    MSC Class: 13P10; 13P15

  29. New Constant-Weight Codes from Propagation Rules

    Authors: Yeow Meng Chee, Chaoping Xing, Sze Ling Yeo

    Abstract: This paper proposes some simple propagation rules which give rise to new binary constant-weight codes.

    Submitted 9 August, 2010; originally announced August 2010.

    Comments: 4 pages

    Journal ref: IEEE Transactions on Information Theory, vol. 56, no. 4, pp. 1596-1599, 2010

  30. arXiv:0909.1146  [pdf, other

    cs.DC

    Energy-Efficient Scheduling of HPC Applications in Cloud Computing Environments

    Authors: Saurabh Kumar Garg, Chee Shin Yeo, Arun Anandasivam, Rajkumar Buyya

    Abstract: The use of High Performance Computing (HPC) in commercial and consumer IT applications is becoming popular. They need the ability to gain rapid and scalable access to high-end computing capabilities. Cloud computing promises to deliver such a computing infrastructure using data centers so that HPC users can access applications and data from a Cloud anywhere in the world on demand and pay based o… ▽ More

    Submitted 7 September, 2009; originally announced September 2009.

  31. Market-Oriented Cloud Computing: Vision, Hype, and Reality for Delivering IT Services as Computing Utilities

    Authors: Rajkumar Buyya, Chee Shin Yeo, Srikumar Venugopal

    Abstract: This keynote paper: presents a 21st century vision of computing; identifies various computing paradigms promising to deliver the vision of computing utilities; defines Cloud computing and provides the architecture for creating market-oriented Clouds by leveraging technologies such as VMs; provides thoughts on market-based resource management strategies that encompass both customer-driven service… ▽ More

    Submitted 26 August, 2008; originally announced August 2008.

    Comments: 9 pages; GRIDS Lab Technical Report, Aug 2008

    ACM Class: C.2.4

    Journal ref: Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications (HPCC-08, IEEE CS Press, Los Alamitos, CA, USA), Sept. 25-27, 2008, Dalian, China

  32. arXiv:cs/0605056  [pdf

    cs.DC

    Utility Computing and Global Grids

    Authors: Chee Shin Yeo, Marcos Dias de Assuncao, Jia Yu, Anthony Sulistio, Srikumar Venugopal, Martin Placek, Rajkumar Buyya

    Abstract: This chapter focuses on the use of Grid technologies to achieve utility computing. An overview of how Grids can support utility computing is first presented through the architecture of Utility Grids. Then, utility-based resource allocation is described in detail at each level of the architecture. Finally, some industrial solutions for utility computing are discussed.

    Submitted 12 May, 2006; originally announced May 2006.

    Comments: 23 pages

    Report number: Technical Report, GRIDS-TR-2006-7, Grid Computing and Distributed Systems Laboratory, The University of Melbourne, Australia, April 13, 2006 ACM Class: C.2.4