Skip to main content

Showing 1–7 of 7 results for author: Soh, D W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.14335  [pdf, other

    cs.CV cs.AI

    Visual Prompting for One-shot Controllable Video Editing without Inversion

    Authors: Zhengbo Zhang, Yuxi Zhou, Duo Peng, Joo-Hwee Lim, Zhigang Tu, De Wen Soh, Lin Geng Foo

    Abstract: One-shot controllable video editing (OCVE) is an important yet challenging task, aiming to propagate user edits that are made -- using any image editing tool -- on the first frame of a video to all subsequent frames, while ensuring content consistency between edited frames and source frames. To achieve this, prior methods employ DDIM inversion to transform source frames into latent noise, which is… ▽ More

    Submitted 19 April, 2025; originally announced April 2025.

    Comments: accepted by cvpr2025

  2. arXiv:2504.00640  [pdf, other

    cs.CV

    POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation

    Authors: Lanyun Zhu, Tianrun Chen, Qianxiong Xu, Xuanyi Liu, Deyi Ji, Haiyang Wu, De Wen Soh, Jun Liu

    Abstract: Existing LVLM-based reasoning segmentation methods often suffer from imprecise segmentation results and hallucinations in their text responses. This paper introduces POPEN, a novel framework designed to address these issues and achieve improved results. POPEN includes a preference-based optimization method to finetune the LVLM, aligning it more closely with human preferences and thereby generating… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: CVPR2025

  3. arXiv:2502.02358  [pdf, other

    cs.CV

    MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm

    Authors: Ziyan Guo, Zeyu Hu, Na Zhao, De Wen Soh

    Abstract: Human motion generation and editing are key components of computer graphics and vision. However, current approaches in this field tend to offer isolated solutions tailored to specific tasks, which can be inefficient and impractical for real-world applications. While some efforts have aimed to unify motion-related tasks, these methods simply use different modalities as conditions to guide motion ge… ▽ More

    Submitted 12 March, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

  4. arXiv:2410.01535  [pdf, other

    cs.CV

    GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians

    Authors: Shuyi Jiang, Qihao Zhao, Hossein Rahmani, De Wen Soh, Jun Liu, Na Zhao

    Abstract: Recently, with the development of Neural Radiance Fields and Gaussian Splatting, 3D reconstruction techniques have achieved remarkably high fidelity. However, the latent representations learnt by these methods are highly entangled and lack interpretability. In this paper, we propose a novel part-aware compositional reconstruction method, called GaussianBlock, that enables semantically coherent and… ▽ More

    Submitted 24 April, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  5. arXiv:2401.00194  [pdf, other

    cs.IT eess.SP

    On the Identifiability from Modulo Measurements under DFT Sensing Matrix

    Authors: Qi Zhang, Jiang Zhu, Fengzhong Qu, Zheng Zhu, De Wen Soh

    Abstract: Modulo sampling (MS) has been recently introduced to enhance the dynamic range of conventional ADCs by applying a modulo operator before sampling. This paper examines the identifiability of a measurement model where measurements are taken using a discrete Fourier transform (DFT) sensing matrix, followed by a modulo operator (modulo-DFT). Firstly, we derive a necessary and sufficient condition for… ▽ More

    Submitted 6 August, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

  6. arXiv:2309.04901  [pdf, other

    eess.SP cs.IT

    One-Bit-Aided Modulo Sampling for DOA Estimation

    Authors: Qi Zhang, Jiang Zhu, Fengzhong Qu, De Wen Soh

    Abstract: Modulo sampling has recently drawn a great deal of attention for cutting-edge applications, due to overcoming the barrier of information loss through sensor saturation and clipping. This is a significant problem, especially when the range of signal amplitudes is unknown or in the near-far case. To overcome this fundamental bottleneck, we propose a one-bit-aided (1bit-aided) modulo sampling scheme… ▽ More

    Submitted 30 December, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

  7. arXiv:2305.11791  [pdf, other

    cs.CL

    Enhancing Few-shot NER with Prompt Ordering based Data Augmentation

    Authors: Huiming Wang, Liying Cheng, Wenxuan Zhang, De Wen Soh, Lidong Bing

    Abstract: Recently, data augmentation (DA) methods have been proven to be effective for pre-trained language models (PLMs) in low-resource settings, including few-shot named entity recognition (NER). However, conventional NER DA methods are mostly aimed at sequence labeling models, i.e., token-level classification, and few are compatible with unified autoregressive generation frameworks, which can handle a… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 7 pages, 2 figures