Skip to main content

Showing 1–43 of 43 results for author: Yap, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.04118  [pdf, ps, other

    cs.CV

    PromptSR: Cascade Prompting for Lightweight Image Super-Resolution

    Authors: Wenyang Liu, Chen Cai, Jianjun Gao, Kejun Wu, Yi Wang, Kim-Hui Yap, Lap-Pui Chau

    Abstract: Although the lightweight Vision Transformer has significantly advanced image super-resolution (SR), it faces the inherent challenge of a limited receptive field due to the window-based self-attention modeling. The quadratic computational complexity relative to window size restricts its ability to use a large window size for expanding the receptive field while maintaining low computational costs. T… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: Accepted in TMM

  2. arXiv:2505.05088  [pdf, other

    cs.MM cs.CV eess.IV

    SSH-Net: A Self-Supervised and Hybrid Network for Noisy Image Watermark Removal

    Authors: Wenyang Liu, Jianjun Gao, Kim-Hui Yap

    Abstract: Visible watermark removal is challenging due to its inherent complexities and the noise carried within images. Existing methods primarily rely on supervised learning approaches that require paired datasets of watermarked and watermark-free images, which are often impractical to obtain in real-world scenarios. To address this challenge, we propose SSH-Net, a Self-Supervised and Hybrid Network speci… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: Under Review in JVCI

  3. arXiv:2411.05161  [pdf

    cs.HC

    Investigation of Tactile Texture Simulation on Online Shopping Experience

    Authors: Pei Hsin Lim, Kian Meng Yap

    Abstract: With safety measures towards the current Covid-19 pandemic, many retails clothing stores have restricted on-site fittings and shifted their business online. Inability to touch on product evaluations shows an apparent limitation as compared to retail shopping especially when the object's material information is crucial like clothing. Haptic technologies show potential of bridging the gap between on… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Part of proceedings of 6th International Conference AsiaHaptics 2024

  4. arXiv:2411.05148  [pdf

    cs.HC

    Haptic VR Simulation for Surgery Procedures in Medical Training

    Authors: Lim Zheng Jie, Kian Meng Yap

    Abstract: Traditional medical training faces challenges like ethical concerns, safety risks, and high costs. VR technology offers a promising solution but is limited by low complexity and lack of tactile feedback. This paper presents a cost-effective haptic VR surgery simulation which simulates realistic Kidney Transplant using commercial devices to enhance training authenticity and immersion. Trainees can… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Part of proceedings of 6th International Conference AsiaHaptics 2024

  5. arXiv:2411.05133  [pdf

    cs.HC

    Innovative Weight Simulation in Virtual Reality Cube Games: A Pseudo-Haptic Approach

    Authors: Woan Ning Lim, Edric Yi Junn Leong, Yun Li Lee, Kian Meng Yap

    Abstract: This paper presents an innovative pseudo-haptic model for weight simulation in virtual reality (VR) environments. By integrating visual feedback with voluntary exerted force through a passive haptic glove, the model creates haptic illusions of weight perception. Two VR cube games were developed to evaluate the model's effectiveness. The first game assesses participants' ability to discriminate rel… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Part of proceedings of 6th International Conference AsiaHaptics 2024

  6. arXiv:2411.05122  [pdf

    cs.RO cs.HC

    Socially Assistive Robots: A Technological Approach to Emotional Support

    Authors: Leanne Oon Hui Yee, Siew Sui Fun, Thit Sar Zin, Zar Nie Aung, Kian Meng Yap, Jiehan Teoh

    Abstract: In today's high-pressure and isolated society, the demand for emotional support has surged, necessitating innovative solutions. Socially Assistive Robots (SARs) offer a technological approach to providing emotional assistance by leveraging advanced robotics, artificial intelligence, and sensor technologies. This study explores the development of an emotional support robot designed to detect and re… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Part of proceedings of 6th International Conference AsiaHaptics 2024

  7. arXiv:2411.05106  [pdf

    cs.HC

    Enhancing Medical Anatomy Education through Virtual Reality (VR): Design, Development, and Evaluation

    Authors: Myint Zu Than, Kian Meng Yap

    Abstract: Modern medicine demands innovations in medical education, particularly in the learning of human anatomy, traditionally taught through textbooks, dissections, and lectures. Virtual Reality (VR) has emerged as a promising tool to address the limitations of these conventional methods by emphasising vision-based and active learning. However, current VR educational tools are often inaccessible due to h… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Part of proceedings of 6th International Conference AsiaHaptics 2024

  8. arXiv:2410.20855  [pdf, other

    cs.CV cs.CR cs.MM

    ByteNet: Rethinking Multimedia File Fragment Classification through Visual Perspectives

    Authors: Wenyang Liu, Kejun Wu, Tianyi Liu, Yi Wang, Kim-Hui Yap, Lap-Pui Chau

    Abstract: Multimedia file fragment classification (MFFC) aims to identify file fragment types, e.g., image/video, audio, and text without system metadata. It is of vital importance in multimedia storage and communication. Existing MFFC methods typically treat fragments as 1D byte sequences and emphasize the relations between separate bytes (interbytes) for classification. However, the more informative relat… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: Accepted in TMM

  9. arXiv:2410.15657  [pdf, other

    cs.CV cs.CL

    CL-HOI: Cross-Level Human-Object Interaction Distillation from Vision Large Language Models

    Authors: Jianjun Gao, Chen Cai, Ruoyu Wang, Wenyang Liu, Kim-Hui Yap, Kratika Garg, Boon-Siew Han

    Abstract: Human-object interaction (HOI) detection has seen advancements with Vision Language Models (VLMs), but these methods often depend on extensive manual annotations. Vision Large Language Models (VLLMs) can inherently recognize and reason about interactions at the image level but are computationally heavy and not designed for instance-level HOI detection. To overcome these limitations, we propose a C… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  10. Open World Object Detection: A Survey

    Authors: Yiming Li, Yi Wang, Wenqian Wang, Dan Lin, Bingbing Li, Kim-Hui Yap

    Abstract: Exploring new knowledge is a fundamental human ability that can be mirrored in the development of deep neural networks, especially in the field of object detection. Open world object detection (OWOD) is an emerging area of research that adapts this principle to explore new knowledge. It focuses on recognizing and learning from objects absent from initial training sets, thereby incrementally expand… ▽ More

    Submitted 28 June, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: Accepted for publication in IEEE TCSVT

  11. arXiv:2410.00771  [pdf, other

    cs.CV cs.CL

    Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting

    Authors: Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap

    Abstract: In recent years, the rapid increase in online video content has underscored the limitations of static Video Question Answering (VideoQA) models trained on fixed datasets, as they struggle to adapt to new questions or tasks posed by newly available content. In this paper, we explore the novel challenge of VideoQA within a continual learning framework, and empirically identify a critical issue: fine… ▽ More

    Submitted 16 January, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

    Comments: Accepted by main EMNLP 2024

  12. arXiv:2408.01766  [pdf, other

    cs.CV

    MultiFuser: Multimodal Fusion Transformer for Enhanced Driver Action Recognition

    Authors: Ruoyu Wang, Wenqian Wang, Jianjun Gao, Dan Lin, Kim-Hui Yap, Bingbing Li

    Abstract: Driver action recognition, aiming to accurately identify drivers' behaviours, is crucial for enhancing driver-vehicle interactions and ensuring driving safety. Unlike general action recognition, drivers' environments are often challenging, being gloomy and dark, and with the development of sensors, various cameras such as IR and depth cameras have emerged for analyzing drivers' behaviors. Therefor… ▽ More

    Submitted 17 August, 2024; v1 submitted 3 August, 2024; originally announced August 2024.

  13. arXiv:2406.11340  [pdf, other

    cs.CV cs.LG

    CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition

    Authors: Ruoyu Wang, Chen Cai, Wenqian Wang, Jianjun Gao, Dan Lin, Wenyang Liu, Kim-Hui Yap

    Abstract: Driver action recognition has significantly advanced in enhancing driver-vehicle interactions and ensuring driving safety by integrating multiple modalities, such as infrared and depth. Nevertheless, compared to RGB modality only, it is always laborious and costly to collect extensive data for all types of non-RGB modalities in car cabin environments. Therefore, previous works have suggested indep… ▽ More

    Submitted 3 August, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  14. arXiv:2404.13611  [pdf, other

    cs.CV cs.CL

    Video sentence grounding with temporally global textual knowledge

    Authors: Cai Chen, Runzhong Zhang, Jianjun Gao, Kejun Wu, Kim-Hui Yap, Yi Wang

    Abstract: Temporal sentence grounding involves the retrieval of a video moment with a natural language query. Many existing works directly incorporate the given video and temporally localized query for temporal grounding, overlooking the inherent domain gap between different modalities. In this paper, we utilize pseudo-query features containing extensive temporally global textual knowledge sourced from the… ▽ More

    Submitted 1 June, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  15. arXiv:2401.14838  [pdf, other

    cs.CV

    Multi-modality action recognition based on dual feature shift in vehicle cabin monitoring

    Authors: Dan Lin, Philip Hann Yung Lee, Yiming Li, Ruoyu Wang, Kim-Hui Yap, Bingbing Li, You Shing Ngim

    Abstract: Driver Action Recognition (DAR) is crucial in vehicle cabin monitoring systems. In real-world applications, it is common for vehicle cabins to be equipped with cameras featuring different modalities. However, multi-modality fusion strategies for the DAR task within car cabins have rarely been studied. In this paper, we propose a novel yet efficient multi-modality driver action recognition method b… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  16. arXiv:2401.08126  [pdf, other

    cs.NI

    Octopus: A Fair Packet Delivery Service

    Authors: Junzhi Gong, Yuliang Li, Devdeep Ray, KK Yap, Nandita Dukkipati

    Abstract: The packet delivery fairness is critical in many applications in the cloud, such as exchange systems, consensus protocols, and online gaming applications. However, due to nonidentical and dynamic packet forwarding paths, as well as many in-network queuing delays, supporting packet delivery fairness is challenging in a shared compute environment. In this paper, we present Octopus, the first general… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  17. arXiv:2311.06070  [pdf, other

    cs.CV

    Learning-Based Biharmonic Augmentation for Point Cloud Classification

    Authors: Jiacheng Wei, Guosheng Lin, Henghui Ding, Jie Hu, Kim-Hui Yap

    Abstract: Point cloud datasets often suffer from inadequate sample sizes in comparison to image datasets, making data augmentation challenging. While traditional methods, like rigid transformations and scaling, have limited potential in increasing dataset diversity due to their constraints on altering individual sample shapes, we introduce the Biharmonic Augmentation (BA) method. BA is a novel and efficient… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  18. arXiv:2309.13890  [pdf, other

    cs.CV eess.IV

    Bitstream-Corrupted Video Recovery: A Novel Benchmark Dataset and Method

    Authors: Tianyi Liu, Kejun Wu, Yi Wang, Wenyang Liu, Kim-Hui Yap, Lap-Pui Chau

    Abstract: The past decade has witnessed great strides in video recovery by specialist technologies, like video inpainting, completion, and error concealment. However, they typically simulate the missing content by manual-designed error masks, thus failing to fill in the realistic video loss in video communication (e.g., telepresence, live streaming, and internet video) and multimedia forensics. To address t… ▽ More

    Submitted 26 September, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted by NeurIPS Dataset and Benchmark Track 2023

  19. arXiv:2309.10360  [pdf, other

    cs.CV cs.AI

    OccluTrack: Rethinking Awareness of Occlusion for Enhancing Multiple Pedestrian Tracking

    Authors: Jianjun Gao, Yi Wang, Kim-Hui Yap, Kratika Garg, Boon Siew Han

    Abstract: Multiple pedestrian tracking is crucial for enhancing safety and efficiency in intelligent transport and autonomous driving systems by predicting movements and enabling adaptive decision-making in dynamic environments. It optimizes traffic flow, facilitates human interaction, and ensures compliance with regulations. However, it faces the challenge of tracking pedestrians in the presence of occlusi… ▽ More

    Submitted 26 April, 2025; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE T-ITS

  20. arXiv:2308.16763  [pdf, other

    cs.CL cs.AI

    Ladder-of-Thought: Using Knowledge as Steps to Elevate Stance Detection

    Authors: Kairui Hu, Ming Yan, Joey Tianyi Zhou, Ivor W. Tsang, Wen Haw Chong, Yong Keong Yap

    Abstract: Stance detection aims to identify the attitude expressed in a document towards a given target. Techniques such as Chain-of-Thought (CoT) prompting have advanced this task, enhancing a model's reasoning capabilities through the derivation of intermediate rationales. However, CoT relies primarily on a model's pre-trained internal knowledge during reasoning, thereby neglecting the valuable external i… ▽ More

    Submitted 7 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: 5 pages, 2 figures, 2 tables

  21. arXiv:2306.07490  [pdf, other

    cs.CV

    Top-Down Framework for Weakly-supervised Grounded Image Captioning

    Authors: Chen Cai, Suchen Wang, Kim-hui Yap, Yi Wang

    Abstract: Weakly-supervised grounded image captioning (WSGIC) aims to generate the caption and ground (localize) predicted object words in the input image without using bounding box supervision. Recent two-stage solutions mostly apply a bottom-up pipeline: (1) encode the input image into multiple region features using an object detector; (2) leverage region features for captioning and grounding. However, ut… ▽ More

    Submitted 2 March, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  22. arXiv:2305.19845  [pdf, other

    cs.CL

    Guiding Computational Stance Detection with Expanded Stance Triangle Framework

    Authors: Zhengyuan Liu, Yong Keong Yap, Hai Leong Chieu, Nancy F. Chen

    Abstract: Stance detection determines whether the author of a piece of text is in favor of, against, or neutral towards a specified target, and can be used to gain valuable insights into social media. The ubiquitous indirect referral of targets makes this task challenging, as it requires computational solutions to model semantic features and infer the corresponding implications from a literal statement. Mor… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Main Conference in ACL 2023

  23. arXiv:2304.11404  [pdf, other

    cs.CV eess.IV eess.SP

    SSN: Stockwell Scattering Network for SAR Image Change Detection

    Authors: Gong Chen, Yanan Zhao, Yi Wang, Kim-Hui Yap

    Abstract: Recently, synthetic aperture radar (SAR) image change detection has become an interesting yet challenging direction due to the presence of speckle noise. Although both traditional and modern learning-driven methods attempted to overcome this challenge, deep convolutional neural networks (DCNNs)-based methods are still hindered by the lack of interpretability and the requirement of large computatio… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: 5 pages, 6 figures

    MSC Class: 53-04 ACM Class: I.2.1

    Journal ref: IEEE Geoscience and Remote Sensing Letters, 2023

  24. arXiv:2304.06983  [pdf, other

    cs.CV eess.SP

    A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram Embeddings

    Authors: Wenyang Liu, Yi Wang, Kejun Wu, Kim-Hui Yap, Lap-Pui Chau

    Abstract: File fragment classification (FFC) on small chunks of memory is essential in memory forensics and Internet security. Existing methods mainly treat file fragments as 1d byte signals and utilize the captured inter-byte features for classification, while the bit information within bytes, i.e., intra-byte information, is seldom considered. This is inherently inapt for classifying variable-length codin… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted by AICAS 2023

  25. arXiv:2304.06976  [pdf, other

    eess.IV cs.CV

    Bitstream-Corrupted JPEG Images are Restorable: Two-stage Compensation and Alignment Framework for Image Restoration

    Authors: Wenyang Liu, Yi Wang, Kim-Hui Yap, Lap-Pui Chau

    Abstract: In this paper, we study a real-world JPEG image restoration problem with bit errors on the encrypted bitstream. The bit errors bring unpredictable color casts and block shifts on decoded image contents, which cannot be resolved by existing image restoration methods mainly relying on pre-defined degradation models in the pixel domain. To address these challenges, we propose a robust JPEG decoder, f… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2023

  26. arXiv:2303.13273  [pdf, other

    cs.CV

    TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision

    Authors: Jiacheng Wei, Hao Wang, Jiashi Feng, Guosheng Lin, Kim-Hui Yap

    Abstract: In this paper, we investigate an open research task of generating controllable 3D textured shapes from the given textual descriptions. Previous works either require ground truth caption labeling or extensive optimization time. To resolve these issues, we present a novel framework, TAPS3D, to train a text-guided 3D shape generator with pseudo captions. Specifically, based on rendered 2D images, we… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR2023

  27. Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds

    Authors: Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Fayao Liu, Tzu-Yi Hung

    Abstract: Semantic segmentation on 3D point clouds is an important task for 3D scene understanding. While dense labeling on 3D data is expensive and time-consuming, only a few works address weakly supervised semantic point cloud segmentation methods to relieve the labeling cost by learning from simpler and cheaper labels. Meanwhile, there are still huge performance gaps between existing weakly supervised me… ▽ More

    Submitted 1 April, 2024; v1 submitted 23 July, 2021; originally announced July 2021.

  28. arXiv:2106.00256  [pdf, other

    cs.CV

    Reconciliation of Statistical and Spatial Sparsity For Robust Image and Image-Set Classification

    Authors: Hao Cheng, Kim-Hui Yap, Bihan Wen

    Abstract: Recent image classification algorithms, by learning deep features from large-scale datasets, have achieved significantly better results comparing to the classic feature-based approaches. However, there are still various challenges of image classifications in practice, such as classifying noisy image or image-set queries and training deep image classification models over the limited-scale dataset.… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: Submitted to IEEE Transactions on Multimedia

  29. arXiv:2105.03856  [pdf, ps, other

    cs.SC

    The D-plus Discriminant and Complexity of Root Clustering

    Authors: Jing Yang, Chee K. Yap

    Abstract: Let $p(x)$ be an integer polynomial with $m\ge 2$ distinct roots $ρ_1,\ldots,ρ_m$ whose multiplicities are $\boldsymbolμ=(μ_1,\ldots,μ_m)$. We define the D-plus discriminant of $p(x)$ to be $D^+(p):= \prod_{1\le i<j\le m}(ρ_i-ρ_j)^{μ_i+μ_j}$. We first prove a conjecture that $D^+(p)$ is a $\boldsymbolμ$-symmetric function of its roots $ρ_1,\ldots,ρ_m$. Our main result gives an explicit formula for… ▽ More

    Submitted 19 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

    MSC Class: 68W30; 11R29; 68Q25

  30. arXiv:2006.14265  [pdf, other

    cs.LG cs.CV stat.ML

    Empirical Analysis of Overfitting and Mode Drop in GAN Training

    Authors: Yasin Yazici, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Vijay Chandrasekhar

    Abstract: We examine two key questions in GAN training, namely overfitting and mode drop, from an empirical perspective. We show that when stochasticity is removed from the training procedure, GANs can overfit and exhibit almost no mode drop. Our results shed light on important characteristics of the GAN training procedure. They also provide evidence against prevailing intuitions that GANs do not memorize t… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: To appear in ICIP2020

  31. arXiv:2006.14256  [pdf, other

    cs.AR

    Arnold: an eFPGA-Augmented RISC-V SoC for Flexible and Low-Power IoT End-Nodes

    Authors: Pasquale Davide Schiavone, Davide Rossi, Alfio Di Mauro, Frank Gurkaynak, Timothy Saxe, Mao Wang, Ket Chong Yap, Luca Benini

    Abstract: A wide range of Internet of Things (IoT) applications require powerful, energy-efficient and flexible end-nodes to acquire data from multiple sources, process and distill the sensed data through near-sensor data analytics algorithms, and transmit it wirelessly. This work presents Arnold: a 0.5 V to 0.8 V, 46.83 uW/MHz, 600 MOPS fully programmable RISC-V Microcontroller unit (MCU) fabricated in 22… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

  32. arXiv:2003.13035  [pdf, other

    cs.CV

    Multi-Path Region Mining For Weakly Supervised 3D Semantic Segmentation on Point Clouds

    Authors: Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Tzu-Yi Hung, Lihua Xie

    Abstract: Point clouds provide intrinsic geometric information and surface context for scene understanding. Existing methods for point cloud segmentation require a large amount of fully labeled data. Using advanced depth sensors, collection of large scale 3D dataset is no longer a cumbersome process. However, manually producing point-level label on the large scale dataset is time and labor-intensive. In thi… ▽ More

    Submitted 29 March, 2020; originally announced March 2020.

    Comments: Accepted by CVPR2020

  33. arXiv:2001.07403  [pdf, other

    cs.SC

    On mu-Symmetric Polynomials

    Authors: Jing Yang, Chee K. Yap

    Abstract: In this paper, we study functions of the roots of a univariate polynomial in which the roots have a given multiplicity structure $μ$. Traditionally, root functions are studied via the theory of symmetric polynomials; we extend this theory to $μ$-symmetric polynomials. We were motivated by a conjecture from Becker et al.~(ISSAC 2016) about the $μ$-symmetry of a particular root function $D^+(μ)$, ca… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

  34. arXiv:1912.09021  [pdf, other

    cs.CV

    AANet: Attribute Attention Network for Person Re-Identifications

    Authors: Chiat-Pin Tay, Sharmili Roy, Kim-Hui Yap

    Abstract: This paper proposes Attribute Attention Network (AANet), a new architecture that integrates person attributes and attribute attention maps into a classification framework to solve the person re-identification (re-ID) problem. Many person re-ID models typically employ semantic cues such as body parts or human pose to improve the re-ID performance. Attribute information, however, is often not utiliz… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: CVPR 2019

  35. arXiv:1911.06047  [pdf, other

    cs.CV cs.LG

    Semantic Granularity Metric Learning for Visual Search

    Authors: Dipu Manandhar, Muhammet Bastan, Kim-Hui Yap

    Abstract: Deep metric learning applied to various applications has shown promising results in identification, retrieval and recognition. Existing methods often do not consider different granularity in visual similarity. However, in many domain applications, images exhibit similarity at multiple granularities with visual semantic concepts, e.g. fashion demonstrates similarity ranging from clothing of the exa… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: 10 pages, 10 figures

  36. arXiv:1902.03444  [pdf, other

    cs.LG stat.ML

    Venn GAN: Discovering Commonalities and Particularities of Multiple Distributions

    Authors: Yasin Yazıcı, Bruno Lecouat, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

    Abstract: We propose a GAN design which models multiple distributions effectively and discovers their commonalities and particularities. Each data distribution is modeled with a mixture of $K$ generator distributions. As the generators are partially shared between the modeling of different true data distributions, shared ones captures the commonality of the distributions, while non-shared ones capture uniqu… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.

  37. arXiv:1901.00031  [pdf, ps, other

    cs.CV

    Interest Point Detection based on Adaptive Ternary Coding

    Authors: Zhenwei Miao, Kim-Hui Yap, Xudong Jiang

    Abstract: In this paper, an adaptive pixel ternary coding mechanism is proposed and a contrast invariant and noise resistant interest point detector is developed on the basis of this mechanism. Every pixel in a local region is adaptively encoded into one of the three statuses: bright, uncertain and dark. The blob significance of the local region is measured by the spatial distribution of the bright and dark… ▽ More

    Submitted 31 December, 2018; originally announced January 2019.

  38. arXiv:1901.00027  [pdf, other

    cs.CV

    DCI: Discriminative and Contrast Invertible Descriptor

    Authors: Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, Zhenhua Wang

    Abstract: Local feature descriptors have been widely used in fine-grained visual object search thanks to their robustness in scale and rotation variation and cluttered background. However, the performance of such descriptors drops under severe illumination changes. In this paper, we proposed a Discriminative and Contrast Invertible (DCI) local feature descriptor. In order to increase the discriminative abil… ▽ More

    Submitted 31 December, 2018; originally announced January 2019.

  39. arXiv:1806.04498  [pdf, other

    stat.ML cs.CV cs.LG

    The Unusual Effectiveness of Averaging in GAN Training

    Authors: Yasin Yazıcı, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

    Abstract: We examine two different techniques for parameter averaging in GAN training. Moving Average (MA) computes the time-average of parameters, whereas Exponential Moving Average (EMA) computes an exponentially discounted sum. Whilst MA is known to lead to convergence in bilinear settings, we provide the -- to our knowledge -- first theoretical arguments in support of EMA. We show that EMA converges to… ▽ More

    Submitted 26 February, 2019; v1 submitted 12 June, 2018; originally announced June 2018.

    Comments: Published as a conference paper at ICLR 2019

  40. arXiv:1804.10805  [pdf, ps, other

    cs.CV

    Remote Detection of Idling Cars Using Infrared Imaging and Deep Networks

    Authors: Muhammet Bastan, Kim-Hui Yap, Lap-Pui Chau

    Abstract: Idling vehicles waste energy and pollute the environment through exhaust emission. In some countries, idling a vehicle for more than a predefined duration is prohibited and automatic idling vehicle detection is desirable for law enforcement. We propose the first automatic system to detect idling cars, using infrared (IR) imaging and deep networks. We rely on the differences in spatio-temporal he… ▽ More

    Submitted 28 April, 2018; originally announced April 2018.

    Comments: Neural Computing and Applications

  41. Handling state space explosion in verification of component-based systems: A review

    Authors: Faranak Nejati, Abdul Azim Abd. Ghani, Ng Keng Yap, Azmi Jaafar

    Abstract: Component-based software development (CBSD) is an alternative approach to constructing software systems that offers numerous benefits, particularly in decreasing the complexity of system design. However, deploying components into a system is a challenging and error-prone task. Model-checking is one of the reliable methods to systematically analyze the correctness of a system. It is a bruce-force c… ▽ More

    Submitted 26 May, 2021; v1 submitted 28 July, 2017; originally announced September 2017.

    Journal ref: IEEEAccess, 2021

  42. arXiv:1704.05123  [pdf, other

    cs.CG cs.RO

    Resolution-Exact Planner for Thick Non-Crossing 2-Link Robots

    Authors: Chee K. Yap, Zhongdi Luo, Ching-Hsiang Hsu

    Abstract: We consider the path planning problem for a 2-link robot amidst polygonal obstacles. Our robot is parametrizable by the lengths $\ell_1, \ell_2>0$ of its two links, the thickness $τ\ge 0$ of the links, and an angle $κ$ that constrains the angle between the 2 links to be strictly greater than $κ$. The case $τ>0$ and $κ\ge 0$ corresponds to "thick non-crossing" robots. This results in a novel 4DOF c… ▽ More

    Submitted 17 April, 2017; originally announced April 2017.

  43. arXiv:1506.06265  [pdf, ps, other

    cs.CG

    Certified Computation of planar Morse-Smale Complexes

    Authors: Amit Chattopadhyay, Gert Vegter, Chee K. Yap

    Abstract: The Morse-Smale complex is an important tool for global topological analysis in various problems of computational geometry and topology. Algorithms for Morse-Smale complexes have been presented in case of piecewise linear manifolds. However, previous research in this field is incomplete in the case of smooth functions. In the current paper we address the following question: Given an arbitrarily co… ▽ More

    Submitted 20 June, 2015; originally announced June 2015.

    Comments: Under Review in Journal