Skip to main content

Showing 1–22 of 22 results for author: Ong, Y

Searching in archive stat. Search in all archives.
.
  1. Co-Learning Bayesian Optimization

    Authors: Zhendong Guo, Yew-Soon Ong, Tiantian He, Haitao Liu

    Abstract: Bayesian optimization (BO) is well known to be sample-efficient for solving black-box problems. However, the BO algorithms can sometimes get stuck in suboptimal solutions even with plenty of samples. Intrinsically, such suboptimal problem of BO can attribute to the poor surrogate accuracy of the trained Gaussian process (GP), particularly that in the regions where the optimal solutions locate. Hen… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

    Journal ref: in IEEE Transactions on Cybernetics, vol. 52, no. 9, pp. 9820-9833, Sept. 2022

  2. arXiv:2411.02467  [pdf, other

    cs.LG cs.CY stat.ML

    Towards Harmless Rawlsian Fairness Regardless of Demographic Prior

    Authors: Xuanqian Wang, Jing Li, Ivor W. Tsang, Yew-Soon Ong

    Abstract: Due to privacy and security concerns, recent advancements in group fairness advocate for model training regardless of demographic information. However, most methods still require prior knowledge of demographics. In this study, we explore the potential for achieving fairness without compromising its utility when no prior demographics are provided to the training set, namely \emph{harmless Rawlsian… ▽ More

    Submitted 8 November, 2024; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: Accepted as a Poster in Neurips 2024

  3. arXiv:2406.00812  [pdf, other

    stat.ML cs.LG

    Covariance-Adaptive Sequential Black-box Optimization for Diffusion Targeted Generation

    Authors: Yueming Lyu, Kim Yong Tan, Yew Soon Ong, Ivor W. Tsang

    Abstract: Diffusion models have demonstrated great potential in generating high-quality content for images, natural language, protein domains, etc. However, how to perform user-preferred targeted generation via diffusion models with only black-box target scores of users remains challenging. To address this issue, we first formulate the fine-tuning of the targeted reserve-time stochastic differential equatio… ▽ More

    Submitted 8 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  4. arXiv:2202.12636  [pdf, other

    stat.ML cs.LG

    Learning Multi-Task Gaussian Process Over Heterogeneous Input Domains

    Authors: Haitao Liu, Kai Wu, Yew-Soon Ong, Chao Bian, Xiaomo Jiang, Xiaofang Wang

    Abstract: Multi-task Gaussian process (MTGP) is a well-known non-parametric Bayesian model for learning correlated tasks effectively by transferring knowledge across tasks. But current MTGPs are usually limited to the multi-task scenario defined in the same input domain, leaving no space for tackling the heterogeneous case, i.e., the features of input domains vary over tasks. To this end, this paper present… ▽ More

    Submitted 18 June, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  5. arXiv:2008.12922  [pdf, other

    stat.ML cs.LG

    Modulating Scalable Gaussian Processes for Expressive Statistical Learning

    Authors: Haitao Liu, Yew-Soon Ong, Xiaomo Jiang, Xiaofang Wang

    Abstract: For a learning task, Gaussian process (GP) is interested in learning the statistical relationship between inputs and outputs, since it offers not only the prediction mean but also the associated variability. The vanilla GP however struggles to learn complicated distribution with the property of, e.g., heteroscedastic noise, multi-modality and non-stationarity, from massive data due to the Gaussian… ▽ More

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: 31 pages, 9 figures, 4 tables, preprint under review

  6. arXiv:2008.06199  [pdf, other

    cs.LG stat.ML

    Adversary Agnostic Robust Deep Reinforcement Learning

    Authors: Xinghua Qu, Yew-Soon Ong, Abhishek Gupta, Zhu Sun

    Abstract: Deep reinforcement learning (DRL) policies have been shown to be deceived by perturbations (e.g., random noise or intensional adversarial attacks) on state observations that appear at test time but are unknown during training. To increase the robustness of DRL policies, previous approaches assume that the knowledge of adversaries can be added into the training process to achieve the corresponding… ▽ More

    Submitted 24 December, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

  7. arXiv:2005.08467  [pdf, other

    stat.ML cs.LG

    Deep Latent-Variable Kernel Learning

    Authors: Haitao Liu, Yew-Soon Ong, Xiaomo Jiang, Xiaofang Wang

    Abstract: Deep kernel learning (DKL) leverages the connection between Gaussian process (GP) and neural networks (NN) to build an end-to-end, hybrid model. It combines the capability of NN to learn rich representations under massive data and the non-parametric property of GP to achieve automatic regularization that incorporates a trade-off between model fit and model complexity. However, the deterministic en… ▽ More

    Submitted 19 August, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: 13 pages, 8 figures, preprint under review

  8. arXiv:2004.13303  [pdf, other

    cs.LG stat.ML

    Heterogeneous Representation Learning: A Review

    Authors: Joey Tianyi Zhou, Xi Peng, Yew-Soon Ong

    Abstract: The real-world data usually exhibits heterogeneous properties such as modalities, views, or resources, which brings some unique challenges wherein the key is Heterogeneous Representation Learning (HRL) termed in this paper. This brief survey covers the topic of HRL, centered around several major learning settings and real-world applications. First of all, from the mathematical perspective, we pres… ▽ More

    Submitted 30 April, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

  9. arXiv:2001.01051  [pdf, other

    cs.LG stat.ML

    Temporal Tensor Transformation Network for Multivariate Time Series Prediction

    Authors: Yuya Jeremy Ong, Mu Qiao, Divyesh Jadav

    Abstract: Multivariate time series prediction has applications in a wide variety of domains and is considered to be a very challenging task, especially when the variables have correlations and exhibit complex temporal patterns, such as seasonality and trend. Many existing methods suffer from strong statistical assumptions, numerical issues with high dimensionality, manual feature engineering efforts, and sc… ▽ More

    Submitted 4 January, 2020; originally announced January 2020.

  10. arXiv:1911.07693  [pdf, ps, other

    cs.LG stat.ML

    A Multi-Task Gradient Descent Method for Multi-Label Learning

    Authors: Lu Bai, Yew-Soon Ong, Tiantian He, Abhishek Gupta

    Abstract: Multi-label learning studies the problem where an instance is associated with a set of labels. By treating single-label learning problem as one task, the multi-label learning problem can be casted as solving multiple related tasks simultaneously. In this paper, we propose a novel Multi-task Gradient Descent (MGD) algorithm to solve a group of related tasks simultaneously. In the proposed algorithm… ▽ More

    Submitted 19 November, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

  11. arXiv:1911.03849  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Minimalistic Attacks: How Little it Takes to Fool a Deep Reinforcement Learning Policy

    Authors: Xinghua Qu, Zhu Sun, Yew-Soon Ong, Abhishek Gupta, Pengfei Wei

    Abstract: Recent studies have revealed that neural network-based policies can be easily fooled by adversarial examples. However, while most prior works analyze the effects of perturbing every pixel of every frame assuming white-box policy access, in this paper we take a more restrictive view towards adversary generation - with the goal of unveiling the limits of a model's vulnerability. In particular, we ex… ▽ More

    Submitted 29 October, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: Accepted by IEEE Transactions on Cognitive and Developmental System

  12. arXiv:1910.04062  [pdf, other

    cs.LG stat.ML

    DEVDAN: Deep Evolving Denoising Autoencoder

    Authors: Andri Ashfahani, Mahardhika Pratama, Edwin Lughofer, Yew Soon Ong

    Abstract: The Denoising Autoencoder (DAE) enhances the flexibility of the data stream method in exploiting unlabeled samples. Nonetheless, the feasibility of DAE for data stream analytic deserves an in-depth study because it characterizes a fixed network capacity that cannot adapt to rapidly changing environments. Deep evolving denoising autoencoder (DEVDAN), is proposed in this paper. It features an open s… ▽ More

    Submitted 9 January, 2020; v1 submitted 8 October, 2019; originally announced October 2019.

    Comments: This paper has been accepted for publication in Neurocomputing 2019. arXiv admin note: substantial text overlap with arXiv:1809.09081

  13. arXiv:1910.03437  [pdf, other

    cs.LG cs.NE stat.ML

    Automatic Construction of Multi-layer Perceptron Network from Streaming Examples

    Authors: Mahardhika Pratama, Choiru Za'in, Andri Ashfahani, Yew Soon Ong, Weiping Ding

    Abstract: Autonomous construction of deep neural network (DNNs) is desired for data streams because it potentially offers two advantages: proper model's capacity and quick reaction to drift and shift. While the self-organizing mechanism of DNNs remains an open issue, this task is even more challenging to be developed for standard multi-layer DNNs than that using the different-depth structures, because the a… ▽ More

    Submitted 9 January, 2020; v1 submitted 8 October, 2019; originally announced October 2019.

    Comments: This paper has been accepted for publication in CIKM 2019

  14. arXiv:1909.06541  [pdf, other

    stat.ML cs.LG

    Scalable Gaussian Process Classification with Additive Noise for Various Likelihoods

    Authors: Haitao Liu, Yew-Soon Ong, Ziwei Yu, Jianfei Cai, Xiaobo Shen

    Abstract: Gaussian process classification (GPC) provides a flexible and powerful statistical framework describing joint distributions over function space. Conventional GPCs however suffer from (i) poor scalability for big data due to the full kernel matrix, and (ii) intractable inference due to the non-Gaussian likelihoods. Hence, various scalable GPCs have been proposed through (i) the sparse approximation… ▽ More

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: 11 pages, 5 figures, preprint under review

  15. arXiv:1905.09877  [pdf, other

    cs.LG eess.SP stat.ML

    CASS: Cross Adversarial Source Separation via Autoencoder

    Authors: Yong Zheng Ong, Charles K. Chui, Haizhao Yang

    Abstract: This paper introduces a cross adversarial source separation (CASS) framework via autoencoder, a new model that aims at separating an input signal consisting of a mixture of multiple components into individual components defined via adversarial learning and autoencoder fitting. CASS unifies popular generative networks like auto-encoders (AEs) and generative adversarial networks (GANs) in a single f… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

  16. arXiv:1901.00248  [pdf, other

    cs.LG cs.CV stat.ML

    A Survey on Multi-output Learning

    Authors: Donna Xu, Yaxin Shi, Ivor W. Tsang, Yew-Soon Ong, Chen Gong, Xiaobo Shen

    Abstract: Multi-output learning aims to simultaneously predict multiple outputs given an input. It is an important learning problem due to the pressing need for sophisticated decision making in real-world applications. Inspired by big data, the 4Vs characteristics of multi-output imposes a set of challenges to multi-output learning, in terms of the volume, velocity, variety and veracity of the outputs. Incr… ▽ More

    Submitted 13 October, 2019; v1 submitted 1 January, 2019; originally announced January 2019.

    Comments: Paper accepted by IEEE Transactions on Neural Networks and Learning Systems

  17. arXiv:1811.01179  [pdf, other

    stat.ML cs.LG

    Large-scale Heteroscedastic Regression via Gaussian Process

    Authors: Haitao Liu, Yew-Soon Ong, Jianfei Cai

    Abstract: Heteroscedastic regression considering the varying noises among observations has many applications in the fields like machine learning and statistics. Here we focus on the heteroscedastic Gaussian process (HGP) regression which integrates the latent function and the noise function together in a unified non-parametric Bayesian framework. Though showing remarkable performance, HGP suffers from the c… ▽ More

    Submitted 21 January, 2020; v1 submitted 3 November, 2018; originally announced November 2018.

    Comments: 14 pages, 15 figures

  18. arXiv:1811.01159  [pdf, ps, other

    stat.ML cs.LG

    Understanding and Comparing Scalable Gaussian Process Regression for Big Data

    Authors: Haitao Liu, Jianfei Cai, Yew-Soon Ong, Yi Wang

    Abstract: As a non-parametric Bayesian model which produces informative predictive distribution, Gaussian process (GP) has been widely used in various fields, like regression, classification and optimization. The cubic complexity of standard GP however leads to poor scalability, which poses challenges in the era of big data. Hence, various scalable GPs have been developed in the literature in order to impro… ▽ More

    Submitted 3 November, 2018; originally announced November 2018.

    Comments: 25 pages, 15 figures, preprint submitted to KBS

  19. arXiv:1809.09081  [pdf, other

    cs.LG cs.AI stat.ML

    Autonomous Deep Learning: Incremental Learning of Denoising Autoencoder for Evolving Data Streams

    Authors: Mahardhika Pratama, Andri Ashfahani, Yew Soon Ong, Savitha Ramasamy, Edwin Lughofer

    Abstract: The generative learning phase of Autoencoder (AE) and its successor Denosing Autoencoder (DAE) enhances the flexibility of data stream method in exploiting unlabelled samples. Nonetheless, the feasibility of DAE for data stream analytic deserves in-depth study because it characterizes a fixed network capacity which cannot adapt to rapidly changing environments. An automated construction of a denoi… ▽ More

    Submitted 24 September, 2018; originally announced September 2018.

    Comments: have been submitted to AAAI 2019 conference

  20. arXiv:1807.01065  [pdf, ps, other

    stat.ML cs.LG

    When Gaussian Process Meets Big Data: A Review of Scalable GPs

    Authors: Haitao Liu, Yew-Soon Ong, Xiaobo Shen, Jianfei Cai

    Abstract: The vast quantity of information brought by big data as well as the evolving computer hardware encourages success stories in the machine learning community. In the meanwhile, it poses challenges for the Gaussian process (GP) regression, a well-known non-parametric and interpretable Bayesian model, which suffers from cubic complexity to data size. To improve the scalability while retaining desirabl… ▽ More

    Submitted 9 April, 2019; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: 20 pages, 6 figures

  21. arXiv:1806.00720  [pdf, ps, other

    stat.ML cs.LG

    Generalized Robust Bayesian Committee Machine for Large-scale Gaussian Process Regression

    Authors: Haitao Liu, Jianfei Cai, Yi Wang, Yew-Soon Ong

    Abstract: In order to scale standard Gaussian process (GP) regression to large-scale datasets, aggregation models employ factorized training process and then combine predictions from distributed experts. The state-of-the-art aggregation models, however, either provide inconsistent predictions or require time-consuming aggregation process. We first prove the inconsistency of typical aggregations using disjoi… ▽ More

    Submitted 2 June, 2018; originally announced June 2018.

    Comments: paper + supplementary material, appears in Proceedings of ICML 2018

  22. arXiv:1206.6477  [pdf

    cs.LG stat.ML

    Discovering Support and Affiliated Features from Very High Dimensions

    Authors: Yiteng Zhai, Mingkui Tan, Ivor Tsang, Yew Soon Ong

    Abstract: In this paper, a novel learning paradigm is presented to automatically identify groups of informative and correlated features from very high dimensions. Specifically, we explicitly incorporate correlation measures as constraints and then propose an efficient embedded feature selection method using recently developed cutting plane strategy. The benefits of the proposed algorithm are two-folds. Firs… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)