Skip to main content

Showing 1–14 of 14 results for author: Foo, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.00270  [pdf, other

    cs.LG cs.AI stat.ML

    DUET: Optimizing Training Data Mixtures via Feedback from Unseen Evaluation Tasks

    Authors: Zhiliang Chen, Gregory Kang Ruey Lau, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: The performance of an LLM depends heavily on the relevance of its training data to the downstream evaluation task. However, in practice, the data involved in an unseen evaluation task is often unknown (e.g., conversations between an LLM and a user are end-to-end encrypted). Hence, it is unclear what data are relevant for fine-tuning the LLM to maximize its performance on the specific unseen evalua… ▽ More

    Submitted 18 May, 2025; v1 submitted 31 January, 2025; originally announced February 2025.

  2. arXiv:2310.00646  [pdf, other

    cs.LG cs.AI stat.ML

    Source Attribution for Large Language Model-Generated Data

    Authors: Jingtan Wang, Xinyang Lu, Zitong Zhao, Zhongxiang Dai, Chuan-Sheng Foo, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The impressive performances of Large Language Models (LLMs) and their immense potential for commercialization have given rise to serious concerns over the Intellectual Property (IP) of their training data. In particular, the synthetic texts generated by LLMs may infringe the IP of the data being used to train the LLMs. To this end, it is imperative to be able to perform source attribution by ident… ▽ More

    Submitted 25 September, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

  3. On Representation Knowledge Distillation for Graph Neural Networks

    Authors: Chaitanya K. Joshi, Fayao Liu, Xu Xun, Jie Lin, Chuan-Sheng Foo

    Abstract: Knowledge distillation is a learning paradigm for boosting resource-efficient graph neural networks (GNNs) using more expressive yet cumbersome teacher models. Past work on distillation for GNNs proposed the Local Structure Preserving loss (LSP), which matches local structural relationships defined over edges across the student and teacher's node embeddings. This paper studies whether preserving t… ▽ More

    Submitted 4 February, 2023; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: IEEE Transactions on Neural Networks and Learning Representation (TNNLS), Special Issue on Deep Neural Networks for Graphs: Theory, Models, Algorithms and Applications

  4. arXiv:2109.11428  [pdf, other

    cs.LG cs.AI stat.ML

    An Evaluation of Anomaly Detection and Diagnosis in Multivariate Time Series

    Authors: Astha Garg, Wenyu Zhang, Jules Samaran, Savitha Ramasamy, Chuan-Sheng Foo

    Abstract: Several techniques for multivariate time series anomaly detection have been proposed recently, but a systematic comparison on a common set of datasets and metrics is lacking. This paper presents a systematic and comprehensive evaluation of unsupervised and semi-supervised deep-learning based methods for anomaly detection and diagnosis on multivariate time series data from cyberphysical systems. Un… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: IEEE Transactions on Neural Networks and Learning Systems

  5. arXiv:2006.14265  [pdf, other

    cs.LG cs.CV stat.ML

    Empirical Analysis of Overfitting and Mode Drop in GAN Training

    Authors: Yasin Yazici, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Vijay Chandrasekhar

    Abstract: We examine two key questions in GAN training, namely overfitting and mode drop, from an empirical perspective. We show that when stochasticity is removed from the training procedure, GANs can overfit and exhibit almost no mode drop. Our results shed light on important characteristics of the GAN training procedure. They also provide evidence against prevailing intuitions that GANs do not memorize t… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: To appear in ICIP2020

  6. Classify and Generate: Using Classification Latent Space Representations for Image Generations

    Authors: Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh, Yasin Yazici, Chuan-Sheng Foo, Vijay Chandrasekhar, ArulMurugan Ambikapathi

    Abstract: Utilization of classification latent space information for downstream reconstruction and generation is an intriguing and a relatively unexplored area. In general, discriminative representations are rich in class-specific features but are too sparse for reconstruction, whereas, in autoencoders the representations are dense but have limited indistinguishable class-specific features, making them less… ▽ More

    Submitted 14 December, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Journal ref: Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh et. al., Classify and generate: Using classification latent space representations for image generations, Neurocomputing, Volume 471, 2022, Pages 296-334, ISSN 0925-2312

  7. arXiv:2002.06015  [pdf, other

    cs.LG stat.ML

    Scalable and Practical Natural Gradient for Large-Scale Deep Learning

    Authors: Kazuki Osawa, Yohei Tsuji, Yuichiro Ueno, Akira Naruse, Chuan-Sheng Foo, Rio Yokota

    Abstract: Large-scale distributed training of deep neural networks results in models with worse generalization performance as a result of the increase in the effective mini-batch size. Previous approaches attempt to address this problem by varying the learning rate and batch size over epochs and layers, or ad hoc modifications of batch normalization. We propose Scalable and Practical Natural Gradient Descen… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: arXiv admin note: text overlap with arXiv:1811.12019

  8. arXiv:1902.03444  [pdf, other

    cs.LG stat.ML

    Venn GAN: Discovering Commonalities and Particularities of Multiple Distributions

    Authors: Yasin Yazıcı, Bruno Lecouat, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

    Abstract: We propose a GAN design which models multiple distributions effectively and discovers their commonalities and particularities. Each data distribution is modeled with a mixture of $K$ generator distributions. As the generators are partially shared between the modeling of different true data distributions, shared ones captures the commonality of the distributions, while non-shared ones capture uniqu… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.

  9. arXiv:1812.02288  [pdf, other

    cs.LG stat.ML

    Adversarially Learned Anomaly Detection

    Authors: Houssam Zenati, Manon Romain, Chuan Sheng Foo, Bruno Lecouat, Vijay Ramaseshan Chandrasekhar

    Abstract: Anomaly detection is a significant and hence well-studied problem. However, developing effective anomaly detection methods for complex and high-dimensional data remains a challenge. As Generative Adversarial Networks (GANs) are able to model the complex high-dimensional distributions of real-world data, they offer a promising approach to address this challenge. In this work, we propose an anomaly… ▽ More

    Submitted 5 December, 2018; originally announced December 2018.

    Comments: In the Proceedings of the 20th IEEE International Conference on Data Mining (ICDM), 2018

  10. arXiv:1807.04307  [pdf, other

    cs.LG stat.ML

    Manifold regularization with GANs for semi-supervised learning

    Authors: Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay Chandrasekhar

    Abstract: Generative Adversarial Networks are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating a variant of the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the semi-supervised feature-matching GAN we achieve state-of-the-art results… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

  11. arXiv:1807.02629  [pdf, other

    cs.LG cs.GT math.OC stat.ML

    Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile

    Authors: Panayotis Mertikopoulos, Bruno Lecouat, Houssam Zenati, Chuan-Sheng Foo, Vijay Chandrasekhar, Georgios Piliouras

    Abstract: Owing to their connection with generative adversarial networks (GANs), saddle-point problems have recently attracted considerable interest in machine learning and beyond. By necessity, most theoretical guarantees revolve around convex-concave (or even linear) problems; however, making theoretical inroads towards efficient GAN training depends crucially on moving beyond this classic framework. To m… ▽ More

    Submitted 1 October, 2018; v1 submitted 7 July, 2018; originally announced July 2018.

    Comments: 26 pages, 14 figures

  12. arXiv:1806.04498  [pdf, other

    stat.ML cs.CV cs.LG

    The Unusual Effectiveness of Averaging in GAN Training

    Authors: Yasin Yazıcı, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

    Abstract: We examine two different techniques for parameter averaging in GAN training. Moving Average (MA) computes the time-average of parameters, whereas Exponential Moving Average (EMA) computes an exponentially discounted sum. Whilst MA is known to lead to convergence in bilinear settings, we provide the -- to our knowledge -- first theoretical arguments in support of EMA. We show that EMA converges to… ▽ More

    Submitted 26 February, 2019; v1 submitted 12 June, 2018; originally announced June 2018.

    Comments: Published as a conference paper at ICLR 2019

  13. arXiv:1805.08957  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Learning with GANs: Revisiting Manifold Regularization

    Authors: Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay R. Chandrasekhar

    Abstract: GANS are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the feature-matching GAN of Improved GAN, we achieve state-of-the-art results for GAN-based semi-supervised learning… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: Accepted paper

    Journal ref: Workshop track - ICLR 2018

  14. arXiv:1802.06222  [pdf, ps, other

    cs.LG stat.ML

    Efficient GAN-Based Anomaly Detection

    Authors: Houssam Zenati, Chuan Sheng Foo, Bruno Lecouat, Gaurav Manek, Vijay Ramaseshan Chandrasekhar

    Abstract: Generative adversarial networks (GANs) are able to model the complex highdimensional distributions of real-world data, which suggests they could be effective for anomaly detection. However, few works have explored the use of GANs for the anomaly detection task. We leverage recently developed GAN models for anomaly detection, and achieve state-of-the-art performance on image and network intrusion d… ▽ More

    Submitted 1 May, 2019; v1 submitted 17 February, 2018; originally announced February 2018.

    Comments: Updated version of this work is published at ICDM 2018, see arXiv:1812.02288 . Submitted to the ICLR Workshop 2018