Skip to main content

Showing 1–10 of 10 results for author: Luong, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2011.04419  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Domain-Agnostic Contrastive Learning

    Authors: Vikas Verma, Minh-Thang Luong, Kenji Kawaguchi, Hieu Pham, Quoc V. Le

    Abstract: Despite recent success, most contrastive self-supervised learning methods are domain-specific, relying heavily on data augmentation techniques that require knowledge about a particular domain, such as image cropping and rotation. To overcome such limitation, we propose a novel domain-agnostic approach to contrastive learning, named DACL, that is applicable to domains where invariances, and thus, d… ▽ More

    Submitted 19 July, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Published in ICML 2021

  2. arXiv:2003.10580  [pdf, other

    cs.LG stat.ML

    Meta Pseudo Labels

    Authors: Hieu Pham, Zihang Dai, Qizhe Xie, Minh-Thang Luong, Quoc V. Le

    Abstract: We present Meta Pseudo Labels, a semi-supervised learning method that achieves a new state-of-the-art top-1 accuracy of 90.2% on ImageNet, which is 1.6% better than the existing state-of-the-art. Like Pseudo Labels, Meta Pseudo Labels has a teacher network to generate pseudo labels on unlabeled data to teach a student network. However, unlike Pseudo Labels where the teacher is fixed, the teacher i… ▽ More

    Submitted 1 March, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: Preprint

  3. arXiv:2001.09977  [pdf, other

    cs.CL cs.LG cs.NE stat.ML

    Towards a Human-like Open-Domain Chatbot

    Authors: Daniel Adiwardana, Minh-Thang Luong, David R. So, Jamie Hall, Noah Fiedel, Romal Thoppilan, Zi Yang, Apoorv Kulshreshtha, Gaurav Nemade, Yifeng Lu, Quoc V. Le

    Abstract: We present Meena, a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations. This 2.6B parameter neural network is simply trained to minimize perplexity of the next token. We also propose a human evaluation metric called Sensibleness and Specificity Average (SSA), which captures key elements of a human-like multi-turn conversation.… ▽ More

    Submitted 27 February, 2020; v1 submitted 27 January, 2020; originally announced January 2020.

    Comments: 38 pages, 12 figures

  4. arXiv:1911.04252  [pdf, other

    cs.LG cs.CV stat.ML

    Self-training with Noisy Student improves ImageNet classification

    Authors: Qizhe Xie, Minh-Thang Luong, Eduard Hovy, Quoc V. Le

    Abstract: We present Noisy Student Training, a semi-supervised learning approach that works well even when labeled data is abundant. Noisy Student Training achieves 88.4% top-1 accuracy on ImageNet, which is 2.0% better than the state-of-the-art model that requires 3.5B weakly labeled Instagram images. On robustness test sets, it improves ImageNet-A top-1 accuracy from 61.0% to 83.7%, reduces ImageNet-C mea… ▽ More

    Submitted 19 June, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: CVPR 2020

  5. arXiv:1906.02940  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Selfie: Self-supervised Pretraining for Image Embedding

    Authors: Trieu H. Trinh, Minh-Thang Luong, Quoc V. Le

    Abstract: We introduce a pretraining technique called Selfie, which stands for SELFie supervised Image Embedding. Selfie generalizes the concept of masked language modeling of BERT (Devlin et al., 2019) to continuous data, such as images, by making use of the Contrastive Predictive Coding loss (Oord et al., 2018). Given masked-out patches in an input image, our method learns to select the correct patch, amo… ▽ More

    Submitted 27 July, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  6. arXiv:1904.12848  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Unsupervised Data Augmentation for Consistency Training

    Authors: Qizhe Xie, Zihang Dai, Eduard Hovy, Minh-Thang Luong, Quoc V. Le

    Abstract: Semi-supervised learning lately has shown much promise in improving deep learning models when labeled data is scarce. Common among recent approaches is the use of consistency training on a large amount of unlabeled data to constrain model predictions to be invariant to input noise. In this work, we present a new perspective on how to effectively noise unlabeled examples and argue that the quality… ▽ More

    Submitted 5 November, 2020; v1 submitted 29 April, 2019; originally announced April 2019.

    Comments: NeurIPS 2020

  7. arXiv:1803.00144  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Learning Longer-term Dependencies in RNNs with Auxiliary Losses

    Authors: Trieu H. Trinh, Andrew M. Dai, Minh-Thang Luong, Quoc V. Le

    Abstract: Despite recent advances in training recurrent neural networks (RNNs), capturing long-term dependencies in sequences remains a fundamental challenge. Most approaches use backpropagation through time (BPTT), which is difficult to scale to very long sequences. This paper proposes a simple method that improves the ability to capture long term dependencies in RNNs by adding an unsupervised auxiliary lo… ▽ More

    Submitted 13 June, 2018; v1 submitted 28 February, 2018; originally announced March 2018.

    Comments: ICML 2018

  8. arXiv:1511.06114  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Multi-task Sequence to Sequence Learning

    Authors: Minh-Thang Luong, Quoc V. Le, Ilya Sutskever, Oriol Vinyals, Lukasz Kaiser

    Abstract: Sequence to sequence learning has recently emerged as a new paradigm in supervised learning. To date, most of its applications focused on only one task and not much work explored this framework for multiple tasks. This paper examines three multi-task learning (MTL) settings for sequence to sequence models: (a) the oneto-many setting - where the encoder is shared between several tasks such as machi… ▽ More

    Submitted 1 March, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: 10 pages, 4 figures, ICLR 2016 camera-ready, added parsing SOTA results

  9. arXiv:1212.1778  [pdf, other

    stat.AP stat.CO

    Hidden Markov Model Applications in Change-Point Analysis

    Authors: The Minh Luong, Vittorio Perduca, Gregory Nuel

    Abstract: The detection of change-points in heterogeneous sequences is a statistical challenge with many applications in fields such as finance, signal analysis and biology. A wide variety of literature exists for finding an ideal set of change-points for characterizing the data. In this tutorial we elaborate on the Hidden Markov Model (HMM) and present two different frameworks for applying HMM to change-po… ▽ More

    Submitted 8 December, 2012; originally announced December 2012.

  10. arXiv:1211.3210  [pdf, other

    stat.CO stat.AP

    Fast estimation of the ICL criterion for change-point detection problems with applications to Next-Generation Sequencing data

    Authors: Alice Cleynen, The Minh Luong, Guillem Rigaill, Gregory Nuel

    Abstract: In this paper, we consider the Integrated Completed Likelihood (ICL) as a useful criterion for estimating the number of changes in the underlying distribution of data in problems where detecting the precise location of these changes is the main goal. The exact computation of the ICL requires O(Kn2) operations (with K the number of segments and n the number of data-points) which is prohibitive in m… ▽ More

    Submitted 1 July, 2013; v1 submitted 14 November, 2012; originally announced November 2012.

    Comments: 15 pages, 8 figures