Skip to main content

Showing 1–1 of 1 results for author: Tarvainen, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:1703.01780  [pdf, other

    cs.NE cs.LG stat.ML

    Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results

    Authors: Antti Tarvainen, Harri Valpola

    Abstract: The recently proposed Temporal Ensembling has achieved state-of-the-art results in several semi-supervised learning benchmarks. It maintains an exponential moving average of label predictions on each training example, and penalizes predictions that are inconsistent with this target. However, because the targets change only once per epoch, Temporal Ensembling becomes unwieldy when learning large da… ▽ More

    Submitted 16 April, 2018; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: In this version: Corrected hyperparameters of the 4000-label CIFAR-10 ResNet experiment. Changed Antti's contact info, Advances in Neural Information Processing Systems 30 (NIPS 2017) pre-proceedings