Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks

Veeriah, Vivek; Zhang, Shangtong; Sutton, Richard S.

Computer Science > Machine Learning

arXiv:1612.02879 (cs)

[Submitted on 9 Dec 2016 (v1), last revised 27 Apr 2017 (this version, v2)]

Title:Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks

Authors:Vivek Veeriah, Shangtong Zhang, Richard S. Sutton

View PDF

Abstract:Representations are fundamental to artificial intelligence. The performance of a learning system depends on the type of representation used for representing the data. Typically, these representations are hand-engineered using domain knowledge. More recently, the trend is to learn these representations through stochastic gradient descent in multi-layer neural networks, which is called backprop. Learning the representations directly from the incoming data stream reduces the human labour involved in designing a learning system. More importantly, this allows in scaling of a learning system for difficult tasks. In this paper, we introduce a new incremental learning algorithm called crossprop, which learns incoming weights of hidden units based on the meta-gradient descent approach, that was previously introduced by Sutton (1992) and Schraudolph (1999) for learning step-sizes. The final update equation introduces an additional memory parameter for each of these weights and generalizes the backprop update equation. From our experiments, we show that crossprop learns and reuses its feature representation while tackling new and unseen tasks whereas backprop relearns a new feature representation.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1612.02879 [cs.LG]
	(or arXiv:1612.02879v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1612.02879

Submission history

From: Vivek Veeriah [view email]
[v1] Fri, 9 Dec 2016 00:56:42 UTC (7,274 KB)
[v2] Thu, 27 Apr 2017 14:53:00 UTC (2,288 KB)

Computer Science > Machine Learning

Title:Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators