Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

Bu, Yuheng; Aminian, Gholamali; Toni, Laura; Rodrigues, Miguel; Wornell, Gregory

Computer Science > Machine Learning

arXiv:2111.01635 (cs)

[Submitted on 2 Nov 2021]

Title:Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

Authors:Yuheng Bu, Gholamali Aminian, Laura Toni, Miguel Rodrigues, Gregory Wornell

View PDF

Abstract:We provide an information-theoretic analysis of the generalization ability of Gibbs-based transfer learning algorithms by focusing on two popular transfer learning approaches, $\alpha$-weighted-ERM and two-stage-ERM. Our key result is an exact characterization of the generalization behaviour using the conditional symmetrized KL information between the output hypothesis and the target training samples given the source samples. Our results can also be applied to provide novel distribution-free generalization error upper bounds on these two aforementioned Gibbs algorithms. Our approach is versatile, as it also characterizes the generalization errors and excess risks of these two Gibbs algorithms in the asymptotic regime, where they converge to the $\alpha$-weighted-ERM and two-stage-ERM, respectively. Based on our theoretical results, we show that the benefits of transfer learning can be viewed as a bias-variance trade-off, with the bias induced by the source distribution and the variance induced by the lack of target samples. We believe this viewpoint can guide the choice of transfer learning algorithms in practice.

Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as:	arXiv:2111.01635 [cs.LG]
	(or arXiv:2111.01635v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2111.01635

Submission history

From: Yuheng Bu [view email]
[v1] Tue, 2 Nov 2021 14:49:48 UTC (41 KB)

Computer Science > Machine Learning

Title:Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators