Training Over-parameterized Models with Non-decomposable Objectives

Narasimhan, Harikrishna; Menon, Aditya Krishna

arXiv:2107.04641 (cs)

[Submitted on 9 Jul 2021]

Abstract: Many modern machine learning applications come with complex and nuanced design goals such as minimizing the worst-case error, satisfying a given precision or recall target, or enforcing group-fairness constraints. Popular techniques for optimizing such non-decomposable objectives reduce the problem to a sequence of cost-sensitive learning tasks, each of which is then solved by re-weighting the training loss with example-specific costs. We point out that the standard approach of re-weighting the loss to incorporate label costs can produce unsatisfactory results when used to train over-parameterized models. As a remedy, we propose new cost-sensitive losses that extend the classical idea of logit adjustment to handle more general cost matrices. Our losses are calibrated, and can be further improved with distilled labels from a teacher model. Through experiments on benchmark image datasets, we showcase the effectiveness of our approach in training ResNet models with common robust and constrained optimization objectives.
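
To make the contrast in the abstract concrete, the following is a minimal sketch of the two ideas it names: re-weighting the cross-entropy loss by a label cost, and shifting the logits by a function of that cost (logit adjustment). It is an illustration under stated assumptions, not the losses proposed in the paper: it covers only per-class (diagonal) costs, whereas the paper addresses more general cost matrices, and the function names, the `costs` vector, and the example scores are all hypothetical.

```python
import math


def log_softmax(scores):
    """Numerically stable log-softmax over a list of class scores."""
    m = max(scores)
    lse = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - lse for s in scores]


def reweighted_loss(scores, label, costs):
    """Cross-entropy multiplied by the cost of the true label.

    `costs` is a hypothetical per-class cost vector (diagonal-cost case).
    """
    return -costs[label] * log_softmax(scores)[label]


def logit_adjusted_loss(scores, label, costs):
    """Cross-entropy on logits shifted by -log(cost).

    Assumption: diagonal costs only. The shift makes a costly class harder
    to fit during training, so the learned raw score for that class is
    pushed upward relative to the others.
    """
    shifted = [s - math.log(c) for s, c in zip(scores, costs)]
    return -log_softmax(shifted)[label]


# Hypothetical three-class example in which class 2 is four times as costly.
scores = [2.0, 0.5, -1.0]
costs = [1.0, 1.0, 4.0]

plain = -log_softmax(scores)[2]
reweighted = reweighted_loss(scores, 2, costs)
adjusted = logit_adjusted_loss(scores, 2, costs)
```

With unit costs both functions reduce to ordinary cross-entropy. With non-unit costs, re-weighting only rescales the loss value, while the logit shift changes which score vectors attain a low loss; that difference is what the sketch is meant to show.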

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as: arXiv:2107.04641 [cs.LG] (or arXiv:2107.04641v1 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2107.04641

Submission history

From: Harikrishna Narasimhan
[v1] Fri, 9 Jul 2021 19:29:33 UTC (134 KB)

