Eliminate Deviation with Deviation for Data Augmentation and a General Multi-modal Data Learning Method

Gong, Yunpeng; Huang, Liqing; Chen, Lifei


arXiv:2101.08533 (cs)

[Submitted on 21 Jan 2021 (v1), last revised 13 Jun 2022 (this version, v5)]



Abstract: One of the challenges of computer vision is adapting to color deviations in changing environments. Minimizing the adverse effects of color deviation on prediction is therefore one of the main goals of vision tasks. Current solutions focus on using generative models to augment training data and thereby improve invariance to input variation. However, such methods often introduce new noise, which limits the gain from the generated data. To this end, this paper proposes a strategy that eliminates deviation with deviation, named Random Color Dropout (RCD). Our hypothesis is that if there is color deviation between the query image and the gallery image, the retrieval results for some examples will be better after ignoring the color information. Specifically, this strategy balances the weights between color features and color-independent features in the neural network by dropping out part of the color information in the training data, so as to overcome the effect of color deviation. The proposed RCD can be combined with various existing ReID models without changing the learning strategy, and can be applied to other computer vision fields, such as object detection. Experiments on several ReID baselines and three common large-scale datasets (Market1501, DukeMTMC, and MSMT17) have verified the effectiveness of this method. Cross-domain experiments have shown that this strategy significantly helps eliminate the domain gap. Furthermore, to understand the working mechanism of RCD, we analyze the effectiveness of this strategy from the perspective of classification, which reveals that in visual tasks with strong domain variations it may be better to use much of the color information rather than all of it.
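
The abstract does not say how the color information is dropped, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that "dropping" color means replacing a whole RGB training image with its grayscale version with some probability p. The function name random_color_dropout, the parameter p, and the use of BT.601 luma weights are all assumptions made for illustration.

```python
import numpy as np

# ITU-R BT.601 luma weights, a common choice for RGB-to-grayscale conversion.
# Using them here is an assumption; the abstract does not specify a conversion.
LUMA_WEIGHTS = np.array([0.299, 0.587, 0.114])


def random_color_dropout(image, p=0.5, rng=None):
    """Hypothetical helper: with probability p, drop the color of an image.

    image: uint8 array of shape (H, W, 3) in RGB order.
    Returns the image unchanged, or a 3-channel grayscale copy of it.
    """
    rng = np.random.default_rng() if rng is None else rng
    if rng.random() >= p:
        return image  # keep the color information for this sample

    # Weighted sum over the channel axis gives one luminance value per pixel.
    gray = image.astype(np.float64) @ LUMA_WEIGHTS
    gray = np.clip(np.rint(gray), 0, 255).astype(image.dtype)

    # Repeat the single channel so the shape still matches the model input.
    return np.repeat(gray[..., None], 3, axis=-1)
```

Applied per sample inside a training data pipeline, a transform like this leaves the network architecture and loss unchanged, which is consistent with the claim that RCD can be combined with existing ReID models without changing the learning strategy.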

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2101.08533 [cs.CV]
	(or arXiv:2101.08533v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2101.08533

Submission history

From: Yunpeng Gong
[v1] Thu, 21 Jan 2021 10:33:02 UTC (686 KB)
[v2] Wed, 7 Apr 2021 08:26:49 UTC (1,611 KB)
[v3] Mon, 31 May 2021 15:15:14 UTC (765 KB)
[v4] Tue, 1 Jun 2021 01:30:13 UTC (765 KB)
[v5] Mon, 13 Jun 2022 14:16:14 UTC (789 KB)
