COMET: Coverage-guided Model Generation For Deep Learning Library Testing

Li, Meiziniu; Cao, Jialun; Tian, Yongqiang; Li, Tsz On; Wen, Ming; Cheung, Shing-Chi

doi:10.1145/3583566


arXiv:2208.01508 (cs)

[Submitted on 2 Aug 2022 (v1), last revised 30 Jan 2023 (this version, v2)]

Abstract: Recent deep learning (DL) applications are mostly built on top of DL libraries. The quality assurance of these libraries is critical to the dependable deployment of DL applications. Techniques have been proposed to generate various DL models and use them to test these libraries. However, their test effectiveness is constrained by the limited diversity of layer API calls in the generated DL models. Our study reveals that these techniques cover at most 34.1% of layer inputs, 25.9% of layer parameter values, and 15.6% of layer sequences. As a result, many bugs arising from specific layer API calls (i.e., specific layer inputs, parameter values, or layer sequences) can be missed by existing techniques. To address this limitation, we propose COMET, which effectively generates DL models with diverse layer API calls for DL library testing. COMET (1) designs a set of mutation operators and a coverage-based search algorithm to diversify layer inputs, layer parameter values, and layer sequences in DL models, and (2) proposes a model synthesis method to boost test efficiency without compromising layer API call diversity. Our evaluation shows that COMET outperforms the baselines, covering twice as many layer inputs (69.7% vs. 34.1%), layer parameter values (50.2% vs. 25.9%), and layer sequences (39.0% vs. 15.6%) as the state of the art. Moreover, COMET covers 3.4% more library branches than existing techniques. Finally, COMET detects 32 new bugs in the latest versions of eight popular DL libraries, including TensorFlow and MXNet; 21 of them have been confirmed by DL library developers, and 7 of the confirmed bugs have been fixed.
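
The abstract names the ingredients of the approach (mutation operators plus a coverage-based search over layer inputs, parameter values, and layer sequences) without giving the algorithm itself. The Python sketch below is only a rough illustration of that general idea and is not COMET's implementation: the layer catalogue `LAYER_PARAMS`, the `DTYPES` list, the three mutation operators, and the greedy "keep a mutant if it covers something new" rule are all assumptions made for the example.

```python
import random
from dataclasses import dataclass

# Hypothetical layer catalogue (assumption): layer type -> {parameter: candidate values}.
LAYER_PARAMS = {
    "Conv2D": {"kernel_size": (1, 3, 5), "padding": ("valid", "same")},
    "Dense": {"units": (8, 16, 32), "use_bias": (True, False)},
    "ReLU": {"max_value": (None, 6.0)},
}
DTYPES = ("float32", "float16")  # hypothetical input dtypes


@dataclass(frozen=True)
class Layer:
    kind: str
    params: tuple  # (name, value) pairs, kept as a tuple so the layer is hashable
    input_dtype: str = "float32"


def random_layer(rng):
    """Draw a layer type, one value per parameter, and an input dtype."""
    kind = rng.choice(sorted(LAYER_PARAMS))
    params = tuple(
        (name, rng.choice(values))
        for name, values in sorted(LAYER_PARAMS[kind].items())
    )
    return Layer(kind, params, rng.choice(DTYPES))


def coverage_items(model):
    """Collect the layer-input, parameter-value, and layer-sequence items of a model."""
    items = set()
    for layer in model:
        items.add(("input", layer.kind, layer.input_dtype))
        for name, value in layer.params:
            items.add(("param", layer.kind, name, value))
    for prev, nxt in zip(model, model[1:]):
        items.add(("sequence", prev.kind, nxt.kind))
    return items


def mutate(model, rng):
    """Apply one illustrative mutation: insert, replace, or remove a layer."""
    mutant = list(model)
    op = rng.choice(("insert", "replace", "remove"))
    if op == "insert" or len(mutant) < 2:  # never shrink a one-layer model
        mutant.insert(rng.randrange(len(mutant) + 1), random_layer(rng))
    elif op == "replace":
        mutant[rng.randrange(len(mutant))] = random_layer(rng)
    else:
        del mutant[rng.randrange(len(mutant))]
    return mutant


def search(seed_model, budget, rng):
    """Greedy coverage-guided search: keep mutants that cover at least one new item."""
    covered = coverage_items(seed_model)
    corpus = [seed_model]
    for _ in range(budget):
        mutant = mutate(rng.choice(corpus), rng)
        new_items = coverage_items(mutant) - covered
        if new_items:
            covered |= new_items
            corpus.append(mutant)
    return corpus, covered


if __name__ == "__main__":
    seed = [Layer("Dense", (("units", 8), ("use_bias", True)))]
    corpus, covered = search(seed, budget=200, rng=random.Random(0))
    print(len(corpus), "models kept;", len(covered), "coverage items reached")
```

In a real setting each kept model would also be built and executed against the libraries under test; the sketch only tracks which layer API call variants the generated models would exercise.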

Comments:	34 pages, 12 figures
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
ACM classes:	D.2.5; I.2.5
Cite as:	arXiv:2208.01508 [cs.SE]
	(or arXiv:2208.01508v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2208.01508
Related DOI:	https://doi.org/10.1145/3583566

Submission history

From: Meiziniu Li
[v1] Tue, 2 Aug 2022 14:53:02 UTC (736 KB)
[v2] Mon, 30 Jan 2023 12:01:23 UTC (1,098 KB)
