DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling

Chen, Jiecao; Yang, Liu; Raman, Karthik; Bendersky, Michael; Yeh, Jung-Jung; Zhou, Yun; Najork, Marc; Cai, Danyang; Emadzadeh, Ehsan

doi:10.18653/v1/2020.findings-emnlp.264

arXiv:2010.03099 (cs)

[Submitted on 7 Oct 2020]

Abstract:Pre-trained models like BERT (Devlin et al., 2018) have dominated NLP / IR applications such as single sentence classification, text pair classification, and question answering. However, deploying these models in real systems is highly non-trivial due to their exorbitant computational costs. A common remedy to this is knowledge distillation (Hinton et al., 2015), leading to faster inference. However -- as we show here -- existing works are not optimized for dealing with pairs (or tuples) of texts. Consequently, they are either not scalable or demonstrate subpar performance. In this work, we propose DiPair -- a novel framework for distilling fast and accurate models on text pair tasks. Coupled with an end-to-end training strategy, DiPair is both highly scalable and offers improved quality-speed tradeoffs. Empirical studies conducted on both academic and real-world e-commerce benchmarks demonstrate the efficacy of the proposed approach with speedups of over 350x and minimal quality drop relative to the cross-attention teacher BERT model.
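As a generic illustration of the knowledge distillation idea cited above (Hinton et al., 2015), the following minimal sketch computes a temperature-softened distillation loss between teacher and student logits. It is not DiPair's architecture or training objective, which the abstract does not specify; the function names, the default temperature of 2.0, and the example logits are all assumptions made for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits, softened by temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    The T^2 factor follows the common soft-target convention so that
    gradient magnitudes stay comparable across temperatures.
    """
    p = softmax(teacher_logits, temperature)  # soft targets from the teacher
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)
    return kl * temperature ** 2


# Hypothetical logits for one text pair (e.g. "match" vs. "no match").
teacher = [2.5, -1.0]
student = [1.0, 0.2]
print(distillation_loss(teacher, student))
```

The loss is zero when the student reproduces the teacher's softened distribution and grows as the two diverge, so minimizing it pushes a cheaper student toward the behavior of an expensive teacher.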

Comments:	13 pages. Accepted to Findings of EMNLP 2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2010.03099 [cs.CL]
	(or arXiv:2010.03099v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2010.03099
Related DOI:	https://doi.org/10.18653/v1/2020.findings-emnlp.264

Submission history

From: Jiecao Chen
[v1] Wed, 7 Oct 2020 01:19:23 UTC (164 KB)
