CrisisBERT: a Robust Transformer for Crisis Classification and Contextual Crisis Embedding

Liu, Junhua; Singhal, Trisha; Blessing, Lucienne T. M.; Wood, Kristin L.; Lim, Kwan Hui


arXiv:2005.06627 (cs)


[Submitted on 11 May 2020 (v1), last revised 18 May 2020 (this version, v2)]


Abstract: Classification of crisis events, such as natural disasters, terrorist attacks and pandemics, is a crucial task to create early signals and inform relevant parties for spontaneous actions to reduce overall damage. Although crises such as natural disasters can be predicted by professional institutions, certain events, such as the recent COVID-19 pandemic, are first signaled by civilians. Social media platforms such as Twitter often expose firsthand signals on such crises through high-volume information exchange, with over half a billion tweets posted daily. Prior works proposed various crisis embeddings and classification approaches using conventional Machine Learning and Neural Network models. However, none of these works performs crisis embedding and classification using state-of-the-art attention-based deep neural network models, such as Transformers and document-level contextual embeddings. This work proposes CrisisBERT, an end-to-end transformer-based model for two crisis classification tasks, namely crisis detection and crisis recognition, which shows promising results in both accuracy and F1 scores. The proposed model also demonstrates superior robustness over the benchmarks, as it shows only a marginal performance compromise when extending from 6 to 36 events with only 51.4% additional data points. We also propose Crisis2Vec, an attention-based, document-level contextual embedding architecture for crisis embedding, which achieves better performance than conventional crisis embedding methods such as Word2Vec and GloVe. To the best of our knowledge, our work is the first in the literature to propose transformer-based crisis classification and document-level contextual crisis embedding.
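
The abstract contrasts attention-based, document-level embeddings with word-level methods such as Word2Vec and GloVe. As a rough illustration of that contrast only, the sketch below compares attention-weighted pooling of token vectors with plain averaging. It is not the authors' Crisis2Vec or CrisisBERT architecture, which the abstract does not specify; the function names, the toy token vectors, and the `query` vector are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(token_vecs, query):
    """Collapse per-token vectors into a single document vector.

    Each token is weighted by the softmax of its scaled dot product
    with `query` (a hypothetical stand-in for a learned vector).
    """
    dim = len(query)
    scores = [
        sum(t * q for t, q in zip(vec, query)) / math.sqrt(dim)
        for vec in token_vecs
    ]
    weights = softmax(scores)
    doc_vec = [
        sum(w * vec[i] for w, vec in zip(weights, token_vecs))
        for i in range(dim)
    ]
    return doc_vec, weights


def mean_pool(token_vecs):
    """Baseline: unweighted average of the token vectors."""
    n = len(token_vecs)
    dim = len(token_vecs[0])
    return [sum(vec[i] for vec in token_vecs) / n for i in range(dim)]


# Toy 3-dimensional token vectors, made up for illustration.
tokens = [[0.1, 0.0, 0.2], [0.9, 0.8, 0.1], [0.0, 0.1, 0.0]]
query = [1.0, 1.0, 0.0]

doc_vec, weights = attention_pool(tokens, query)
avg_vec = mean_pool(tokens)
```

In this toy setup the second token aligns most closely with `query`, so it receives the largest weight and pulls the document vector toward itself, whereas the plain average treats all three tokens equally.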

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Cite as:	arXiv:2005.06627 [cs.CL]
	(or arXiv:2005.06627v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2005.06627

Submission history

From: Junhua Liu
[v1] Mon, 11 May 2020 09:57:24 UTC (191 KB)
[v2] Mon, 18 May 2020 07:58:23 UTC (191 KB)
