Formal Language Recognition by Hard Attention Transformers: Perspectives from Circuit Complexity

Hao, Yiding; Angluin, Dana; Frank, Robert

arXiv:2204.06618 (cs)

[Submitted on 13 Apr 2022]

Abstract: This paper analyzes three formal models of Transformer encoders that differ in the form of their self-attention mechanism: unique hard attention (UHAT); generalized unique hard attention (GUHAT), which generalizes UHAT; and averaging hard attention (AHAT). We show that UHAT and GUHAT Transformers, viewed as string acceptors, can only recognize formal languages in the complexity class AC$^0$, the class of languages recognizable by families of Boolean circuits of constant depth and polynomial size. This upper bound subsumes Hahn's (2020) results that GUHAT cannot recognize the DYCK languages or the PARITY language, since those languages are outside AC$^0$ (Furst et al., 1984). In contrast, the non-AC$^0$ languages MAJORITY and DYCK-1 are recognizable by AHAT networks, implying that AHAT can recognize languages that UHAT and GUHAT cannot.
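The contrast between unique and averaging hard attention can be illustrated with a minimal Python sketch. It is not the paper's construction. It assumes that unique hard attention returns the value at a single maximal-score position (leftmost tie-breaking is chosen here arbitrarily), that averaging hard attention returns the mean of the values at all maximal-score positions, and that MAJORITY means binary strings with strictly more 1s than 0s. All function names are hypothetical.

```python
from fractions import Fraction


def unique_hard_attention(scores, values):
    """Return the value at one maximal-score position.

    Assumption: ties are broken toward the leftmost position.
    """
    best = max(range(len(scores)), key=lambda i: scores[i])
    return values[best]


def averaging_hard_attention(scores, values):
    """Return the exact mean of the values at all maximal-score positions."""
    top = max(scores)
    tied = [v for s, v in zip(scores, values) if s == top]
    return sum(tied, Fraction(0)) / len(tied)


def majority_via_averaging(bits):
    """Illustrative check for MAJORITY (assumed: strictly more 1s than 0s).

    With every score tied, the averaged value equals the fraction of 1s,
    so a threshold at 1/2 decides membership.
    """
    if not bits:
        return False
    scores = [0] * len(bits)          # all positions tie for the maximum
    values = [int(b) for b in bits]   # value at each position is the bit itself
    return averaging_hard_attention(scores, values) > Fraction(1, 2)


def in_dyck1(s):
    """Reference membership test for DYCK-1 (balanced single-type brackets)."""
    depth = 0
    for ch in s:
        depth += 1 if ch == "(" else -1
        if depth < 0:                 # a closing bracket with no open partner
            return False
    return depth == 0


if __name__ == "__main__":
    print(unique_hard_attention([1, 3, 3], [10, 20, 30]))
    print(averaging_hard_attention([1, 3, 3], [10, 20, 30]))
    print(majority_via_averaging("1101"), majority_via_averaging("1100"))
    print(in_dyck1("(()())"), in_dyck1("())("))
```

With tied scores, the unique variant sees only one position, whereas the averaging variant aggregates over all tied positions. That aggregation is what lets the sketch read off the fraction of 1s. `in_dyck1` is only a reference membership test, not an attention-based recognizer.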

Comments:	To appear in Transactions of the Association for Computational Linguistics
Subjects:	Computational Complexity (cs.CC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Formal Languages and Automata Theory (cs.FL); Machine Learning (cs.LG)
Cite as:	arXiv:2204.06618 [cs.CC]
	(or arXiv:2204.06618v1 [cs.CC] for this version)
	https://doi.org/10.48550/arXiv.2204.06618

Submission history

From: Yiding Hao
[v1] Wed, 13 Apr 2022 19:25:42 UTC (37 KB)
