Hybrid Focal and Full-Range Attention Based Graph Transformers

Zhu, Minhong; Zhao, Zhenhao; Cai, Weiran

doi:10.1109/IJCNN60899.2024.10651554

Computer Science > Machine Learning

arXiv:2311.04653 (cs)

[Submitted on 8 Nov 2023 (v1), last revised 10 Sep 2024 (this version, v2)]

Title:Hybrid Focal and Full-Range Attention Based Graph Transformers

Authors:Minhong Zhu, Zhenhao Zhao, Weiran Cai

View PDF HTML (experimental)

Abstract:The paradigm of Transformers using the self-attention mechanism has manifested its advantage in learning graph-structured data. Yet, Graph Transformers are capable of modeling full range dependencies but are often deficient in extracting information from locality. A common practice is to utilize Message Passing Neural Networks (MPNNs) as an auxiliary to capture local information, which however are still inadequate for comprehending substructures. In this paper, we present a purely attention-based architecture, namely Focal and Full-Range Graph Transformer (FFGT), which can mitigate the loss of local information in learning global correlations. The core component of FFGT is a new mechanism of compound attention, which combines the conventional full-range attention with K-hop focal attention on ego-nets to aggregate both global and local information. Beyond the scope of canonical Transformers, the FFGT has the merit of being more substructure-aware. Our approach enhances the performance of existing Graph Transformers on various open datasets, while achieves compatible SOTA performance on several Long-Range Graph Benchmark (LRGB) datasets even with a vanilla transformer. We further examine influential factors on the optimal focal length of attention via introducing a novel synthetic dataset based on SBM-PATTERN.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.04653 [cs.LG]
	(or arXiv:2311.04653v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.04653
Related DOI:	https://doi.org/10.1109/IJCNN60899.2024.10651554

Submission history

From: Minhong Zhu [view email]
[v1] Wed, 8 Nov 2023 12:53:07 UTC (284 KB)
[v2] Tue, 10 Sep 2024 03:38:37 UTC (1,286 KB)

Computer Science > Machine Learning

Title:Hybrid Focal and Full-Range Attention Based Graph Transformers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Hybrid Focal and Full-Range Attention Based Graph Transformers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators