ParaFormer: Parallel Attention Transformer for Efficient Feature Matching

Lu, Xiaoyong; Yan, Yaping; Kang, Bin; Du, Songlin


arXiv:2303.00941 (cs)

[Submitted on 2 Mar 2023 (v1), last revised 10 Mar 2023 (this version, v2)]

Abstract: Heavy computation is a bottleneck that prevents deep-learning-based feature matching algorithms from being applied in many real-time applications. However, existing lightweight networks optimized for Euclidean data cannot address classical feature matching tasks, since sparse keypoint-based descriptors are expected to be matched. This paper tackles this problem and proposes two concepts: 1) a novel parallel attention model entitled ParaFormer and 2) a graph-based U-Net architecture with attentional pooling. First, ParaFormer fuses features and keypoint positions through the concept of amplitude and phase, and integrates self- and cross-attention in a parallel manner, which improves both accuracy and efficiency. Second, with the U-Net architecture and the proposed attentional pooling, the ParaFormer-U variant significantly reduces computational complexity and minimizes the performance loss caused by downsampling. Extensive experiments on various applications, including homography estimation, pose estimation, and image matching, demonstrate that ParaFormer achieves state-of-the-art performance while maintaining high efficiency. The efficient ParaFormer-U variant achieves comparable performance with less than 50% of the FLOPs of existing attention-based models.
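
The idea of running self- and cross-attention in parallel can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how the two branches are combined, so the averaging step, the residual connection, the absence of learned projections, and the names `attention` and `parallel_attention_layer` are all assumptions made for illustration only.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    # Plain scaled dot-product attention (no learned projections).
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)
    return softmax(scores, axis=-1) @ v

def parallel_attention_layer(x_a, x_b):
    # Hypothetical layer: the self and cross branches read the SAME
    # input, so neither has to wait for the other (unlike a serial
    # self-then-cross stack). Fusion by averaging is an assumption.
    self_a = attention(x_a, x_a, x_a)    # image A attends to itself
    cross_a = attention(x_a, x_b, x_b)   # image A attends to image B
    self_b = attention(x_b, x_b, x_b)
    cross_b = attention(x_b, x_a, x_a)
    out_a = x_a + 0.5 * (self_a + cross_a)  # residual + fused update
    out_b = x_b + 0.5 * (self_b + cross_b)
    return out_a, out_b

# Toy descriptors: 5 keypoints in image A, 7 in image B, 8 dimensions.
rng = np.random.default_rng(0)
desc_a = rng.normal(size=(5, 8))
desc_b = rng.normal(size=(7, 8))
out_a, out_b = parallel_attention_layer(desc_a, desc_b)
```

Each output keeps the shape of its input, so layers of this kind can be stacked; a real model would add learned query, key, and value projections and multiple heads.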

Comments:	Accepted by AAAI 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.00941 [cs.CV]
	(or arXiv:2303.00941v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.00941

Submission history

From: Xiaoyong Lu
[v1] Thu, 2 Mar 2023 03:29:16 UTC (8,242 KB)
[v2] Fri, 10 Mar 2023 02:52:47 UTC (8,414 KB)
