Deep Tensor Network

Zhang, Yifan

Computer Science > Machine Learning

arXiv:2311.11091 (cs)

[Submitted on 18 Nov 2023 (v1), last revised 31 Aug 2025 (this version, v3)]

Title:Deep Tensor Network

Authors:Yifan Zhang

View PDF HTML (experimental)

Abstract:The quadratic complexity of dot-product attention introduced in Transformer remains a fundamental bottleneck impeding the progress of foundation models toward unbounded context lengths. Addressing this challenge, we introduce the Deep Tensor Network, a new architectural framework that fundamentally reformulates attention by unifying the expressive power of tensor algebra with neural network design. Our approach moves beyond both conventional dot-product attention and subsequent linear-time approximations to capture higher-order statistical dependencies. We introduce two core operators derived from this framework: \emph{Tensor Attention}, which models complex token-mixing via data-dependent polynomial kernels, and Tensor Interaction, a novel mechanism for adaptive channel-mixing. We demonstrate that these operators are powered by second-order summaries that entirely bypass the formation of $n \times n$ matrices, enabling a causality-preserving streaming implementation with $O(d^2)$ per-token updates and $O(d^2)$ state. This efficiency rivals that of modern State Space Models while retaining an attention-like formulation. The Deep Tensor Network thus provides a principled and powerful new class of building blocks for next-generation sequence models, bridging the gap between scalable computation and rich, expressive interaction modeling.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Quantum Physics (quant-ph)
Cite as:	arXiv:2311.11091 [cs.LG]
	(or arXiv:2311.11091v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.11091

Submission history

From: Yifan Zhang [view email]
[v1] Sat, 18 Nov 2023 14:41:33 UTC (100 KB)
[v2] Tue, 11 Mar 2025 04:55:59 UTC (77 KB)
[v3] Sun, 31 Aug 2025 04:19:07 UTC (76 KB)

Computer Science > Machine Learning

Title:Deep Tensor Network

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Deep Tensor Network

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators