Distributed Cross-Channel Hierarchical Aggregation for Foundation Models

Tsaris, Aristeidis; Lyngaas, Isaac; Lagregren, John; Wahib, Mohamed; York, Larry; Balaprakash, Prasanna; Lu, Dan; Wang, Feiyi; Wang, Xiao

Computer Science > Machine Learning

arXiv:2506.21411 (cs)

[Submitted on 26 Jun 2025]

Title:Distributed Cross-Channel Hierarchical Aggregation for Foundation Models

Authors:Aristeidis Tsaris, Isaac Lyngaas, John Lagregren, Mohamed Wahib, Larry York, Prasanna Balaprakash, Dan Lu, Feiyi Wang, Xiao Wang

View PDF HTML (experimental)

Abstract:Vision-based scientific foundation models hold significant promise for advancing scientific discovery and innovation. This potential stems from their ability to aggregate images from diverse sources such as varying physical groundings or data acquisition systems and to learn spatio-temporal correlations using transformer architectures. However, tokenizing and aggregating images can be compute-intensive, a challenge not fully addressed by current distributed methods. In this work, we introduce the Distributed Cross-Channel Hierarchical Aggregation (D-CHAG) approach designed for datasets with a large number of channels across image modalities. Our method is compatible with any model-parallel strategy and any type of vision transformer architecture, significantly improving computational efficiency. We evaluated D-CHAG on hyperspectral imaging and weather forecasting tasks. When integrated with tensor parallelism and model sharding, our approach achieved up to a 75% reduction in memory usage and more than doubled sustained throughput on up to 1,024 AMD GPUs on the Frontier Supercomputer.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2506.21411 [cs.LG]
	(or arXiv:2506.21411v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2506.21411

Submission history

From: Aristeidis Tsaris [view email]
[v1] Thu, 26 Jun 2025 15:58:14 UTC (3,893 KB)

Computer Science > Machine Learning

Title:Distributed Cross-Channel Hierarchical Aggregation for Foundation Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Distributed Cross-Channel Hierarchical Aggregation for Foundation Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators