Skip to main content

Showing 1–4 of 4 results for author: Haghi, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.18749  [pdf, other

    cs.AR

    ACiS: Complex Processing in the Switch Fabric

    Authors: Pouya Haghi, Anqi Guo, Tong Geng, Anthony Skjellum, Martin Herbordt

    Abstract: For the last three decades a core use of FPGAs has been for processing communication: FPGA-based SmartNICs are in widespread use from the datacenter to IoT. Augmenting switches with FPGAs, however, has been less studied, but has numerous advantages built around the processing being moved from the edge of the network to the center. Communication switches have previously been augmented to process co… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  2. arXiv:2305.19946  [pdf, other

    cs.DC

    A Survey of Potential MPI Complex Collectives: Large-Scale Mining and Analysis of HPC Applications

    Authors: Pouya Haghi, Ryan Marshall, Po Hao Chen, Anthony Skjellum, Martin Herbordt

    Abstract: Offload of MPI collectives to network devices, e.g., NICs and switches, is being implemented as an effective mechanism to improve application performance by reducing inter- and intra-node communication and bypassing MPI software layers. Given the rich deployment of accelerators and programmable NICs/switches in data centers, we posit that there is an opportunity to further improve performance by e… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  3. arXiv:2204.04816  [pdf, other

    cs.CR

    Distributed Hardware Accelerated Secure Joint Computation on the COPA Framework

    Authors: Rushi Patel, Pouya Haghi, Shweta Jain, Andriy Kot, Venkata Krishnan, Mayank Varia, Martin Herbordt

    Abstract: Performance of distributed data center applications can be improved through use of FPGA-based SmartNICs, which provide additional functionality and enable higher bandwidth communication. Until lately, however, the lack of a simple approach for customizing SmartNICs to application requirements has limited the potential benefits. Intel's Configurable Network Protocol Accelerator (COPA) provides a cu… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

  4. arXiv:1908.10834  [pdf, other

    cs.DC cs.LG

    AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing

    Authors: Tong Geng, Ang Li, Runbin Shi, Chunshu Wu, Tianqi Wang, Yanfei Li, Pouya Haghi, Antonino Tumeo, Shuai Che, Steve Reinhardt, Martin Herbordt

    Abstract: Deep learning systems have been successfully applied to Euclidean data such as images, video, and audio. In many applications, however, information and their relationships are better expressed with graphs. Graph Convolutional Networks (GCNs) appear to be a promising approach to efficiently learn from graph data structures, having shown advantages in many critical applications. As with other deep l… ▽ More

    Submitted 10 September, 2020; v1 submitted 23 August, 2019; originally announced August 2019.