Skip to main content

Showing 1–5 of 5 results for author: Labini, P S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11551  [pdf, other

    cs.DC

    High Performance Unstructured SpMM Computation Using Tensor Cores

    Authors: Patrik Okanovic, Grzegorz Kwasniewski, Paolo Sylos Labini, Maciej Besta, Flavio Vella, Torsten Hoefler

    Abstract: High-performance sparse matrix-matrix (SpMM) multiplication is paramount for science and industry, as the ever-increasing sizes of data prohibit using dense data structures. Yet, existing hardware, such as Tensor Cores (TC), is ill-suited for SpMM, as it imposes strict constraints on data structures that cannot be met by unstructured sparsity found in many applications. To address this, we introdu… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: Accepted by 2024 International Conference on High Performance Computing, Networking, Storage and Analysis, 2023 (SC'24)

  2. arXiv:2306.00606  [pdf

    cs.SI cs.DS

    Scaling Expected Force: Efficient Identification of Key Nodes in Network-based Epidemic Models

    Authors: Paolo Sylos Labini, Andrej Jurco, Matteo Ceccarello, Stefano Guarino, Enrico Mastrostefano, Flavio Vella

    Abstract: Centrality measures are fundamental tools of network analysis as they highlight the key actors within the network. This study focuses on a newly proposed centrality measure, Expected Force (EF), and its use in identifying spreaders in network-based epidemic models. We found that EF effectively predicts the spreading power of nodes and identifies key nodes and immunization targets. However, its hig… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  3. Towards a learning-based performance modeling for accelerating Deep Neural Networks

    Authors: Damiano Perri, Paolo Sylos Labini, Osvaldo Gervasi, Sergio Tasso, Flavio Vella

    Abstract: Emerging applications such as Deep Learning are often data-driven, thus traditional approaches based on auto-tuners are not performance effective across the wide range of inputs used in practice. In the present paper, we start an investigation of predictive models based on machine learning techniques in order to optimize Convolution Neural Networks (CNNs). As a use-case, we focus on the ARM Comput… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

  4. arXiv:2208.11469  [pdf, other

    cs.DC cs.DS

    ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations

    Authors: Maciej Besta, Cesare Miglioli, Paolo Sylos Labini, Jakub Tětek, Patrick Iff, Raghavendra Kanakagiri, Saleh Ashkboos, Kacper Janda, Michal Podstawski, Grzegorz Kwasniewski, Niels Gleinig, Flavio Vella, Onur Mutlu, Torsten Hoefler

    Abstract: Important graph mining problems such as Clustering are computationally demanding. To significantly accelerate these problems, we propose ProbGraph: a graph representation that enables simple and fast approximate parallel graph mining with strong theoretical guarantees on work, depth, and result accuracy. The key idea is to represent sets of vertices using probabilistic set representations such as… ▽ More

    Submitted 21 November, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: Best Paper Award at ACM/IEEE Supercomputing'22 (SC22)

    Journal ref: Proceedings of the ACM/IEEE International Conference on High Performance Computing, Networking, Storage and Analysis, November 2022

  5. arXiv:2202.05868  [pdf, other

    cs.DC

    Blocking Techniques for Sparse Matrix Multiplication on Tensor Accelerators

    Authors: Paolo Sylos Labini, Massimo Bernaschi, Francesco Silvestri, Flavio Vella

    Abstract: Tensor accelerators have gained popularity because they provide a cheap and efficient solution for speeding up computational-expensive tasks in Deep Learning and, more recently, in other Scientific Computing applications. However, since their features are specifically designed for tensor algebra (typically dense matrix-product), it is commonly assumed that they are not suitable for applications wi… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: 12 pages, 14 images