Skip to main content

Showing 1–7 of 7 results for author: Sarda, G M

.
  1. arXiv:2504.06737  [pdf, other

    math.DG

    Urysohn width of hypersurfaces and positive macroscopic scalar curvature

    Authors: Teo Gil Moreno de Mora Sardà

    Abstract: We prove that if a complete Riemannian $n$-manifold with non-trivial codimension 1 homology with $\mathbb{Z}_2$-coefficients or $\mathbb{Z}$-coefficients has positive macroscopic scalar curvature large enough, then it contains a non-nullhomologous hypersurface of small Urysohn $(n-2)$-width. This constitutes a macroscopic analogue of a theorem by Bray--Brendle--Neves on the area of non-contractibl… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: 12 pages, 3 figures

    MSC Class: Primary 53C23; Secondary 53C21

  2. arXiv:2410.08855  [pdf, other

    cs.DC cs.AI

    MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices

    Authors: Mohamed Amine Hamdi, Francesco Daghero, Giuseppe Maria Sarda, Josse Van Delm, Arne Symons, Luca Benini, Marian Verhelst, Daniele Jahier Pagliari, Alessio Burrello

    Abstract: Streamlining the deployment of Deep Neural Networks (DNNs) on heterogeneous edge platforms, coupling within the same micro-controller unit (MCU) instruction processors and hardware accelerators for tensor computations, is becoming one of the crucial challenges of the TinyML field. The best-performing DNN compilation toolchains are usually deeply customized for a single MCU family, and porting to… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 13 pages, 11 figures, 4 tables

    ACM Class: I.2.2; D.1.3

  3. Optimising GPGPU Execution Through Runtime Micro-Architecture Parameter Analysis

    Authors: Giuseppe M. Sarda, Nimish Shah, Debjyoti Bhattacharjee, Peter Debacker, Marian Verhelst

    Abstract: GPGPU execution analysis has always been tied to closed-source, proprietary benchmarking tools that provide high-level, non-exhaustive, and/or statistical information, preventing a thorough understanding of bottlenecks and optimization possibilities. Open-source hardware platforms offer opportunities to overcome such limits and co-optimize the full {hardware-mapping-algorithm} compute stack. Yet,… ▽ More

    Submitted 14 June, 2024; originally announced July 2024.

    Journal ref: 2023 IEEE International Symposium on Workload Characterization (IISWC)

  4. arXiv:2407.07198  [pdf, other

    math.DG

    Complete 3-manifolds of positive scalar curvature with quadratic decay

    Authors: Florent Balacheff, Teo Gil Moreno de Mora Sardà, Stéphane Sabourau

    Abstract: We prove that if an orientable 3-manifold $M$ admits a complete Riemannian metric whose scalar curvature is positive and has a subquadratic decay at infinity, then it decomposes as a (possibly infinite) connected sum of spherical manifolds and $\mathbb{S}^2 \times \mathbb{S}^1$ summands. This generalises a theorem of Gromov and Wang by using a different, more topological, approach. As a result, th… ▽ More

    Submitted 11 May, 2025; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: 24 pages, 8 figures. To appear in Mathematische Annalen

    MSC Class: Primary 53C23; Secondary 53C21

  5. HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms

    Authors: Josse Van Delm, Maarten Vandersteegen, Alessio Burrello, Giuseppe Maria Sarda, Francesco Conti, Daniele Jahier Pagliari, Luca Benini, Marian Verhelst

    Abstract: Optimal deployment of deep neural networks (DNNs) on state-of-the-art Systems-on-Chips (SoCs) is crucial for tiny machine learning (TinyML) at the edge. The complexity of these SoCs makes deployment non-trivial, as they typically contain multiple heterogeneous compute cores with limited, programmer-managed memory to optimize latency and energy efficiency. We propose HTVM - a compiler that merges T… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Presented at DAC2023. Open-source code is available at https://github.com/KULeuven-MICAS/htvm

    ACM Class: D.3.4

    Journal ref: 2023 60th ACM/IEEE Design Automation Conference (DAC), San Francisco, CA, USA, 2023, pp. 1-6

  6. arXiv:2306.05060  [pdf, other

    cs.LG

    Precision-aware Latency and Energy Balancing on Multi-Accelerator Platforms for DNN Inference

    Authors: Matteo Risso, Alessio Burrello, Giuseppe Maria Sarda, Luca Benini, Enrico Macii, Massimo Poncino, Marian Verhelst, Daniele Jahier Pagliari

    Abstract: The need to execute Deep Neural Networks (DNNs) at low latency and low power at the edge has spurred the development of new heterogeneous Systems-on-Chips (SoCs) encapsulating a diverse set of hardware accelerators. How to optimally map a DNN onto such multi-accelerator systems is an open problem. We propose ODiMO, a hardware-aware tool that performs a fine-grain mapping across different accelerat… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted at 2023 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED)

  7. arXiv:2208.00331  [pdf, other

    cs.AR cs.LG

    CoNLoCNN: Exploiting Correlation and Non-Uniform Quantization for Energy-Efficient Low-precision Deep Convolutional Neural Networks

    Authors: Muhammad Abdullah Hanif, Giuseppe Maria Sarda, Alberto Marchisio, Guido Masera, Maurizio Martina, Muhammad Shafique

    Abstract: In today's era of smart cyber-physical systems, Deep Neural Networks (DNNs) have become ubiquitous due to their state-of-the-art performance in complex real-world applications. The high computational complexity of these networks, which translates to increased energy consumption, is the foremost obstacle towards deploying large DNNs in resource-constrained systems. Fixed-Point (FP) implementations… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: 8 pages, 15 figures, 2 tables