Skip to main content

Showing 1–6 of 6 results for author: Bamberg, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.01166  [pdf, other

    cs.AR cs.AI cs.LG

    VUSA: Virtually Upscaled Systolic Array Architecture to Exploit Unstructured Sparsity in AI Acceleration

    Authors: Shereef Helal, Alberto Garcia-Ortiz, Lennart Bamberg

    Abstract: Leveraging high degrees of unstructured sparsity is a promising approach to enhance the efficiency of deep neural network DNN accelerators - particularly important for emerging Edge-AI applications. We introduce VUSA, a systolic-array architecture that virtually grows based on the present sparsity to perform larger matrix multiplications with the same number of physical multiply-accumulate MAC uni… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: Preprint accepted for publication at MOCAST 2025. Submitted for possible publication in IEEE Xplore

  2. arXiv:2311.05557  [pdf, other

    cs.LG cs.AR

    Exploiting Neural-Network Statistics for Low-Power DNN Inference

    Authors: Lennart Bamberg, Ardalan Najafi, Alberto Garcia-Ortiz

    Abstract: Specialized compute blocks have been developed for efficient DNN execution. However, due to the vast amount of data and parameter movements, the interconnects and on-chip memories form another bottleneck, impairing power and performance. This work addresses this bottleneck by contributing a low-power technique for edge-AI inference engines that combines overhead-free coding with a statistical anal… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  3. arXiv:2112.07019  [pdf, other

    cs.AR cs.AI

    Synapse Compression for Event-Based Convolutional-Neural-Network Accelerators

    Authors: Lennart Bamberg, Arash Pourtaherian, Luc Waeijen, Anupam Chahar, Orlando Moreira

    Abstract: Manufacturing-viable neuromorphic chips require novel computer architectures to achieve the massively parallel and efficient information processing the brain supports so effortlessly. Emerging event-based architectures are making this dream a reality. However, the large memory requirements for synaptic connectivity are a showstopper for the execution of modern convolutional neural networks (CNNs)… ▽ More

    Submitted 24 January, 2023; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Preprint accepted by the IEEE Transactions on Parallel and Distributed Systems

  4. arXiv:1912.05670  [pdf, other

    cs.AR

    Ratatoskr: An open-source framework for in-depth power, performance and area analysis in 3D NoCs

    Authors: Jan Moritz Joseph, Lennart Bamberg, Imad Hajjar, Anna Drewes, Behnam Razi Perjikolaei, Alberto García-Ortiz, Thilo Pionteck

    Abstract: We introduce ratatoskr, an open-source framework for in-depth power, performance and area (PPA) analysis in NoCs for 3D-integrated and heterogeneous System-on-Chips (SoCs). It covers all layers of abstraction by providing a NoC hardware implementation on RT level, a NoC simulator on cycle-accurate level and an application model on transaction level. By this comprehensive approach, ratatoskr can pr… ▽ More

    Submitted 14 January, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

  5. arXiv:1909.13807  [pdf, other

    cs.AR

    System-level optimization of Network-on-Chips for heterogeneous 3D System-on-Chips

    Authors: Jan Moritz Joseph, Dominik Ermel, Lennart Bamberg, Alberto García-Ortiz, Thilo Pionteck

    Abstract: For a system-level design of Networks-on-Chip for 3D heterogeneous System-on-Chip (SoC), the locations of components, routers and vertical links are determined from an application model and technology parameters. In conventional methods, the two inputs are accounted for separately; here, we define an integrated problem that considers both application model and technology parameters. We show that t… ▽ More

    Submitted 3 October, 2019; v1 submitted 30 September, 2019; originally announced September 2019.

  6. arXiv:1909.04554  [pdf, other

    cs.AR

    NoCs in Heterogeneous 3D SoCs: Co-Design of Routing Strategies and Microarchitectures

    Authors: Jan Moritz Joseph, Lennart Bamberg, Dominik Ermel, Behnam Razi Perjikolaei, Anna Drewes, Alberto García-Oritz, Thilo Pionteck

    Abstract: Heterogeneous 3D System-on-Chips (3D SoCs) are the most promising design paradigm to combine sensing and computing within a single chip. A special characteristic of communication networks in heterogeneous 3D SoCs is the varying latency and throughput in each layer. As shown in this work, this variance drastically degrades the network performance. We contribute a co-design of routing algorithms and… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.