Skip to main content

Showing 1–11 of 11 results for author: Finkbeiner, J

.
  1. arXiv:2412.11810  [pdf, other

    cs.NE cs.AR cs.LG

    Optimal Gradient Checkpointing for Sparse and Recurrent Architectures using Off-Chip Memory

    Authors: Wadjih Bencheikh, Jan Finkbeiner, Emre Neftci

    Abstract: Recurrent neural networks (RNNs) are valued for their computational efficiency and reduced memory requirements on tasks involving long sequence lengths but require high memory-processor bandwidth to train. Checkpointing techniques can reduce the memory requirements by only storing a subset of intermediate states, the checkpoints, but are still rarely used due to the computational overhead of the a… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  2. arXiv:2410.08711  [pdf, other

    cs.NE

    On-Chip Learning via Transformer In-Context Learning

    Authors: Jan Finkbeiner, Emre Neftci

    Abstract: Autoregressive decoder-only transformers have become key components for scalable sequence processing and generation models. However, the transformer's self-attention mechanism requires transferring prior token projections from the main memory at each time step (token), thus severely limiting their performance on conventional processors. Self-attention can be viewed as a dynamic feed-forward layer,… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  3. arXiv:2409.19315  [pdf, other

    cs.NE cs.AI cs.AR cs.ET

    Analog In-Memory Computing Attention Mechanism for Fast and Energy-Efficient Large Language Models

    Authors: Nathan Leroux, Paul-Philipp Manea, Chirag Sudarshan, Jan Finkbeiner, Sebastian Siegel, John Paul Strachan, Emre Neftci

    Abstract: Transformer networks, driven by self-attention, are central to Large Language Models. In generative Transformers, self-attention uses cache memory to store token projections, avoiding recomputation at each time step. However, GPU-stored projections must be loaded into SRAM for each new generation step, causing latency and energy bottlenecks. We present a custom self-attention in-memory computing… ▽ More

    Submitted 25 November, 2024; v1 submitted 28 September, 2024; originally announced September 2024.

    Comments: 25 pages, 6 figures, 1 table

  4. arXiv:2409.02842  [pdf, other

    cs.NE cs.LG

    SNNAX -- Spiking Neural Networks in JAX

    Authors: Jamie Lohoff, Jan Finkbeiner, Emre Neftci

    Abstract: Spiking Neural Networks (SNNs) simulators are essential tools to prototype biologically inspired models and neuromorphic hardware architectures and predict their performance. For such a tool, ease of use and flexibility are critical, but so is simulation speed especially given the complexity inherent to simulating SNN. Here, we present SNNAX, a JAX-based framework for simulating and training such… ▽ More

    Submitted 27 January, 2025; v1 submitted 4 September, 2024; originally announced September 2024.

    Journal ref: Proceedings of the International Conference on Neuromorphic Systems. 2024

  5. arXiv:2403.10173  [pdf, other

    cs.CV cs.AI

    Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention

    Authors: Soikat Hasan Ahmed, Jan Finkbeiner, Emre Neftci

    Abstract: Event cameras offer high temporal resolution and dynamic range with minimal motion blur, making them promising for robust object detection. While Spiking Neural Networks (SNNs) on neuromorphic hardware are often considered for energy-efficient and low latency event-based data processing, they often fall short of Artificial Neural Networks (ANNs) in accuracy and flexibility. Here, we introduce Atte… ▽ More

    Submitted 11 March, 2025; v1 submitted 15 March, 2024; originally announced March 2024.

  6. The Ouroboros of Memristors: Neural Networks Facilitating Memristor Programming

    Authors: Zhenming Yu, Ming-Jay Yang, Jan Finkbeiner, Sebastian Siegel, John Paul Strachan, Emre Neftci

    Abstract: Memristive devices hold promise to improve the scale and efficiency of machine learning and neuromorphic hardware, thanks to their compact size, low power consumption, and the ability to perform matrix multiplications in constant time. However, on-chip training with memristor arrays still faces challenges, including device-to-device and cycle-to-cycle variations, switching non-linearity, and espec… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: This work is accepted at the 2024 IEEE AICAS

    Journal ref: 2024 AICAS, Abu Dhabi, United Arab Emirates, 2024, pp. 398-402

  7. arXiv:2311.04386  [pdf, other

    cs.NE cs.AI

    Harnessing Manycore Processors with Distributed Memory for Accelerated Training of Sparse and Recurrent Models

    Authors: Jan Finkbeiner, Thomas Gmeinder, Mark Pupilli, Alexander Titterton, Emre Neftci

    Abstract: Current AI training infrastructure is dominated by single instruction multiple data (SIMD) and systolic array architectures, such as Graphics Processing Units (GPUs) and Tensor Processing Units (TPUs), that excel at accelerating parallel workloads and dense vector matrix multiplications. Potentially more efficient neural network models utilizing sparsity and recurrence cannot leverage the full pow… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  8. arXiv:2309.03840  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci

    Generating Minimal Training Sets for Machine Learned Potentials

    Authors: Jan Finkbeiner, Samuel Tovey, Christian Holm

    Abstract: This letter presents a novel approach for identifying uncorrelated atomic configurations from extensive data sets with a non-standard neural network workflow known as random network distillation (RND) for training machine-learned inter-atomic potentials (MLPs). This method is coupled with a DFT workflow wherein initial data is generated with cheaper classical methods before only the minimal subset… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 9 pages, 8 figures, letter

  9. arXiv:2303.11860  [pdf, other

    cs.NE cs.AI cs.LG

    Online Transformers with Spiking Neurons for Fast Prosthetic Hand Control

    Authors: Nathan Leroux, Jan Finkbeiner, Emre Neftci

    Abstract: Transformers are state-of-the-art networks for most sequence processing tasks. However, the self-attention mechanism often used in Transformers requires large time windows for each computation step and thus makes them less suitable for online signal processing compared to Recurrent Neural Networks (RNNs). In this paper, instead of the self-attention mechanism, we use a sliding window attention mec… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Preprint of 9 pages, 4 figures

  10. arXiv:2108.01582  [pdf, other

    physics.comp-ph

    Efficient Data Selection Methods for the Development of Machine Learned Potentials

    Authors: Jan Finkbeiner, Samuel Tovey, Christian Holm

    Abstract: We present an investigation into data selection methods for the efficient sampling of configuration space as applied to the development of inter-atomic potentials for scale bridging in molecular dynamics (MD) simulations. This investigation suggests that the most efficient sampling techniques are those that incorporate information on an atomic level such as forces or atomic energies. Finally, we g… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

  11. arXiv:2006.13084  [pdf, other

    cs.CV cs.LG cs.RO eess.IV

    Single-Shot 3D Detection of Vehicles from Monocular RGB Images via Geometry Constrained Keypoints in Real-Time

    Authors: Nils Gählert, Jun-Jun Wan, Nicolas Jourdan, Jan Finkbeiner, Uwe Franke, Joachim Denzler

    Abstract: In this paper we propose a novel 3D single-shot object detection method for detecting vehicles in monocular RGB images. Our approach lifts 2D detections to 3D space by predicting additional regression and classification parameters and hence keeping the runtime close to pure 2D object detection. The additional parameters are transformed to 3D bounding box keypoints within the network under geometri… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

    Comments: 2020 IEEE IV Symposium