Skip to main content

Showing 1–11 of 11 results for author: Poli, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.02692  [pdf, other

    cs.CL cs.SD eess.AS

    fastabx: A library for efficient computation of ABX discriminability

    Authors: Maxime Poli, Emmanuel Chemla, Emmanuel Dupoux

    Abstract: We introduce fastabx, a high-performance Python library for building ABX discrimination tasks. ABX is a measure of the separation between generic categories of interest. It has been used extensively to evaluate phonetic discriminability in self-supervised speech representations. However, its broader adoption has been limited by the absence of adequate tools. fastabx addresses this gap by providing… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 8 pages, 6 figures

  2. arXiv:2410.00025  [pdf, other

    cs.CL cs.SD eess.AS

    Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach

    Authors: Maxime Poli, Emmanuel Chemla, Emmanuel Dupoux

    Abstract: Recent progress in Spoken Language Modeling has shown that learning language directly from speech is feasible. Generating speech through a pipeline that operates at the text level typically loses nuances, intonations, and non-verbal vocalizations. Modeling directly from speech opens up the path to more natural and expressive systems. On the other hand, speech-only systems require up to three order… ▽ More

    Submitted 30 October, 2024; v1 submitted 16 September, 2024; originally announced October 2024.

    Comments: Accepted at EMNLP 2024 main conference. 9 pages, 4 figures

  3. arXiv:2405.06147  [pdf, other

    cs.LG eess.SY

    State-Free Inference of State-Space Models: The Transfer Function Approach

    Authors: Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro, Jimmy T. H. Smith, Ramin Hasani, Mathias Lechner, Qi An, Christopher Ré, Hajime Asama, Stefano Ermon, Taiji Suzuki, Atsushi Yamashita, Michael Poli

    Abstract: We approach designing a state-space model for deep learning applications through its dual representation, the transfer function, and uncover a highly efficient sequence parallel inference algorithm that is state-free: unlike other proposed algorithms, state-free inference does not incur any significant memory or computational cost with an increase in state size. We achieve this using properties of… ▽ More

    Submitted 1 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Resubmission 02/06/2024: Fixed minor typo of recurrent form RTF

  4. arXiv:2310.18780  [pdf, other

    cs.LG cs.AI eess.SP

    Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions

    Authors: Stefano Massaroli, Michael Poli, Daniel Y. Fu, Hermann Kumbong, Rom N. Parnichkun, Aman Timalsina, David W. Romero, Quinn McIntyre, Beidi Chen, Atri Rudra, Ce Zhang, Christopher Re, Stefano Ermon, Yoshua Bengio

    Abstract: Recent advances in attention-free sequence models rely on convolutions as alternatives to the attention operator at the core of Transformers. In particular, long convolution sequence models have achieved state-of-the-art performance in many domains, but incur a significant cost during auto-regressive inference workloads -- naively requiring a full pass (or caching of activations) over the input se… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  5. arXiv:2211.14453  [pdf, other

    cs.LG cs.AI eess.SY

    Transform Once: Efficient Operator Learning in Frequency Domain

    Authors: Michael Poli, Stefano Massaroli, Federico Berto, Jinykoo Park, Tri Dao, Christopher Ré, Stefano Ermon

    Abstract: Spectral analysis provides one of the most effective paradigms for information-preserving dimensionality reduction, as simple descriptions of naturally occurring signals are often obtained via few terms of periodic basis functions. In this work, we study deep neural networks designed to harness the structure in frequency domain for efficient learning of long-range correlations in space or time: fr… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: Published at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  6. arXiv:2211.13152  [pdf, other

    cs.NE cs.LG cs.SD eess.AS

    Introducing topography in convolutional neural networks

    Authors: Maxime Poli, Emmanuel Dupoux, Rachid Riad

    Abstract: Parts of the brain that carry sensory tasks are organized topographically: nearby neurons are responsive to the same properties of input signals. Thus, in this work, inspired by the neuroscience literature, we proposed a new topographic inductive bias in Convolutional Neural Networks (CNNs). To achieve this, we introduced a new topographic loss and an efficient implementation to topographically or… ▽ More

    Submitted 28 October, 2022; originally announced November 2022.

    Comments: Submitted to ICASSP 2023

  7. arXiv:2112.05555  [pdf, other

    cs.CL cs.SD eess.AS

    Shennong: a Python toolbox for audio speech features extraction

    Authors: Mathieu Bernard, Maxime Poli, Julien Karadayi, Emmanuel Dupoux

    Abstract: We introduce Shennong, a Python toolbox and command-line utility for speech features extraction. It implements a wide range of well-established state of art algorithms including spectro-temporal filters such as Mel-Frequency Cepstral Filterbanks or Predictive Linear Filters, pre-trained neural networks, pitch estimators as well as speaker normalization methods and post-processing algorithms. Shenn… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Journal ref: Behavior Research Methods, 2023

  8. arXiv:2106.04165  [pdf, other

    cs.LG cs.NE eess.SY math.DS

    Neural Hybrid Automata: Learning Dynamics with Multiple Modes and Stochastic Transitions

    Authors: Michael Poli, Stefano Massaroli, Luca Scimeca, Seong Joon Oh, Sanghyuk Chun, Atsushi Yamashita, Hajime Asama, Jinkyoo Park, Animesh Garg

    Abstract: Effective control and prediction of dynamical systems often require appropriate handling of continuous-time and discrete, event-triggered processes. Stochastic hybrid systems (SHSs), common across engineering domains, provide a formalism for dynamical systems subject to discrete, possibly stochastic, state jumps and multi-modal continuous-time flows. Despite the versatility and importance of SHSs… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  9. arXiv:2106.03780  [pdf, other

    cs.LG cs.AI eess.SY math.OC

    Learning Stochastic Optimal Policies via Gradient Descent

    Authors: Stefano Massaroli, Michael Poli, Stefano Peluchetti, Jinkyoo Park, Atsushi Yamashita, Hajime Asama

    Abstract: We systematically develop a learning-based treatment of stochastic optimal control (SOC), relying on direct optimization of parametric control policies. We propose a derivation of adjoint sensitivity results for stochastic differential equations through direct application of variational calculus. Then, given an objective function for a predetermined task specifying the desiderata for the controlle… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Journal ref: IEEE Control Systems Letters, 2021

  10. arXiv:2101.05537  [pdf, other

    eess.SY cs.AI cs.LG cs.NE math.DS

    Optimal Energy Shaping via Neural Approximators

    Authors: Stefano Massaroli, Michael Poli, Federico Califano, Jinkyoo Park, Atsushi Yamashita, Hajime Asama

    Abstract: We introduce optimal energy shaping as an enhancement of classical passivity-based control methods. A promising feature of passivity theory, alongside stability, has traditionally been claimed to be intuitive performance tuning along the execution of a given task. However, a systematic approach to adjust performance within a passive control framework has yet to be developed, as each method relies… ▽ More

    Submitted 14 January, 2021; originally announced January 2021.

  11. arXiv:1909.02702  [pdf, other

    cs.NE cs.LG eess.SY stat.ML

    Port-Hamiltonian Approach to Neural Network Training

    Authors: Stefano Massaroli, Michael Poli, Federico Califano, Angela Faragasso, Jinkyoo Park, Atsushi Yamashita, Hajime Asama

    Abstract: Neural networks are discrete entities: subdivided into discrete layers and parametrized by weights which are iteratively optimized via difference equations. Recent work proposes networks with layer outputs which are no longer quantized but are solutions of an ordinary differential equation (ODE); however, these networks are still optimized via discrete methods (e.g. gradient descent). In this pape… ▽ More

    Submitted 5 September, 2019; originally announced September 2019.

    Comments: To appear in the Proceedings of the 58th IEEE Conference on Decision and Control (CDC 2019). The first two authors contributed equally to the work