Showing 1–11 of 11 results for author: Rochette, G

  1. arXiv:2407.10266  [pdf, other]

    cs.CL cs.LG

    psifx -- Psychological and Social Interactions Feature Extraction Package

    Authors: Guillaume Rochette, Matthew J. Vowels, Mathieu Rochat

    Abstract: psifx is a plug-and-play multi-modal feature extraction toolkit, aiming to facilitate and democratize the use of state-of-the-art machine learning techniques for human sciences research. It is motivated by a need (a) to automate and standardize data annotation processes, otherwise involving expensive, lengthy, and inconsistent human labor, such as the transcription or coding of behavior changes fr…

    Submitted 9 December, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

  2. arXiv:2305.18512  [pdf, other]

    cs.LG cs.CV eess.SP

    A Rainbow in Deep Network Black Boxes

    Authors: Florentin Guth, Brice Ménard, Gaspar Rochette, Stéphane Mallat

    Abstract: A central question in deep learning is to understand the functions learned by deep networks. What is their approximation class? Do the learned weights and representations depend on initialization? Previous empirical work has evidenced that kernels defined by network activations are similar across initializations. For shallow networks, this has been theoretically studied with random feature models,…

    Submitted 24 October, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: 59 pages, 10 figures. To appear at JMLR

  3. Novel View Synthesis of Humans using Differentiable Rendering

    Authors: Guillaume Rochette, Chris Russell, Richard Bowden

    Abstract: We present a new approach for synthesizing novel views of people in new poses. Our novel differentiable renderer enables the synthesis of highly realistic images from any viewpoint. Rather than operating over mesh-based structures, our renderer makes use of diffuse Gaussian primitives that directly represent the underlying skeletal structure of a human. Rendering these primitives gives results in…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Accepted at IEEE Transactions on Biometrics, Behavior, and Identity Science, 10 pages, 11 figures. arXiv admin note: substantial text overlap with arXiv:2111.12731

  4. arXiv:2206.09556  [pdf, other]

    eess.AS cs.AI cs.LG cs.SD eess.SP

    An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models

    Authors: Rahil Parikh, Gaspar Rochette, Carol Espy-Wilson, Shihab Shamma

    Abstract: End-to-end learning models have demonstrated a remarkable capability in performing speech segregation. Despite their wide scope of real-world applications, little is known about the mechanisms they employ to group and consequently segregate individual speakers. Knowing that harmonicity is a critical cue for these networks to group sources, in this work, we perform a thorough investigation on ConvT…

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: Accepted at Interspeech 2022

  5. arXiv:2204.10177  [pdf, other]

    physics.data-an cond-mat.dis-nn cs.LG eess.SP q-fin.MF stat.ML

    Scale Dependencies and Self-Similar Models with Wavelet Scattering Spectra

    Authors: Rudy Morel, Gaspar Rochette, Roberto Leonarduzzi, Jean-Philippe Bouchaud, Stéphane Mallat

    Abstract: We introduce the wavelet scattering spectra which provide non-Gaussian models of time-series having stationary increments. A complex wavelet transform computes signal variations at each scale. Dependencies across scales are captured by the joint correlation across time and scales of wavelet coefficients and their modulus. This correlation matrix is nearly diagonalized by a second wavelet transform…

    Submitted 19 June, 2023; v1 submitted 19 April, 2022; originally announced April 2022.

  6. arXiv:2111.12731  [pdf, other]

    cs.CV

    Human Pose Manipulation and Novel View Synthesis using Differentiable Rendering

    Authors: Guillaume Rochette, Chris Russell, Richard Bowden

    Abstract: We present a new approach for synthesizing novel views of people in new poses. Our novel differentiable renderer enables the synthesis of highly realistic images from any viewpoint. Rather than operating over mesh-based structures, our renderer makes use of diffuse Gaussian primitives that directly represent the underlying skeletal structure of a human. Rendering these primitives gives results in…

    Submitted 20 February, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Accepted at Face and Gesture 2021, 8 pages, 7 figures

  7. arXiv:2105.02351  [pdf, other]

    cs.CV cs.CL

    Content4All Open Research Sign Language Translation Datasets

    Authors: Necati Cihan Camgoz, Ben Saunders, Guillaume Rochette, Marco Giovanelli, Giacomo Inches, Robin Nachtrab-Ribback, Richard Bowden

    Abstract: Computational sign language research lacks the large-scale datasets that enable the creation of useful real-life applications. To date, most research has been limited to prototype systems on small domains of discourse, e.g. weather forecasts. To address this issue and to push the field forward, we release six datasets comprised of 190 hours of footage on the larger domain of news. From this, 20 ho…

    Submitted 5 May, 2021; originally announced May 2021.

  8. arXiv:1912.06015  [pdf, other]

    cs.LG cs.CV stat.ML

    Efficient Per-Example Gradient Computations in Convolutional Neural Networks

    Authors: Gaspar Rochette, Andre Manoel, Eric W. Tramel

    Abstract: Deep learning frameworks leverage GPUs to perform massively-parallel computations over batches of many training examples efficiently. However, for certain tasks, one may be interested in performing per-example computations, for instance using per-example gradients to evaluate a quantity of interest unique to each example. One notable application comes from the field of differential privacy, where…

    Submitted 12 December, 2019; originally announced December 2019.

    Journal ref: Theory and Practice of Differential Privacy (TPDP) workshop at CCS 2020

  9. arXiv:1909.06119  [pdf, other]

    cs.CV

    Weakly-Supervised 3D Pose Estimation from a Single Image using Multi-View Consistency

    Authors: Guillaume Rochette, Chris Russell, Richard Bowden

    Abstract: We present a novel data-driven regularizer for weakly-supervised learning of 3D human pose estimation that eliminates the drift problem that affects existing approaches. We do this by moving the stereo reconstruction problem into the loss of the network itself. This avoids the need to reconstruct 3D data prior to training and unlike previous semi-supervised approaches, avoids the need for a warm-u…

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: BMVC

  10. arXiv:1812.11214  [pdf, ps, other]

    cs.LG cs.CV cs.SD eess.AS stat.ML

    Kymatio: Scattering Transforms in Python

    Authors: Mathieu Andreux, Tomás Angles, Georgios Exarchakis, Roberto Leonarduzzi, Gaspar Rochette, Louis Thiry, John Zarka, Stéphane Mallat, Joakim Andén, Eugene Belilovsky, Joan Bruna, Vincent Lostanlen, Muawiz Chaudhary, Matthew J. Hirn, Edouard Oyallon, Sixin Zhang, Carmine Cella, Michael Eickenberg

    Abstract: The wavelet scattering transform is an invariant signal representation suitable for many signal processing and machine learning applications. We present the Kymatio software package, an easy-to-use, high-performance Python implementation of the scattering transform in 1D, 2D, and 3D that is compatible with modern deep learning frameworks. All transforms may be executed on a GPU (in addition to CPU…

    Submitted 31 May, 2022; v1 submitted 28 December, 2018; originally announced December 2018.

  11. arXiv:1810.12136  [pdf, other]

    eess.SP cs.LG stat.ML

    Phase Harmonic Correlations and Convolutional Neural Networks

    Authors: Stéphane Mallat, Sixin Zhang, Gaspar Rochette

    Abstract: A major issue in harmonic analysis is to capture the phase dependence of frequency representations, which carries important signal properties. It seems that convolutional neural networks have found a way. Over time-series and images, convolutional networks often learn a first layer of filters which are well localized in the frequency domain, with different phases. We show that a rectifier then act…

    Submitted 29 June, 2019; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: 26 pages, 8 figures