Showing 1–5 of 5 results for author: Biscainho, L W P

  1. arXiv:2411.07364  [pdf, other]

    eess.AS cs.SD

    AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models

    Authors: Wallace Abreu, Luiz Wagner Pereira Biscainho

    Abstract: Audio super-resolution aims to enhance low-resolution signals by creating high-frequency content. In this work, we modify the architecture of AERO (a state-of-the-art system for this task) for music super-resolution. Specifically, we replace its original Attention and LSTM layers with Mamba, a State Space Model (SSM), across all network layers. Mamba is capable of effectively substituting the ment…

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: Accepted at LAMIR 2024 Workshop (ISMIR 2024 Satellite Event)

  2. arXiv:2304.07186  [pdf, other]

    cs.SD eess.AS

    Adapting Meter Tracking Models to Latin American Music

    Authors: Lucas S. Maia, Martín Rocamora, Luiz W. P. Biscainho, Magdalena Fuentes

    Abstract: Beat and downbeat tracking models have improved significantly in recent years with the introduction of deep learning methods. However, despite these improvements, several challenges remain. Particularly, the adaptation of available models to underrepresented music traditions in MIR is usually synonymous with collecting and annotating large amounts of data, which is impractical and time-consuming.…

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted at ISMIR 2022. This version was made after a bug fix in the code, which led to minor modifications in the results (updated in Figure 1 and Table 1). The paper's conclusions remain unchanged.

  3. arXiv:2005.14181  [pdf, other]

    eess.AS cs.SD eess.SP stat.AP stat.ML

    Bayesian Restoration of Audio Degraded by Low-Frequency Pulses Modeled via Gaussian Process

    Authors: Hugo Tremonte de Carvalho, Flávio Rainho Ávila, Luiz Wagner Pereira Biscainho

    Abstract: A common defect found when reproducing old vinyl and gramophone recordings with mechanical devices is the long pulses with significant low-frequency content caused by the interaction of the arm-needle system with deep scratches or even breakages on the media surface. Previous approaches to their suppression on digital counterparts of the recordings depend on a prior estimation of the pulse locati…

    Submitted 26 September, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: 14 pages, 7 figures, 4 tables. Submitted to IEEE Journal of Selected Topics in Signal Processing - Special Issue "Reconstruction of audio from incomplete or highly degraded observations"

  4. arXiv:1810.08707  [pdf]

    cs.HC cs.AI cs.SD eess.AS

    Mobile Sound Recognition for the Deaf and Hard of Hearing

    Authors: Leonardo A. Fanzeres, Adriana S. Vivacqua, Luiz W. P. Biscainho

    Abstract: Human perception of surrounding events is strongly dependent on audio cues. Thus, acoustic insulation can seriously impact situational awareness. We present an exploratory study in the domain of assistive computing, eliciting requirements and presenting solutions to problems found in the development of an environmental sound recognition system, which aims to assist deaf and hard of hearing people…

    Submitted 19 October, 2018; originally announced October 2018.

    Comments: 25 pages, 8 figures

    MSC Class: 68U35; 68T37; 68T10

    ACM Class: H.1.2; H.5.2; H.5.5

  5. Efficient Steered-Response Power Methods for Sound Source Localization Using Microphone Arrays

    Authors: Markus V. S. Lima, Wallace A. Martins, Leonardo O. Nunes, Luiz W. P. Biscainho, Tadeu N. Ferreira, Maurício V. M. Costa, Bowon Lee

    Abstract: This paper proposes an efficient method based on the steered-response power (SRP) technique for sound source localization using microphone arrays: the volumetric SRP (V-SRP). As compared to the SRP, by deploying a sparser volumetric grid, the V-SRP achieves a significant reduction of the computational complexity without sacrificing the accuracy of the location estimates. By appending a fine search…

    Submitted 15 February, 2015; v1 submitted 9 July, 2014; originally announced July 2014.

    Comments: 14 pages, 9 figures, 5 tables

    Journal ref: IEEE Signal Processing Letters (Volume: 22, Issue: 8), Aug. 2015