Skip to main content

Showing 1–16 of 16 results for author: Martinez, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2503.08080  [pdf, other

    eess.SY

    Electrifying Heavy-Duty Trucks: Battery-Swapping vs Fast Charging

    Authors: Ruiting Wang, Antoine Martinez, Zaid Allybokus, Wente Zeng, Nicolas Obrecht, Scott Moura

    Abstract: The advantages and disadvantages of Battery Swapping Stations (BSS) for heavy-duty trucks are poorly understood, relative to Fast Charging Stations (FCS) systems. This study evaluates these two charging mechanisms for electric heavy-duty trucks, aiming to compare the systems' efficiency and identify the optimal design for each option. A model was developed to address the planning and operation of… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  2. arXiv:2503.02915  [pdf, ps, other

    eess.IV cs.CV cs.LG math.NA physics.med-ph

    Computer-aided shape features extraction and regression models for predicting the ascending aortic aneurysm growth rate

    Authors: Leonardo Geronzi, Antonio Martinez, Michel Rochette, Kexin Yan, Aline Bel-Brunon, Pascal Haigron, Pierre Escrig, Jacques Tomasi, Morgan Daniel, Alain Lalande, Siyu Lin, Diana Marcela Marin-Castrillon, Olivier Bouchot, Jean Porterie, Pier Paolo Valentini, Marco Evangelos Biancolini

    Abstract: Objective: ascending aortic aneurysm growth prediction is still challenging in clinics. In this study, we evaluate and compare the ability of local and global shape features to predict ascending aortic aneurysm growth. Material and methods: 70 patients with aneurysm, for which two 3D acquisitions were available, are included. Following segmentation, three local shape features are computed: (1) t… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Journal ref: Volume 162, August 2023, 107052, Computers in Biology and Medicine

  3. arXiv:2405.18992  [pdf, other

    eess.SP

    A Digital Beamforming Receiver Architecture Implemented on a FPGA for Space Applications

    Authors: Eduardo Ortega, Agustín Martínez, Antonio Oliva, Fernando Sanz, Oscar Rodríguez, Manuel Prieto, Pablo Parra, Antonio Da Silva, Sebastián Sánchez

    Abstract: The burgeoning interest within the space community in digital beamforming is largely attributable to the superior flexibility that satellites with active antenna systems offer for a wide range of applications, notably in communication services. This paper delves into the analysis and practical implementation of a Digital Beamforming and Digital Down Conversion (DDC) chain, leveraging a high-speed… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2405.13762  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation

    Authors: Gwanghyun Kim, Alonso Martinez, Yu-Chuan Su, Brendan Jou, José Lezama, Agrim Gupta, Lijun Yu, Lu Jiang, Aren Jansen, Jacob Walker, Krishna Somandepalli

    Abstract: Training diffusion models for audiovisual sequences allows for a range of generation tasks by learning conditional distributions of various input-output combinations of the two modalities. Nevertheless, this strategy often requires training a separate model for each task which is expensive. Here, we propose a novel training approach to effectively learn arbitrary conditional distributions in the a… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  5. arXiv:2309.00769  [pdf, other

    eess.IV cs.CV

    Full Reference Video Quality Assessment for Machine Learning-Based Video Codecs

    Authors: Abrar Majeedi, Babak Naderi, Yasaman Hosseinkashi, Juhee Cho, Ruben Alvarez Martinez, Ross Cutler

    Abstract: Machine learning-based video codecs have made significant progress in the past few years. A critical area in the development of ML-based video codecs is an accurate evaluation metric that does not require an expensive and slow subjective test. We show that existing evaluation metrics that were designed and trained on DSP-based video codecs are not highly correlated to subjective opinion when used… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  6. arXiv:2303.04854  [pdf, other

    eess.IV

    Structural Similarity: When to Use Deep Generative Models on Imbalanced Image Dataset Augmentation

    Authors: Chenqi Guo, Fabian Benitez-Quiroz, Qianli Feng, Aleix Martinez

    Abstract: Improving the performance on an imbalanced training set is one of the main challenges in nowadays Machine Learning. One way to augment and thus re-balance the image dataset is through existing deep generative models, like class-conditional Generative Adversarial Networks (cGAN) or Diffusion Models by synthesizing images on each of the tail-class. Our experiments on imbalanced image dataset classif… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  7. arXiv:2207.14463  [pdf, other

    eess.IV cs.CV cs.MM eess.SP stat.ME

    Low-Complexity Loeffler DCT Approximations for Image and Video Coding

    Authors: D. F. G. Coelho, R. J. Cintra, F. M. Bayer, S. Kulasekera, A. Madanayake, P. A. C. Martinez, T. L. T. Silveira, R. S. Oliveira, V. S. Dimitrov

    Abstract: This paper introduced a matrix parametrization method based on the Loeffler discrete cosine transform (DCT) algorithm. As a result, a new class of eight-point DCT approximations was proposed, capable of unifying the mathematical formalism of several eight-point DCT approximations archived in the literature. Pareto-efficient DCT approximations are obtained through multicriteria optimization, where… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: 25 pages, 11 figures, 7 tables

    Journal ref: J. Low Power Electron. Appl. 2018, 8(4), 46

  8. A Novel Approach for Cancellation of Non-Aligned Inter Spreading Factor Interference in LoRa Systems

    Authors: Qiaohan Zhang, Ivo Bizon, Atul Kumar, Ana Belen Martinez, Marwa Chafii, Gerhard Fettweis

    Abstract: Long Range (LoRa) has become a key enabler technology for low power wide area networks. However, due to its ALOHA-based medium access scheme, LoRa has to cope with collisions that limit the capacity and network scalability. Collisions between randomly overlapped signals modulated with different spreading factors (SFs) result in inter-SF interference, which increases the packet loss likelihood when… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Published in: IEEE Open Journal of the Communications Society (Early Access)

  9. arXiv:2203.09148  [pdf, other

    cs.SD cs.AI cs.CL cs.LG eess.AS

    Prediction of speech intelligibility with DNN-based performance measures

    Authors: Angel Mario Castro Martinez, Constantin Spille, Jana Roßbach, Birger Kollmeier, Bernd T. Meyer

    Abstract: This paper presents a speech intelligibility model based on automatic speech recognition (ASR), combining phoneme probabilities from deep neural networks (DNN) and a performance measure that estimates the word error rate from these probabilities. This model does not require the clean speech reference nor the word labels during testing as the ASR decoding step, which finds the most likely sequence… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Journal ref: Computer Speech & Language, 74, p.101329 (2022)

  10. arXiv:2105.04752  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Differentiable Signal Processing With Black-Box Audio Effects

    Authors: Marco A. Martínez Ramírez, Oliver Wang, Paris Smaragdis, Nicholas J. Bryan

    Abstract: We present a data-driven approach to automate audio signal processing by incorporating stateful third-party, audio effects as layers within a deep neural network. We then train a deep encoder to analyze input audio and control effect parameters to perform the desired signal manipulation, requiring only input-target paired audio data as supervision. To train our network with non-differentiable blac… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: Presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), June 2021. Source code, demo and audio examples: https://mchijmma.github.io/DeepAFx/

  11. arXiv:2009.10474  [pdf, other

    eess.IV cs.CV

    Classification of COVID-19 in CT Scans using Multi-Source Transfer Learning

    Authors: Alejandro R. Martinez

    Abstract: Since December of 2019, novel coronavirus disease COVID-19 has spread around the world infecting millions of people and upending the global economy. One of the driving reasons behind its high rate of infection is due to the unreliability and lack of RT-PCR testing. At times the turnaround results span as long as a couple of days, only to yield a roughly 70% sensitivity rate. As an alternative, rec… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

  12. arXiv:1910.10105  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Modeling plate and spring reverberation using a DSP-informed deep neural network

    Authors: Marco A. Martínez Ramírez, Emmanouil Benetos, Joshua D. Reiss

    Abstract: Plate and spring reverberators are electromechanical systems first used and researched as means to substitute real room reverberation. Nowadays they are often used in music production for aesthetic reasons due to their particular sonic characteristics. The modeling of these audio processors and their perceptual qualities is difficult since they use mechanical elements together with analog electron… ▽ More

    Submitted 17 April, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: Presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, May 2020. Source code, dataset, audio examples and more detailed diagrams: https://mchijmma.github.io/modeling-plate-spring-reverb/

  13. arXiv:1908.10391  [pdf, other

    eess.SY eess.SP

    Hopfield Learning-based and Nonlinear Programming methods for Resource Allocation in OCDMA Networks

    Authors: Cristiane A. Pendeza Martinez, Taufik Abrão, Fábio Renan Durand, Alessandro Goedtel

    Abstract: This paper proposes the deployment of the Hopfield's artificial neural network (H-NN) approach to optimally assign power in optical code division multiple access (OCDMA) systems. Figures of merit such as feasibility of solutions and complexity are compared with the classical power allocation methods found in the literature, such as Sequential Quadratic Programming (SQP) and Augmented Lagrangian Me… ▽ More

    Submitted 4 September, 2019; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: 29 pages, 11 figures, 5 tables

  14. arXiv:1908.03679  [pdf, other

    eess.IV cs.CV cs.LG

    Distance Map Loss Penalty Term for Semantic Segmentation

    Authors: Francesco Caliva, Claudia Iriondo, Alejandro Morales Martinez, Sharmila Majumdar, Valentina Pedoia

    Abstract: Convolutional neural networks for semantic segmentation suffer from low performance at object boundaries. In medical imaging, accurate representation of tissue surfaces and volumes is important for tracking of disease biomarkers such as tissue morphology and shape features. In this work, we propose a novel distance map derived loss penalty term for semantic segmentation. We propose to use distance… ▽ More

    Submitted 9 August, 2019; originally announced August 2019.

    Comments: Medical Imaging with Deep Learning (MIDL2019) Conference [arXiv:1907.08612], Extended Abstract

    Report number: MIDL/2019/ExtendedAbstract/B1eIcvS45V

  15. arXiv:1905.06148  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    A general-purpose deep learning approach to model time-varying audio effects

    Authors: Marco A. Martínez Ramírez, Emmanouil Benetos, Joshua D. Reiss

    Abstract: Audio processors whose parameters are modified periodically over time are often referred as time-varying or modulation based audio effects. Most existing methods for modeling these type of effect units are often optimized to a very specific circuit and cannot be efficiently generalized to other time-varying effects. Based on convolutional and recurrent neural networks, we propose a deep learning a… ▽ More

    Submitted 21 June, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: audio files: https://mchijmma.github.io/modeling-time-varying/

  16. arXiv:1810.06603  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Modeling of nonlinear audio effects with end-to-end deep neural networks

    Authors: Marco A. Martínez Ramirez, Joshua D. Reiss

    Abstract: In the context of music production, distortion effects are mainly used for aesthetic reasons and are usually applied to electric musical instruments. Most existing methods for nonlinear modeling are often either simplified or optimized to a very specific circuit. In this work, we investigate deep learning architectures for audio processing and we aim to find a general purpose end-to-end deep neura… ▽ More

    Submitted 6 March, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: Presented at the 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, May 2019