Showing 1–14 of 14 results for author: Lopez, J

Searching in archive eess.
  1. arXiv:2507.01750  [pdf, ps, other]

    eess.AS cs.SD

    Generalizable Detection of Audio Deepfakes

    Authors: Jose A. Lopez, Georg Stemmer, Héctor Cordourier Maruri

    Abstract: In this paper, we present our comprehensive study aimed at enhancing the generalization capabilities of audio deepfake detection models. We investigate the performance of various pre-trained backbones, including Wav2Vec2, WavLM, and Whisper, across a diverse set of datasets, including those from the ASVspoof challenges and additional sources. Our experiments focus on the effects of different data…

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 8 pages, 3 figures

  2. arXiv:2505.15822  [pdf, other]

    eess.IV cs.CV cs.LG

    MambaStyle: Efficient StyleGAN Inversion for Real Image Editing with State-Space Models

    Authors: Jhon Lopez, Carlos Hinojosa, Henry Arguello, Bernard Ghanem

    Abstract: The task of inverting real images into StyleGAN's latent space to manipulate their attributes has been extensively studied. However, existing GAN inversion methods struggle to balance high reconstruction quality, effective editability, and computational efficiency. In this paper, we introduce MambaStyle, an efficient single-stage encoder-based approach for GAN inversion and editing that leverages…

    Submitted 6 May, 2025; originally announced May 2025.

  3. arXiv:2505.14074  [pdf, ps, other]

    cs.HC cs.SD eess.AS

    Recreating Neural Activity During Speech Production with Language and Speech Model Embeddings

    Authors: Owais Mujtaba Khanday, Pablo Rodroguez San Esteban, Zubair Ahmad Lone, Marc Ouellet, Jose Andres Gonzalez Lopez

    Abstract: Understanding how neural activity encodes speech and language production is a fundamental challenge in neuroscience and artificial intelligence. This study investigates whether embeddings from large-scale, self-supervised language and speech models can effectively reconstruct high-gamma neural activity characteristics, key indicators of cortical processing, recorded during speech production. We le…

    Submitted 21 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted for presentation at Interspeech2025

  4. arXiv:2409.02290  [pdf, other]

    cs.RO cs.CV eess.IV

    Unsupervised Welding Defect Detection Using Audio And Video

    Authors: Georg Stemmer, Jose A. Lopez, Juan A. Del Hoyo Ontiveros, Arvind Raju, Tara Thimmanaik, Sovan Biswas

    Abstract: In this work we explore the application of AI to robotic welding. Robotic welding is a widely used technology in many industries, but robots currently do not have the capability to detect welding defects which get introduced due to various reasons in the welding process. We describe how deep-learning methods can be applied to detect weld defects in real-time by recording the welding process with m…

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 21 pages

  5. Privacy-Preserving Deep Learning Using Deformable Operators for Secure Task Learning

    Authors: Fabian Perez, Jhon Lopez, Henry Arguello

    Abstract: In the era of cloud computing and data-driven applications, it is crucial to protect sensitive information to maintain data privacy, ensuring truly reliable systems. As a result, preserving privacy in deep learning systems has become a critical concern. Existing methods for privacy preservation rely on image encryption or perceptual transformation approaches. However, they often suffer from reduce…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: copyright 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  6. Improvement of Performance in Freezing of Gait detection in Parkinsons Disease using Transformer networks and a single waist worn triaxial accelerometer

    Authors: Luis Sigcha, Luigi Borzì, Ignacio Pavón, Nélson Costa, Susana Costa, Pedro Arezes, Juan-Manuel López, Guillermo De Arcas

    Abstract: Freezing of gait (FOG) is one of the most incapacitating symptoms in Parkinson's disease, affecting more than 50 percent of patients in advanced stages of the disease. The presence of FOG may lead to falls and a loss of independence with a consequent reduction in the quality of life. Wearable technology and artificial intelligence have been used for automatic FOG detection to optimize monitoring. H…

    Submitted 4 April, 2024; originally announced April 2024.

    Journal ref: Engineering Applications of Artificial Intelligence Volume 116, November 2022, 105482

  7. arXiv:2404.00777  [pdf, other]

    cs.CV cs.AI cs.CR cs.LG eess.IV

    Privacy-preserving Optics for Enhancing Protection in Face De-identification

    Authors: Jhon Lopez, Carlos Hinojosa, Henry Arguello, Bernard Ghanem

    Abstract: The modern surge in camera usage alongside widespread computer vision technology applications poses significant privacy and security concerns. Current artificial intelligence (AI) technologies aid in recognizing relevant events and assisting in daily tasks in homes, offices, hospitals, etc. The need to access or process personal information for these purposes raises privacy concerns. While softwar…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024. Project Website and Code coming soon

  8. arXiv:2303.18060  [pdf]

    cs.LG cs.AI eess.SY

    NOSTROMO: Lessons learned, conclusions and way forward

    Authors: Mayte Cano, Andrés Perillo, Juan Antonio López, Faustino Tello, Javier Poveda, Francisco Câmara, Francisco Antunes, Christoffer Riis, Ian Crook, Abderrazak Tibichte, Sandrine Molton, David Mocholí, Ricardo Herranz, Gérald Gurtner, Tatjana Bolić, Andrew Cook, Jovana Kuljanin, Xavier Prats

    Abstract: This White Paper sets out to explain the value that metamodelling can bring to air traffic management (ATM) research. It will define metamodelling and explore what it can, and cannot, do. The reader is assumed to have basic knowledge of SESAR: the Single European Sky ATM Research project. An important element of SESAR, as the technological pillar of the Single European Sky initiative, is to bring…

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: White Paper of the NOSTROMO, an exploratory research project funded by the SESAR Joint Undertaking (SJU) under the European Union's Horizon 2020 research and innovation programme

  9. EndoMapper dataset of complete calibrated endoscopy procedures

    Authors: Pablo Azagra, Carlos Sostres, Ángel Ferrandez, Luis Riazuelo, Clara Tomasini, Oscar León Barbed, Javier Morlana, David Recasens, Victor M. Batlle, Juan J. Gómez-Rodríguez, Richard Elvira, Julia López, Cristina Oriol, Javier Civera, Juan D. Tardós, Ana Cristina Murillo, Angel Lanas, José M. M. Montiel

    Abstract: Computer-assisted systems are becoming broadly used in medicine. In endoscopy, most research focuses on the automatic detection of polyps or other pathologies, but localization and navigation of the endoscope are completely performed manually by physicians. To broaden this research and bring spatial Artificial Intelligence to endoscopies, data from complete procedures is needed. This paper introdu…

    Submitted 10 October, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 17 pages, 14 figures, 8 tables

    Journal ref: Sci Data 10, 671 (2023)

  10. Short-Term Flow-Based Bandwidth Forecasting using Machine Learning

    Authors: Maxime Labonne, Jorge López, Claude Poletti, Jean-Baptiste Munier

    Abstract: This paper proposes a novel framework to predict traffic flows' bandwidth ahead of time. Modern network management systems share a common issue: the network situation evolves between the moment the decision is made and the moment when actions (countermeasures) are applied. This framework converts packets from real-life traffic into flows containing relevant features. Machine learning models, inclu…

    Submitted 3 December, 2020; v1 submitted 29 November, 2020; originally announced November 2020.

    Comments: 4 pages, 1 figure, 3 tables

  11. arXiv:2010.10618  [pdf, other]

    cs.LG cs.AI eess.SY

    Runtime Safety Assurance Using Reinforcement Learning

    Authors: Christopher Lazarus, James G. Lopez, Mykel J. Kochenderfer

    Abstract: The airworthiness and safety of a non-pedigreed autopilot must be verified, but the cost to formally do so can be prohibitive. We can bypass formal verification of non-pedigreed components by incorporating Runtime Safety Assurance (RTSA) as a mechanism to ensure safety. RTSA consists of a meta-controller that observes the inputs and outputs of a non-pedigreed component and verifies formally specifie…

    Submitted 20 October, 2020; originally announced October 2020.

    Journal ref: 2020 IEEE/AIAA 39th Digital Avionics Systems Conference (DASC)

  12. arXiv:2002.11561  [pdf, other]

    cs.SD cs.LG eess.AS

    An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments

    Authors: Javier Naranjo-Alcazar, Sergi Perez-Castanos, Pedro Zuccarrello, Ana M. Torres, Jose J. Lopez, Franscesc J. Ferri, Maximo Cobos

    Abstract: The problem of training with a small set of positive samples is known as few-shot learning (FSL). It is widely known that traditional deep learning (DL) algorithms usually show very good performance when trained with large datasets. However, in many applications, it is not possible to obtain such a high number of samples. In the image domain, typical FSL applications include those related to face…

    Submitted 11 April, 2022; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: Submitted to IEEEAccess

  13. arXiv:1506.00300  [pdf, other]

    math.OC eess.SY

    How To Tame Your Sparsity Constraints

    Authors: Jose A. Lopez

    Abstract: We show that designing sparse $H_\infty$ controllers, in a discrete (LTI) setting, is easy when the controller is assumed to be an FIR filter. In this case, the problem reduces to a static output feedback problem with equality constraints. We show how to obtain an initial guess, for the controller, and then provide a simple algorithm that alternates between two (convex) feasibility programs until…

    Submitted 31 May, 2015; originally announced June 2015.

    Comments: 12 pages, 1 figure

  14. arXiv:1504.00905  [pdf, other]

    math.OC cs.CV cs.LG eess.SY

    Robust Anomaly Detection Using Semidefinite Programming

    Authors: Jose A. Lopez, Octavia Camps, Mario Sznaier

    Abstract: This paper presents a new approach, based on polynomial optimization and the method of moments, to the problem of anomaly detection. The proposed technique only requires information about the statistical moments of the normal-state distribution of the features of interest and compares favorably with existing approaches (such as Parzen windows and 1-class SVM). In addition, it provides a succinct d…

    Submitted 30 May, 2015; v1 submitted 3 April, 2015; originally announced April 2015.

    Comments: 13 pages, 11 figures