Showing 1–50 of 152 results for author: Perez, M

  1. arXiv:2507.00539

    cs.CE physics.flu-dyn

    Ensemble Kalman Filter for Data Assimilation coupled with low-resolution computations techniques applied in Fluid Dynamics

    Authors: Paul Jeanney, Ashton Hetherington, Shady E. Ahmed, David Lanceta, Susana Saiz, José Miguel Perez, Soledad Le Clainche

    Abstract: This paper presents an innovative Reduced-Order Model (ROM) for merging experimental and simulation data using Data Assimilation (DA) to estimate the "True" state of a fluid dynamics system, leading to more accurate predictions. Our methodology introduces a novel approach implementing the Ensemble Kalman Filter (EnKF) within a reduced-dimensional framework, grounded in a robust theoretical foundat…

    Submitted 1 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

    Comments: article, 49 pages, 29 figures, 4 tables

    MSC Class: 62M20 (Primary) 65F30; 65C20; 76M12 (Secondary) ACM Class: G.1.3; G.3; I.6.3; G.1.10

  2. arXiv:2506.19892

    cs.CR cs.AI cs.DC cs.LG cs.PF

    RepuNet: A Reputation System for Mitigating Malicious Clients in DFL

    Authors: Isaac Marroqui Penalva, Enrique Tomás Martínez Beltrán, Manuel Gil Pérez, Alberto Huertas Celdrán

    Abstract: Decentralized Federated Learning (DFL) enables nodes to collaboratively train models without a central server, introducing new vulnerabilities since each node independently selects peers for model aggregation. Malicious nodes may exploit this autonomy by sending corrupted models (model poisoning), delaying model submissions (delay attack), or flooding the network with excessive messages, negativel…

    Submitted 24 June, 2025; originally announced June 2025.

  3. arXiv:2505.18978

    cs.CL

    AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models

    Authors: Miguel Angel Peñaloza Perez, Bruno Lopez Orozco, Jesus Tadeo Cruz Soto, Michelle Bruno Hernandez, Miguel Angel Alvarado Gonzalez, Sandra Malagon

    Abstract: Existing mathematical reasoning benchmarks are predominantly English-only or translation-based, which can introduce semantic drift and mask language-specific reasoning errors. To address this, we present AI4Math, a benchmark of 105 original university-level math problems natively authored in Spanish. The dataset spans seven advanced domains (Algebra, Calculus, Geometry, Probability, Number Theory,…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 36 pages, 5 figures

    MSC Class: 68 ACM Class: I.2

  4. arXiv:2505.15693

    cs.AI

    Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives

    Authors: Milad Kazemi, Mateo Perez, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Alvaro Velasquez

    Abstract: Recent advances in reinforcement learning (RL) have renewed focus on the design of reward functions that shape agent behavior. Manually designing reward functions is tedious and error-prone. A principled alternative is to specify behaviors in a formal language that can be automatically translated into rewards. Omega-regular languages are a natural choice for this purpose, given their established r…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 29 pages, 6 figures and 2 tables

  5. arXiv:2505.06276

    cs.RO

    SynSHRP2: A Synthetic Multimodal Benchmark for Driving Safety-critical Events Derived from Real-world Driving Data

    Authors: Liang Shi, Boyu Jiang, Zhenyuan Yuan, Miguel A. Perez, Feng Guo

    Abstract: Driving-related safety-critical events (SCEs), including crashes and near-crashes, provide essential insights for the development and safety evaluation of automated driving systems. However, two major challenges limit their accessibility: the rarity of SCEs and the presence of sensitive privacy information in the data. The Second Strategic Highway Research Program (SHRP 2) Naturalistic Driving Stu…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Accepted as a poster in CVPR 2025

  6. arXiv:2504.06774

    physics.flu-dyn cs.LG

    Hybrid machine learning models based on physical patterns to accelerate CFD simulations: a short guide on autoregressive models

    Authors: Arindam Sengupta, Rodrigo Abadía-Heredia, Ashton Hetherington, José Miguel Pérez, Soledad Le Clainche

    Abstract: Accurate modeling of the complex dynamics of fluid flows is a fundamental challenge in computational physics and engineering. This study presents an innovative integration of High-Order Singular Value Decomposition (HOSVD) with Long Short-Term Memory (LSTM) architectures to address the complexities of reduced-order modeling (ROM) in fluid dynamics. HOSVD improves the dimensionality reduction proce…

    Submitted 9 April, 2025; originally announced April 2025.

  7. arXiv:2503.13450

    cs.NI cs.IT eess.IV eess.SP

    The Future of IPTV: Security, AI Integration, 5G, and Next-Gen Streaming

    Authors: Georgios Giannakopoulos, Peter Adegbenro, Maria Antonnette Perez

    Abstract: The evolution of Internet Protocol Television (IPTV) has transformed the landscape of digital broadcasting by leveraging high-speed internet connectivity to deliver high-quality multimedia content. IPTV provides a dynamic and interactive television experience through managed networks, ensuring superior Quality of Service (QoS) compared to open-network Internet TV. This study explores the technical…

    Submitted 20 March, 2025; v1 submitted 29 December, 2024; originally announced March 2025.

    Comments: 12 pages, 3 figures

  8. arXiv:2503.08284

    cs.CR

    Neural cyberattacks applied to the vision under realistic visual stimuli

    Authors: Victoria Magdalena López Madejska, Sergio López Bernal, Gregorio Martínez Pérez, Alberto Huertas Celdrán

    Abstract: Brain-Computer Interfaces (BCIs) are systems traditionally used in medicine and designed to interact with the brain to record or stimulate neurons. Despite their benefits, the literature has demonstrated that invasive BCIs focused on neurostimulation present vulnerabilities allowing attackers to gain control. In this context, neural cyberattacks emerged as threats able to disrupt spontaneous neura…

    Submitted 11 March, 2025; originally announced March 2025.

  9. arXiv:2502.10567

    cs.LG cs.AI

    Efficient Hierarchical Contrastive Self-supervising Learning for Time Series Classification via Importance-aware Resolution Selection

    Authors: Kevin Garcia, Juan Manuel Perez, Yifeng Gao

    Abstract: Recently, there has been a significant advancement in designing Self-Supervised Learning (SSL) frameworks for time series data to reduce the dependency on data labels. Among these works, hierarchical contrastive learning-based SSL frameworks, which learn representations by contrasting data embeddings at multiple resolutions, have gained considerable attention. Due to their ability to gather more i…

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: Appears in IEEEBigData-2024

    ACM Class: I.2

  10. arXiv:2502.00016

    cs.CY cs.HC

    Large Language Models for Education: ChemTAsk -- An Open-Source Paradigm for Automated Q&A in the Graduate Classroom

    Authors: Ryann M. Perez, Marie Shimogawa, Yanan Chang, Hoang Anh T. Phan, Jason G. Marmorstein, Evan S. K. Yanagawa, E. James Petersson

    Abstract: Large language models (LLMs) show promise for aiding graduate level education, but are limited by their training data and potential confabulations. We developed ChemTAsk, an open-source pipeline that combines LLMs with retrieval-augmented generation (RAG) to provide accurate, context-specific assistance. ChemTAsk utilizes course materials, including lecture transcripts and primary publications, to…

    Submitted 6 February, 2025; v1 submitted 9 January, 2025; originally announced February 2025.

    Comments: 38 pages, 3 figures, 1 table

  11. arXiv:2501.19279

    cs.LG cs.DC

    S-VOTE: Similarity-based Voting for Client Selection in Decentralized Federated Learning

    Authors: Pedro Miguel Sánchez Sánchez, Enrique Tomás Martínez Beltrán, Chao Feng, Gérôme Bovet, Gregorio Martínez Pérez, Alberto Huertas Celdrán

    Abstract: Decentralized Federated Learning (DFL) enables collaborative, privacy-preserving model training without relying on a central server. This decentralized approach reduces bottlenecks and eliminates single points of failure, enhancing scalability and resilience. However, DFL also introduces challenges such as suboptimal models with non-IID data distributions, increased communication overhead, and res…

    Submitted 31 January, 2025; originally announced January 2025.

    Comments: Submitted to IJCNN

  12. arXiv:2501.12229

    cs.CR cs.DC

    Empower Healthcare through a Self-Sovereign Identity Infrastructure for Secure Electronic Health Data Access

    Authors: Antonio López Martínez, Montassar Naghmouchi, Maryline Laurent, Joaquin Garcia-Alfaro, Manuel Gil Pérez, Antonio Ruiz Martínez, Pantaleone Nespoli

    Abstract: Health data is one of the most sensitive data for people, which attracts the attention of malicious activities. We propose an open-source health data management framework, that follows a patient-centric approach. The proposed framework implements the Self-Sovereign Identity paradigm with innovative technologies such as Decentralized Identifiers and Verifiable Credentials. The framework uses Blockc…

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: 40 pages, 11 figures

  13. arXiv:2412.11207

    cs.LG cs.AI cs.DC cs.NI

    ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes

    Authors: Pedro Miguel Sánchez Sánchez, Enrique Tomás Martínez Beltrán, Miguel Fernández Llamas, Gérôme Bovet, Gregorio Martínez Pérez, Alberto Huertas Celdrán

    Abstract: Decentralized Federated Learning (DFL) trains models in a collaborative and privacy-preserving manner while removing model centralization risks and improving communication bottlenecks. However, DFL faces challenges in efficient communication management and model aggregation within decentralized environments, especially with heterogeneous data distributions. Thus, this paper introduces ProFe, a nov…

    Submitted 15 December, 2024; originally announced December 2024.

  14. arXiv:2410.12174

    cs.CL

    Exploring Large Language Models for Hate Speech Detection in Rioplatense Spanish

    Authors: Juan Manuel Pérez, Paula Miguel, Viviana Cotik

    Abstract: Hate speech detection deals with many language variants, slang, slurs, expression modalities, and cultural nuances. This outlines the importance of working with specific corpora, when addressing hate speech within the scope of Natural Language Processing, recently revolutionized by the irruption of Large Language Models. This work presents a brief analysis of the performance of large language mode…

    Submitted 15 October, 2024; originally announced October 2024.

  15. arXiv:2410.06127

    cs.LG

    De-VertiFL: A Solution for Decentralized Vertical Federated Learning

    Authors: Alberto Huertas Celdrán, Chao Feng, Sabyasachi Banik, Gerome Bovet, Gregorio Martinez Perez, Burkhard Stiller

    Abstract: Federated Learning (FL), introduced in 2016, was designed to enhance data privacy in collaborative model training environments. Among the FL paradigm, horizontal FL, where clients share the same set of features but different data samples, has been extensively studied in both centralized and decentralized settings. In contrast, Vertical Federated Learning (VFL), which is crucial in real-world decen…

    Submitted 4 February, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

  16. arXiv:2409.12427

    cs.LG cs.CY

    Sustainable Visions: Unsupervised Machine Learning Insights on Global Development Goals

    Authors: Alberto García-Rodríguez, Matias Núñez, Miguel Robles Pérez, Tzipe Govezensky, Rafael A. Barrio, Carlos Gershenson, Kimmo K. Kaski, Julia Tagüeña

    Abstract: The 2030 Agenda for Sustainable Development of the United Nations outlines 17 goals for countries of the world to address global challenges in their development. However, the progress of countries towards these goals has been slower than expected and, consequently, there is a need to investigate the reasons behind this fact. In this study, we have used a novel data-driven methodology to analyze tim…

    Submitted 10 March, 2025; v1 submitted 18 September, 2024; originally announced September 2024.

  17. arXiv:2409.07194

    cs.CR cs.AI cs.GT

    Cyber Deception: State of the art, Trends and Open challenges

    Authors: Pedro Beltrán López, Manuel Gil Pérez, Pantaleone Nespoli

    Abstract: The growing interest in cybersecurity has significantly increased articles designing and implementing various Cyber Deception (CYDEC) mechanisms. This trend reflects the urgent need for new strategies to address cyber threats effectively. Since its emergence, CYDEC has established itself as an innovative defense against attackers, thanks to its proactive and reactive capabilities, finding applicat…

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: 38 pages

  18. arXiv:2409.05994

    cs.CL cs.AI

    MessIRve: A Large-Scale Spanish Information Retrieval Dataset

    Authors: Francisco Valentini, Viviana Cotik, Damián Furman, Ivan Bercovich, Edgar Altszyler, Juan Manuel Pérez

    Abstract: Information retrieval (IR) is the task of finding relevant documents in response to a user query. Although Spanish is the second most spoken native language, current IR benchmarks lack Spanish data, hindering the development of information access tools for Spanish speakers. We introduce MessIRve, a large-scale Spanish IR dataset with around 730 thousand queries from Google's autocomplete API and r…

    Submitted 9 September, 2024; originally announced September 2024.

  19. RAVE Checklist: Recommendations for Overcoming Challenges in Retrospective Safety Studies of Automated Driving Systems

    Authors: John M. Scanlon, Eric R. Teoh, David G. Kidd, Kristofer D. Kusano, Jonas Bärgman, Geoffrey Chi-Johnston, Luigi Di Lillo, Francesca Favaro, Carol Flannagan, Henrik Liers, Bonnie Lin, Magdalena Lindman, Shane McLaughlin, Miguel Perez, Trent Victor

    Abstract: The public, regulators, and domain experts alike seek to understand the effect of deployed SAE level 4 automated driving system (ADS) technologies on safety. The recent expansion of ADS technology deployments is paving the way for early stage safety impact evaluations, whereby the observational data from both an ADS and a representative benchmark fleet are compared to quantify safety performance.…

    Submitted 14 August, 2024; originally announced August 2024.

  20. arXiv:2407.11345

    cs.CL cs.SD eess.AS

    Beyond Binary: Multiclass Paraphasia Detection with Generative Pretrained Transformers and End-to-End Models

    Authors: Matthew Perez, Aneesha Sampath, Minxue Niu, Emily Mower Provost

    Abstract: Aphasia is a language disorder that can lead to speech errors known as paraphasias, which involve the misuse, substitution, or invention of words. Automatic paraphasia detection can help those with Aphasia by facilitating clinical assessment and treatment planning options. However, most automatic paraphasia detection works have focused solely on binary detection, which involves recognizing only th…

    Submitted 15 July, 2024; originally announced July 2024.

  21. arXiv:2407.07258

    cs.CL cs.LG

    Identification of emotions on Twitter during the 2022 electoral process in Colombia

    Authors: Juan Jose Iguaran Fernandez, Juan Manuel Perez, German Rosati

    Abstract: The study of Twitter as a means for analyzing social phenomena has gained interest in recent years due to the availability of large amounts of data in a relatively spontaneous environment. Within opinion-mining tasks, emotion detection is especially relevant, as it allows for the identification of people's subjective responses to different social events in a more granular way than traditional senti…

    Submitted 9 July, 2024; originally announced July 2024.

  22. arXiv:2405.09318

    cs.CR cs.LG

    Transfer Learning in Pre-Trained Large Language Models for Malware Detection Based on System Calls

    Authors: Pedro Miguel Sánchez Sánchez, Alberto Huertas Celdrán, Gérôme Bovet, Gregorio Martínez Pérez

    Abstract: In the current cybersecurity landscape, protecting military devices such as communication and battlefield management systems against sophisticated cyber attacks is crucial. Malware exploits vulnerabilities through stealth methods, often evading traditional detection mechanisms such as software signatures. The application of ML/DL in vulnerability detection has been extensively explored in the lite…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE MILCOM 2024

  23. arXiv:2404.15324

    eess.SP cs.AI eess.SY

    Advanced simulation-based predictive modelling for solar irradiance sensor farms

    Authors: José L. Risco-Martín, Ignacio-Iker Prado-Rujas, Javier Campoy, María S. Pérez, Katzalin Olcoz

    Abstract: As solar power continues to grow and replace traditional energy sources, the need for reliable forecasting models becomes increasingly important to ensure the stability and efficiency of the grid. However, the management of these models still needs to be improved, and new tools and technologies are required to handle the deployment and control of solar facilities. This work introduces a novel fram…

    Submitted 5 April, 2024; originally announced April 2024.

    Journal ref: Journal of Simulation, pp. 1-18, 2024

  24. arXiv:2404.10730   

    cs.LG cs.AI

    Insight Gained from Migrating a Machine Learning Model to Intelligence Processing Units

    Authors: Hieu Le, Zhenhua He, Mai Le, Dhruva K. Chakravorty, Lisa M. Perez, Akhil Chilumuru, Yan Yao, Jiefu Chen

    Abstract: The discoveries in this paper show that Intelligence Processing Units (IPUs) offer a viable accelerator alternative to GPUs for machine learning (ML) applications within the fields of materials science and battery research. We investigate the process of migrating a model from GPU to IPU and explore several optimization techniques, including pipelining and gradient accumulation, aimed at enhancing…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: This version has been removed by arXiv administrators as the submitter did not have the right to agree to the license at the time of submission

  25. Does Differentially Private Synthetic Data Lead to Synthetic Discoveries?

    Authors: Ileana Montoya Perez, Parisa Movahedi, Valtteri Nieminen, Antti Airola, Tapio Pahikkala

    Abstract: Background: Synthetic data has been proposed as a solution for sharing anonymized versions of sensitive biomedical datasets. Ideally, synthetic data should preserve the structure and statistical properties of the original data, while protecting the privacy of the individual subjects. Differential privacy (DP) is currently considered the gold standard approach for balancing this trade-off. Object…

    Submitted 23 August, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Journal ref: Methods Inf Med 2024; 63(01/02): 035-051

  26. arXiv:2403.07562

    cs.SE cs.LG

    A Flexible Cell Classification for ML Projects in Jupyter Notebooks

    Authors: Miguel Perez, Selin Aydin, Horst Lichter

    Abstract: Jupyter Notebook is an interactive development environment commonly used for rapid experimentation of machine learning (ML) solutions. Describing the ML activities performed along code cells improves the readability and understanding of Notebooks. Manual annotation of code cells is time-consuming and error-prone. Therefore, tools have been developed that classify the cells of a notebook concerning…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures

  27. arXiv:2402.09191

    cs.CR cs.NI cs.PF eess.SY

    Cyber Deception Reactive: TCP Stealth Redirection to On-Demand Honeypots

    Authors: Pedro Beltran Lopez, Pantaleone Nespoli, Manuel Gil Perez

    Abstract: Cybersecurity is developing rapidly, and new methods of defence against attackers are appearing, such as Cyber Deception (CYDEC). CYDEC consists of deceiving the enemy who performs actions without realising that he/she is being deceived. This article proposes designing, implementing, and evaluating a deception mechanism based on the stealthy redirection of TCP communications to an on-demand honey…

    Submitted 20 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  28. arXiv:2402.02384

    eess.SP cs.AR cs.SD eess.AS

    Acoustic Local Positioning With Encoded Emission Beacons

    Authors: Jesus Urena, Alvaro Hernandez, Juan Jesus Garcia, Jose Manuel Villadangos, Maria del Carmen Perez, David Gualda, Fernando J. Alvarez, Teodoro Aguilera

    Abstract: Acoustic local positioning systems (ALPSs) are an interesting alternative for indoor positioning due to certain advantages over other approaches, including their relatively high accuracy, low cost, and room-level signal propagation. Centimeter-level or fine-grained indoor positioning can be an asset for robot navigation, guiding a person to, for instance, a particular piece in a museum or to a spe…

    Submitted 4 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the IEEE, vol. 106, no. 6, pp. 1042-1062, Jun. 2018

  29. arXiv:2401.16434

    eess.SY cs.LG eess.SP

    A novel ANROA based control approach for grid-tied multi-functional solar energy conversion system

    Authors: Dinanath Prasad, Narendra Kumar, Rakhi Sharma, Hasmat Malik, Fausto Pedro García Márquez, Jesús María Pinar Pérez

    Abstract: An adaptive control approach for a three-phase grid-interfaced solar photovoltaic system based on the new Neuro-Fuzzy Inference System with Rain Optimization Algorithm (ANROA) methodology is proposed and discussed in this manuscript. This method incorporates an Adaptive Neuro-fuzzy Inference System (ANFIS) with a Rain Optimization Algorithm (ROA). The ANFIS controller has excellent maximum trackin…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: The paper was published in Energy Reports journal (ELSEVIER). Cite as: Prasad, D., Kumar, N., Sharma, R., Malik, H., Márquez, F. P. G., & Pinar-Pérez, J. M. (2023). A novel ANROA based control approach for grid-tied multi-functional solar energy conversion system. Energy Reports, 9, 2044-2057

    Journal ref: Energy Reports (2023) Elsevier

  30. arXiv:2401.13320

    cs.DC cs.IR

    A Big Data Architecture for Early Identification and Categorization of Dark Web Sites

    Authors: Javier Pastor-Galindo, Hông-Ân Sandlin, Félix Gómez Mármol, Gérôme Bovet, Gregorio Martínez Pérez

    Abstract: The dark web has become notorious for its association with illicit activities and there is a growing need for systems to automate the monitoring of this space. This paper proposes an end-to-end scalable architecture for the early identification of new Tor sites and the daily analysis of their content. The solution is built using an Open Source Big Data stack for data serving with Kubernetes, Kafka…

    Submitted 24 January, 2024; originally announced January 2024.

  31. arXiv:2401.08251

    cs.GT econ.GN eess.SY

    A techno-economic model for avoiding conflicts of interest between owners of offshore wind farms and maintenance suppliers

    Authors: Alberto Pliego Marugán, Fausto Pedro García Márquez, Jesús María Pinar Pérez

    Abstract: Currently, wind energy is one of the most important sources of renewable energy. Offshore locations for wind turbines are increasingly exploited because of their numerous advantages. However, offshore wind farms require high investment in maintenance service. Due to its complexity and special requirements, maintenance service is usually outsourced by wind farm owners. In this paper, we propose a n…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Published in Renewable and Sustainable Energy Reviews (ELSEVIER) 10 July 2022. DOI: https://doi.org/10.1016/j.rser.2022.112753 Cite as: Marugán, A. P., Márquez, F. P. G., & Pérez, J. M. P. (2022). A techno-economic model for avoiding conflicts of interest between owners of offshore wind farms and maintenance suppliers. Renewable and Sustainable Energy Reviews, 168, 112753

  32. arXiv:2312.14717

    physics.data-an cs.RO

    Kinematic Characterization of Micro-Mobility Vehicles During Evasive Maneuvers

    Authors: Paolo Terranova, Shu-Yuan Liu, Sparsh Jain, Johan Engstrom, Miguel Perez

    Abstract: There is an increasing need to comprehensively characterize the kinematic performances of different Micromobility Vehicles (MMVs). This study aims to: 1) characterize the kinematic behaviors of different MMVs during emergency maneuvers; 2) explore the influence of different MMV power sources on the device performances; 3) investigate if piecewise linear models are suitable for modeling MMV traject…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 21 pages, 8 figures

  33. arXiv:2312.10518

    cs.SD cs.AI eess.AS

    Seq2seq for Automatic Paraphasia Detection in Aphasic Speech

    Authors: Matthew Perez, Duc Le, Amrit Romana, Elise Jones, Keli Licata, Emily Mower Provost

    Abstract: Paraphasias are speech errors that are often characteristic of aphasia and they represent an important signal in assessing disease severity and subtype. Traditionally, clinicians manually identify paraphasias by transcribing and analyzing speech-language samples, which can be a time-consuming and burdensome process. Identifying paraphasias automatically can greatly help clinicians with the transcr…

    Submitted 16 December, 2023; originally announced December 2023.

  34. arXiv:2312.09938

    cs.LG cs.AI cs.MA

    Assume-Guarantee Reinforcement Learning

    Authors: Milad Kazemi, Mateo Perez, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Alvaro Velasquez

    Abstract: We present a modular approach to reinforcement learning (RL) in environments consisting of simpler components evolving in parallel. A monolithic view of such modular environments may be prohibitively large to learn, or may require unrealizable communication between the components in the form of a centralized controller. Our proposed approach is based on the assume-guarantee paradigm where t…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: This is the extended version of the paper accepted in the SRRAI Special Track at the Conference on Artificial Intelligence (AAAI-24)

  35. arXiv:2312.08602

    cs.LO cs.LG

    Omega-Regular Decision Processes

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: Regular decision processes (RDPs) are a subclass of non-Markovian decision processes where the transition and reward functions are guarded by some regular property of the past (a lookback). While RDPs enable intuitive and succinct representation of non-Markovian decision processes, their expressive power coincides with finite-state Markov decision processes (MDPs). We introduce omega-regular decis…

    Submitted 13 December, 2023; originally announced December 2023.

  36. arXiv:2311.05270

    cs.HC

    Evaluation of Data Processing and Machine Learning Techniques in P300-based Authentication using Brain-Computer Interfaces

    Authors: Eduardo López Bernal, Sergio López Bernal, Gregorio Martínez Pérez, Alberto Huertas Celdrán

    Abstract: Brain-Computer Interfaces (BCIs) are used in various application scenarios allowing direct communication between the brain and computers. Specifically, electroencephalography (EEG) is one of the most common techniques for obtaining evoked potentials resulting from external stimuli, as the P300 potential is elicited from known images. The combination of Machine Learning (ML) and P300 potentials is…

    Submitted 9 November, 2023; originally announced November 2023.

  37. arXiv:2310.12248

    cs.LG cs.LO

    A PAC Learning Algorithm for LTL and Omega-regular Objectives in MDPs

    Authors: Mateo Perez, Fabio Somenzi, Ashutosh Trivedi

    Abstract: Linear temporal logic (LTL) and omega-regular objectives -- a superset of LTL -- have seen recent use as a way to express non-Markovian objectives in reinforcement learning. We introduce a model-based probably approximately correct (PAC) learning algorithm for omega-regular objectives in Markov decision processes (MDPs). As part of the development of our algorithm, we introduce the epsilon-recurre…

    Submitted 20 February, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

  38. arXiv:2308.07469

    cs.LG cs.AI cs.FL

    Omega-Regular Reward Machines

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: Reinforcement learning (RL) is a powerful approach for training agents to perform tasks, but designing an appropriate reward mechanism is critical to its success. However, in many cases, the complexity of the learning objectives goes beyond the capabilities of the Markovian assumption, necessitating a more sophisticated reward mechanism. Reward machines and omega-regular languages are two formalis…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: To appear in ECAI-2023

  39. arXiv:2308.05978

    cs.CR cs.AI

    CyberForce: A Federated Reinforcement Learning Framework for Malware Mitigation

    Authors: Chao Feng, Alberto Huertas Celdran, Pedro Miguel Sanchez Sanchez, Jan Kreischer, Jan von der Assen, Gerome Bovet, Gregorio Martinez Perez, Burkhard Stiller

    Abstract: Recent research has shown that the integration of Reinforcement Learning (RL) with Moving Target Defense (MTD) can enhance cybersecurity in Internet-of-Things (IoT) devices. Nevertheless, the practicality of existing work is hindered by data privacy concerns associated with centralized data processing in RL, and the unsatisfactory time needed to learn right MTD techniques that are effective agains…

    Submitted 30 September, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

  40. arXiv:2307.11730

    cs.CR cs.AI cs.DC cs.LG cs.NI

    Mitigating Communications Threats in Decentralized Federated Learning through Moving Target Defense

    Authors: Enrique Tomás Martínez Beltrán, Pedro Miguel Sánchez Sánchez, Sergio López Bernal, Gérôme Bovet, Manuel Gil Pérez, Gregorio Martínez Pérez, Alberto Huertas Celdrán

    Abstract: The rise of Decentralized Federated Learning (DFL) has enabled the training of machine learning models across federated participants, fostering decentralized model aggregation and reducing dependence on a server. However, this approach introduces unique communication security challenges that have yet to be thoroughly addressed in the literature. These challenges primarily originate from the decent…

    Submitted 9 December, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  41. arXiv:2306.15562  [pdf, other]

    cs.DC

    Challenges and Opportunities for RISC-V Architectures towards Genomics-based Workloads

    Authors: Gonzalo Gomez-Sanchez, Aaron Call, Xavier Teruel, Lorena Alonso, Ignasi Moran, Miguel Angel Perez, David Torrents, Josep Ll. Berral

    Abstract: The use of large-scale supercomputing architectures is a hard requirement for scientific computing Big-Data applications. An example is genomics analytics, where millions of data transformations and tests per patient need to be done to find relevant clinical indicators. Therefore, to ensure open and broad access to high-performance technologies, governments, and academia are pushing toward the int…

    Submitted 27 June, 2023; originally announced June 2023.

    Journal ref: Presented at the ISC High-Performance Computing 2023

  42. arXiv:2306.15559  [pdf, other]

    cs.CR cs.AI cs.LG

    RansomAI: AI-powered Ransomware for Stealthy Encryption

    Authors: Jan von der Assen, Alberto Huertas Celdrán, Janik Luechinger, Pedro Miguel Sánchez Sánchez, Gérôme Bovet, Gregorio Martínez Pérez, Burkhard Stiller

    Abstract: Cybersecurity solutions have shown promising performance when detecting ransomware samples that use fixed algorithms and encryption rates. However, due to the current explosion of Artificial Intelligence (AI), sooner than later, ransomware (and malware in general) will incorporate AI techniques to intelligently and dynamically adapt its encryption behavior to be undetected. It might result in inef…

    Submitted 27 June, 2023; originally announced June 2023.

  43. arXiv:2306.09750  [pdf, other]

    cs.LG cs.AI cs.DC cs.NI

    Fedstellar: A Platform for Decentralized Federated Learning

    Authors: Enrique Tomás Martínez Beltrán, Ángel Luis Perales Gómez, Chao Feng, Pedro Miguel Sánchez Sánchez, Sergio López Bernal, Gérôme Bovet, Manuel Gil Pérez, Gregorio Martínez Pérez, Alberto Huertas Celdrán

    Abstract: In 2016, Google proposed Federated Learning (FL) as a novel paradigm to train Machine Learning (ML) models across the participants of a federation while preserving data privacy. Since its birth, Centralized FL (CFL) has been the most used approach, where a central entity aggregates participants' models to create a global one. However, CFL presents limitations such as communication bottlenecks, sin…

    Submitted 8 April, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  44. arXiv:2306.08495  [pdf, other]

    cs.CR

    Single-board Device Individual Authentication based on Hardware Performance and Autoencoder Transformer Models

    Authors: Pedro Miguel Sánchez Sánchez, Alberto Huertas Celdrán, Gérôme Bovet, Gregorio Martínez Pérez

    Abstract: The proliferation of the Internet of Things (IoT) has led to the emergence of crowdsensing applications, where a multitude of interconnected devices collaboratively collect and analyze data. Ensuring the authenticity and integrity of the data collected by these devices is crucial for reliable decision-making and maintaining trust in the system. Traditional authentication methods are often vulnerab…

    Submitted 11 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

  45. arXiv:2305.17115  [pdf, other]

    cs.LO cs.LG

    Policy Synthesis and Reinforcement Learning for Discounted LTL

    Authors: Rajeev Alur, Osbert Bastani, Kishor Jothimurugan, Mateo Perez, Fabio Somenzi, Ashutosh Trivedi

    Abstract: The difficulty of manually specifying reward functions has led to an interest in using linear temporal logic (LTL) to express objectives for reinforcement learning (RL). However, LTL has the downside that it is sensitive to small perturbations in the transition probabilities, which prevents probably approximately correct (PAC) learning without additional assumptions. Time discounting provides a wa…

    Submitted 29 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  46. arXiv:2303.03755  [pdf, other]

    cs.CV cs.AI cs.LG

    DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout Transformer

    Authors: Elad Levi, Eli Brosh, Mykola Mykhailych, Meir Perez

    Abstract: Generating visual layouts is an essential ingredient of graphic design. The ability to condition layout generation on a partial subset of component attributes is critical to real-world applications that involve user interaction. Recently, diffusion models have demonstrated high-quality generative performances in various domains. However, it is unclear how to apply diffusion models to the natural r…

    Submitted 7 March, 2023; originally announced March 2023.

  47. arXiv:2302.13784  [pdf, other]

    cs.CL cs.LG

    Solution for the EPO CodeFest on Green Plastics: Hierarchical multi-label classification of patents relating to green plastics using deep learning

    Authors: Tingting Qiao, Gonzalo Moro Perez

    Abstract: This work aims at hierarchical multi-label patents classification for patents disclosing technologies related to green plastics. This is an emerging field for which there is currently no classification scheme, and hence, no labeled data is available, making this task particularly challenging. We first propose a classification scheme for this technology and a way to learn a machine learning model t…

    Submitted 22 February, 2023; originally announced February 2023.

  48. STB-VMM: Swin Transformer Based Video Motion Magnification

    Authors: Ricard Lado-Roigé, Marco A. Pérez

    Abstract: The goal of video motion magnification techniques is to magnify small motions in a video to reveal previously invisible or unseen movement. Its uses extend from bio-medical applications and deepfake detection to structural modal analysis and predictive maintenance. However, discerning small motion from noise is a complex task, especially when attempting to magnify very subtle, often sub-pixel move…

    Submitted 27 March, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Code available at: https://github.com/RLado/STB-VMM

    Journal ref: Knowl.-Based Syst. 269 (2023) 110493

  49. arXiv:2302.09844  [pdf, other]

    cs.CR cs.AI

    FederatedTrust: A Solution for Trustworthy Federated Learning

    Authors: Pedro Miguel Sánchez Sánchez, Alberto Huertas Celdrán, Ning Xie, Gérôme Bovet, Gregorio Martínez Pérez, Burkhard Stiller

    Abstract: The rapid expansion of the Internet of Things (IoT) and Edge Computing has presented challenges for centralized Machine and Deep Learning (ML/DL) methods due to the presence of distributed data silos that hold sensitive information. To address concerns regarding data privacy, collaborative and privacy-preserving ML/DL techniques like Federated Learning (FL) have emerged. However, ensuring data pri…

    Submitted 6 July, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

  50. arXiv:2301.06187  [pdf, ps, other]

    cs.CV cs.AI cs.LG

    CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A Survey

    Authors: Michael Perez, Corey Toler-Franklin

    Abstract: Classifying the behavior of humans or animals from videos is important in biomedical fields for understanding brain function and response to stimuli. Action recognition, classifying activities performed by one or more subjects in a trimmed video, forms the basis of many of these techniques. Deep learning models for human action recognition have progressed significantly over the last decade. Recent…

    Submitted 15 January, 2023; originally announced January 2023.

    Comments: 29 pages, 20 figures

    ACM Class: I.2; I.4; I.5