Showing 1–50 of 168 results for author: Pérez, J

Searching in archive cs.
  1. arXiv:2506.00150  [pdf, ps, other]

    cs.SE cs.AI

    Supporting architecture evaluation for ATAM scenarios with LLMs

    Authors: Rafael Capilla, J. Andrés Díaz-Pace, Yamid Ramírez, Jennifer Pérez, Vanessa Rodríguez-Horcajo

    Abstract: Architecture evaluation methods have long been used to evaluate software designs. Several evaluation methods have been proposed and used to analyze tradeoffs between different quality attributes. Having competing qualities leads to conflicts for selecting which quality-attribute scenarios are the most suitable ones that an architecture should tackle and for prioritizing the scenarios required by t…

    Submitted 30 May, 2025; originally announced June 2025.

  2. arXiv:2505.06267  [pdf, other]

    cs.SE cs.AI cs.LG

    AKD : Adversarial Knowledge Distillation For Large Language Models Alignment on Coding tasks

    Authors: Ilyas Oulkadda, Julien Perez

    Abstract: The widespread adoption of Large Language Models (LLMs) for code generation, exemplified by GitHub Copilot (a coding extension powered by a Code-LLM to assist in code completion tasks) surpassing a million users, highlights the transformative potential of these tools in improving developer productivity. However, this rapid growth also underscores critical concerns regarding the quality, sa…

    Submitted 5 May, 2025; originally announced May 2025.

  3. arXiv:2505.02148  [pdf, other]

    cs.CV

    Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving

    Authors: Alexey Nekrasov, Malcolm Burdorf, Stewart Worrall, Bastian Leibe, Julie Stephany Berrio Perez

    Abstract: To operate safely, autonomous vehicles (AVs) need to detect and handle unexpected objects or anomalies on the road. While significant research exists for anomaly detection and segmentation in 2D, research progress in 3D is underexplored. Existing datasets lack high-quality multimodal data that are typically found in AVs. This paper presents a novel dataset for anomaly segmentation in driving scena…

    Submitted 4 May, 2025; originally announced May 2025.

    Comments: Accepted for publication at CVPR 2025. Project page: https://www.vision.rwth-aachen.de/stu-dataset

  4. arXiv:2504.16538  [pdf, other]

    cs.CV cs.LG

    Streetscape Analysis with Generative AI (SAGAI): Vision-Language Assessment and Mapping of Urban Scenes

    Authors: Joan Perez, Giovanni Fusco

    Abstract: Streetscapes are an essential component of urban space. Their assessment is presently either limited to morphometric properties of their mass skeleton or requires labor-intensive qualitative evaluations of visually perceived qualities. This paper introduces SAGAI: Streetscape Analysis with Generative Artificial Intelligence, a modular workflow for scoring street-level urban scenes using open-acces…

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: 25 pages, 6 figures in main paper, 6 figures in appendices

    ACM Class: I.2; I.4; J.4

  5. arXiv:2504.15845  [pdf, ps, other]

    cs.PL

    Contrasting Deadlock-Free Session Processes (Extended Version)

    Authors: Juan C. Jaramillo, Jorge A. Pérez

    Abstract: Deadlock freedom is a crucial property for message-passing programs. Over the years, several different type systems for concurrent processes that ensure deadlock freedom have been proposed; this diversity raises the question of how they compare. We address this question, considering two type systems not covered in prior work: Kokke et al.'s HCP, a type system based on a linear logic with hypersequen…

    Submitted 25 April, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

    Comments: Full version of an ECOOP 25 paper

  6. arXiv:2504.06774  [pdf, other]

    physics.flu-dyn cs.LG

    Hybrid machine learning models based on physical patterns to accelerate CFD simulations: a short guide on autoregressive models

    Authors: Arindam Sengupta, Rodrigo Abadía-Heredia, Ashton Hetherington, José Miguel Pérez, Soledad Le Clainche

    Abstract: Accurate modeling of the complex dynamics of fluid flows is a fundamental challenge in computational physics and engineering. This study presents an innovative integration of High-Order Singular Value Decomposition (HOSVD) with Long Short-Term Memory (LSTM) architectures to address the complexities of reduced-order modeling (ROM) in fluid dynamics. HOSVD improves the dimensionality reduction proce…

    Submitted 9 April, 2025; originally announced April 2025.

  7. arXiv:2504.03814  [pdf, other]

    cs.LG cs.AI cs.CL

    Recursive Training Loops in LLMs: How training data properties modulate distribution shift in generated data?

    Authors: Grgur Kovač, Jérémy Perez, Rémy Portelas, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: Large language models (LLMs) are increasingly contributing to the creation of content on the Internet. This creates a feedback loop as subsequent generations of models will be trained on this generated, synthetic data. This phenomenon is receiving increasing interest, in particular because previous studies have shown that it may lead to distribution shift - models misrepresent and forget the true…

    Submitted 8 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

    MSC Class: 68T50 ACM Class: I.2.7

  8. arXiv:2502.14156  [pdf, other]

    cs.CV

    Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration

    Authors: Katie Z Luo, Minh-Quan Dao, Zhenzhen Liu, Mark Campbell, Wei-Lun Chao, Kilian Q. Weinberger, Ezio Malis, Vincent Fremont, Bharath Hariharan, Mao Shan, Stewart Worrall, Julie Stephany Berrio Perez

    Abstract: Vehicle-to-everything (V2X) collaborative perception has emerged as a promising solution to address the limitations of single-vehicle perception systems. However, existing V2X datasets are limited in scope, diversity, and quality. To address these gaps, we present Mixed Signals, a comprehensive V2X dataset featuring 45.1k point clouds and 240.6k bounding boxes collected from three connected autono…

    Submitted 19 February, 2025; originally announced February 2025.

  9. arXiv:2502.10567  [pdf, other]

    cs.LG cs.AI

    Efficient Hierarchical Contrastive Self-supervising Learning for Time Series Classification via Importance-aware Resolution Selection

    Authors: Kevin Garcia, Juan Manuel Perez, Yifeng Gao

    Abstract: Recently, there has been a significant advancement in designing Self-Supervised Learning (SSL) frameworks for time series data to reduce the dependency on data labels. Among these works, hierarchical contrastive learning-based SSL frameworks, which learn representations by contrasting data embeddings at multiple resolutions, have gained considerable attention. Due to their ability to gather more i…

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: Appears in IEEEBigData-2024

    ACM Class: I.2

  10. A Gentle Overview of Asynchronous Session-based Concurrency: Deadlock Freedom by Typing

    Authors: Bas van den Heuvel, Jorge A. Pérez

    Abstract: While formal models of concurrency tend to focus on synchronous communication, asynchronous communication is relevant in practice. In this paper, we will discuss asynchronous communication in the context of session-based concurrency, the model of computation in which session types specify the structure of the two-party protocols implemented by the channels of a communicating process. We overview r…

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: In Proceedings ICE 2024, arXiv:2412.07570

    ACM Class: D.1.3; D.2.4; D.3.1

    Journal ref: EPTCS 414, 2024, pp. 1-20

  11. Robots in the Wild: Contextually-Adaptive Human-Robot Interactions in Urban Public Environments

    Authors: Xinyan Yu, Yiyuan Wang, Tram Thi Minh Tran, Yi Zhao, Julie Stephany Berrio Perez, Marius Hoggenmuller, Justine Humphry, Lian Loke, Lynn Masuda, Callum Parker, Martin Tomitsch, Stewart Worrall

    Abstract: The increasing transition of human-robot interaction (HRI) context from controlled settings to dynamic, real-world public environments calls for enhanced adaptability in robotic systems. This can go beyond algorithmic navigation or traditional HRI strategies in structured settings, requiring the ability to navigate complex public urban systems containing multifaceted dynamics and various socio-tec…

    Submitted 9 December, 2024; v1 submitted 5 December, 2024; originally announced December 2024.

  12. arXiv:2412.02384  [pdf, other]

    cs.SE cs.LO

    Theory building for empirical software engineering in qualitative research: Operationalization

    Authors: Jorge Pérez, Jessica Díaz, Ángel González-Prieto, Sergio Gil-Borrás

    Abstract: Context: This work is part of a research project whose ultimate goal is to systematize theory building in qualitative research in the field of software engineering. The proposed methodology involves four phases: conceptualization, operationalization, testing, and application. In previous work, we performed the conceptualization of a theory that investigates the structure of IT departments and team…

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 22 pages, 7 figures

    ACM Class: D.2.0

  13. arXiv:2411.18677  [pdf, other]

    cs.CV cs.AI cs.LG

    MatchDiffusion: Training-free Generation of Match-cuts

    Authors: Alejandro Pardo, Fabio Pizzati, Tong Zhang, Alexander Pondaven, Philip Torr, Juan Camilo Perez, Bernard Ghanem

    Abstract: Match-cuts are powerful cinematic tools that create seamless transitions between scenes, delivering strong visual and metaphorical connections. However, crafting match-cuts is a challenging, resource-intensive process requiring deliberate artistic planning. In MatchDiffusion, we present the first training-free method for match-cut generation using text-to-video diffusion models. MatchDiffusion lev…

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: https://matchdiffusion.github.io

  14. Typed Non-determinism in Concurrent Calculi: The Eager Way

    Authors: Bas van den Heuvel, Daniele Nantes-Sobrinho, Joseph W. N. Paulus, Jorge A. Pérez

    Abstract: We consider the problem of designing typed concurrent calculi with non-deterministic choice in which types leverage linearity for controlling resources, thereby ensuring strong correctness properties for processes. This problem is constrained by the delicate tension between non-determinism and linearity. Prior work developed a session-typed π-calculus with standard non-deterministic choice; well-t…

    Submitted 7 December, 2024; v1 submitted 12 November, 2024; originally announced November 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2408.07915

    Journal ref: Electronic Notes in Theoretical Informatics and Computer Science, Volume 4 - Proceedings of MFPS XL (December 11, 2024) entics:14735

  15. arXiv:2410.20280  [pdf, other]

    cs.CV cs.AI

    MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

    Authors: Haozhe Liu, Shikun Liu, Zijian Zhou, Mengmeng Xu, Yanping Xie, Xiao Han, Juan C. Pérez, Ding Liu, Kumara Kahatapitiya, Menglin Jia, Jui-Chieh Wu, Sen He, Tao Xiang, Jürgen Schmidhuber, Juan-Manuel Pérez-Rúa

    Abstract: We introduce MarDini, a new family of video diffusion models that integrate the advantages of masked auto-regression (MAR) into a unified diffusion model (DM) framework. Here, MAR handles temporal planning, while DM focuses on spatial generation in an asymmetric network design: i) a MAR-based planning model containing most of the parameters generates planning signals for each masked frame using lo…

    Submitted 26 October, 2024; originally announced October 2024.

    Comments: Project Page: https://mardini-vidgen.github.io

  16. arXiv:2410.15690  [pdf, other]

    cs.CL

    Efficient Terminology Integration for LLM-based Translation in Specialized Domains

    Authors: Sejoon Kim, Mingi Sung, Jeonghwan Lee, Hyunkuk Lim, Jorge Froilan Gimenez Perez

    Abstract: Traditional machine translation methods typically involve training models directly on large parallel corpora, with limited emphasis on specialized terminology. However, in specialized fields such as patent, finance, or biomedical domains, terminology is crucial for translation, with many terms that need to be translated following agreed-upon conventions. In this paper we introduce a methodology t…

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted to WMT 2024

  17. arXiv:2410.12174  [pdf, other]

    cs.CL

    Exploring Large Language Models for Hate Speech Detection in Rioplatense Spanish

    Authors: Juan Manuel Pérez, Paula Miguel, Viviana Cotik

    Abstract: Hate speech detection deals with many language variants, slang, slurs, expression modalities, and cultural nuances. This outlines the importance of working with specific corpora, when addressing hate speech within the scope of Natural Language Processing, recently revolutionized by the irruption of Large Language Models. This work presents a brief analysis of the performance of large language mode…

    Submitted 15 October, 2024; originally announced October 2024.

  18. arXiv:2410.04855  [pdf, other]

    cs.RO cs.AI cs.LG

    Unsupervised Skill Discovery for Robotic Manipulation through Automatic Task Generation

    Authors: Paul Jansonnie, Bingbing Wu, Julien Perez, Jan Peters

    Abstract: Learning skills that interact with objects is of major importance for robotic manipulation. These skills can indeed serve as an efficient prior for solving various manipulation tasks. We propose a novel Skill Learning approach that discovers composable behaviors by solving a large and diverse number of autonomously generated tasks. Our method learns skills allowing the robot to consistently and ro…

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted at the 2024 IEEE-RAS International Conference on Humanoid Robots

  19. arXiv:2409.05994  [pdf, other]

    cs.CL cs.AI

    MessIRve: A Large-Scale Spanish Information Retrieval Dataset

    Authors: Francisco Valentini, Viviana Cotik, Damián Furman, Ivan Bercovich, Edgar Altszyler, Juan Manuel Pérez

    Abstract: Information retrieval (IR) is the task of finding relevant documents in response to a user query. Although Spanish is the second most spoken native language, current IR benchmarks lack Spanish data, hindering the development of information access tools for Spanish speakers. We introduce MessIRve, a large-scale Spanish IR dataset with around 730 thousand queries from Google's autocomplete API and r…

    Submitted 9 September, 2024; originally announced September 2024.

  20. arXiv:2408.13135  [pdf, other]

    cs.CV cs.AI

    Deep Learning at the Intersection: Certified Robustness as a Tool for 3D Vision

    Authors: Gabriel Pérez S, Juan C. Pérez, Motasem Alfarra, Jesús Zarzar, Sara Rojas, Bernard Ghanem, Pablo Arbeláez

    Abstract: This paper presents preliminary work on a novel connection between certified robustness in machine learning and the modeling of 3D objects. We highlight an intriguing link between the Maximal Certified Radius (MCR) of a classifier representing a space's occupancy and the space's Signed Distance Function (SDF). Leveraging this relationship, we propose to use the certification method of randomized s…

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: This paper is an accepted extended abstract to the LatinX workshop at ICCV 2023. This was uploaded a year late

  21. arXiv:2408.09223  [pdf, other]

    cs.NE

    A theoretical framework for reservoir computing on networks of organic electrochemical transistors

    Authors: Nicholas W. Landry, Beckett R. Hyde, Jake C. Perez, Sean E. Shaheen, Juan G. Restrepo

    Abstract: Efficient and accurate prediction of physical systems is important even when the rules of those systems cannot be easily learned. Reservoir computing, a type of recurrent neural network with fixed nonlinear units, is one such prediction method and is valued for its ease of training. Organic electrochemical transistors (OECTs) are physical devices with nonlinear transient properties that can be use…

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: 10 pages, 8 figures

  22. Learning and Motivational Impact of Game-Based Learning: Comparing Face-to-Face and Online Formats on Computer Science Education

    Authors: Daniel López-Fernández, Aldo Gordillo, Jennifer Pérez, Edmundo Tovar

    Abstract: Contribution: This article analyzes the learning and motivational impact of teacher-authored educational video games on computer science education and compares its effectiveness in both face-to-face and online (remote) formats. This work presents comparative data and findings obtained from 217 students who played the game in a face-to-face format (control group) and 104 students who played the gam…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 10 pages, 3 figures. Accepted version of a journal article published in IEEE Transactions on Education

    Journal ref: IEEE Transactions on Education, Volume 66, Issue 4, 2023

  23. arXiv:2407.07258  [pdf, other]

    cs.CL cs.LG

    Identification of emotions on Twitter during the 2022 electoral process in Colombia

    Authors: Juan Jose Iguaran Fernandez, Juan Manuel Perez, German Rosati

    Abstract: The study of Twitter as a means for analyzing social phenomena has gained interest in recent years due to the availability of large amounts of data in a relatively spontaneous environment. Within opinion-mining tasks, emotion detection is especially relevant, as it allows for the identification of people's subjective responses to different social events in a more granular way than traditional senti…

    Submitted 9 July, 2024; originally announced July 2024.

  24. arXiv:2407.06391  [pdf, other]

    cs.LO

    Around Classical and Intuitionistic Linear Processes

    Authors: Juan C. Jaramillo, Dan Frumin, Jorge A. Pérez

    Abstract: Curry-Howard correspondences between Linear Logic (LL) and session types provide a firm foundation for concurrent processes. As the correspondences hold for intuitionistic and classic versions of LL (ILL and CLL), we obtain two different families of type systems for concurrency. An open question remains: how do these two families exactly relate to each other? Based upon a translation from CLL to I…

    Submitted 22 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: Full version, 19 pages + appendices

  25. arXiv:2407.04503  [pdf, ps, other]

    physics.soc-ph cs.AI cs.MA

    When LLMs Play the Telephone Game: Cultural Attractors as Conceptual Tools to Evaluate LLMs in Multi-turn Settings

    Authors: Jérémy Perez, Grgur Kovač, Corentin Léger, Cédric Colas, Gaia Molinaro, Maxime Derex, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: As large language models (LLMs) start interacting with each other and generating an increasing amount of text online, it becomes crucial to better understand how information is transformed as it passes from one LLM to the next. While significant research has examined individual LLM behaviors, existing studies have largely overlooked the collective behaviors and information distortions arising from…

    Submitted 2 June, 2025; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: Code available at https://github.com/jeremyperez2/TelephoneGameLLM. Companion website with a Data Explorer tool at https://sites.google.com/view/telephone-game-llm

    MSC Class: 68T50 ACM Class: I.2.7

  26. arXiv:2406.06474  [pdf, other]

    cs.AI cs.CL

    Towards a Personal Health Large Language Model

    Authors: Justin Cosentino, Anastasiya Belyaeva, Xin Liu, Nicholas A. Furlotte, Zhun Yang, Chace Lee, Erik Schenck, Yojan Patel, Jian Cui, Logan Douglas Schneider, Robby Bryant, Ryan G. Gomes, Allen Jiang, Roy Lee, Yun Liu, Javier Perez, Jameson K. Rogers, Cathy Speed, Shyam Tailor, Megan Walker, Jeffrey Yu, Tim Althoff, Conor Heneghan, John Hernandez, Mark Malhotra, et al. (9 additional authors not shown)

    Abstract: In health, most large language model (LLM) research has focused on clinical tasks. However, mobile and wearable devices, which are rarely integrated into such tasks, provide rich, longitudinal data for personal health monitoring. Here we present Personal Health Large Language Model (PH-LLM), fine-tuned from Gemini for understanding and reasoning over numerical time-series personal health data. We…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 72 pages

  27. arXiv:2406.06464  [pdf, other]

    cs.AI cs.CL

    Transforming Wearable Data into Health Insights using Large Language Model Agents

    Authors: Mike A. Merrill, Akshay Paruchuri, Naghmeh Rezaei, Geza Kovacs, Javier Perez, Yun Liu, Erik Schenck, Nova Hammerquist, Jake Sunshine, Shyam Tailor, Kumar Ayush, Hao-Wei Su, Qian He, Cory Y. McLean, Mark Malhotra, Shwetak Patel, Jiening Zhan, Tim Althoff, Daniel McDuff, Xin Liu

    Abstract: Despite the proliferation of wearable health trackers and the importance of sleep and exercise to health, deriving actionable personalized insights from wearable data remains a challenge because doing so requires non-trivial open-ended analysis of these data. The recent rise of large language model (LLM) agents, which can use tools to reason about and interact with the world, presents a promising…

    Submitted 11 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 38 pages

  28. arXiv:2405.17146  [pdf, other]

    cs.CV

    Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration

    Authors: Juan C. Pérez, Alejandro Pardo, Mattia Soldan, Hani Itani, Juan Leon-Alcazar, Bernard Ghanem

    Abstract: This study investigates whether Compressed-Language Models (CLMs), i.e. language models operating on raw byte streams from Compressed File Formats (CFFs), can understand files compressed by CFFs. We focus on the JPEG format as a representative CFF, given its commonality and its representativeness of key concepts in compression, such as entropy coding and run-length encoding. We test if CLMs unders…

    Submitted 27 May, 2024; originally announced May 2024.

  29. arXiv:2403.08882  [pdf, other]

    cs.MA cs.AI q-bio.PE

    Cultural evolution in populations of Large Language Models

    Authors: Jérémy Perez, Corentin Léger, Marcela Ovando-Tellez, Chris Foulon, Joan Dussauld, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: Research in cultural evolution aims at providing causal explanations for the change of culture over time. Over the past decades, this field has generated an important body of knowledge, using experimental, historical, and computational methods. While computational models have been very successful at generating testable hypotheses about the effects of several factors, such as population structure o…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 17 pages, 20 figures. Open-source code available at https://github.com/jeremyperez2/LLM-Culture

    MSC Class: 68T50 ACM Class: I.2.7

  30. arXiv:2403.07842  [pdf, other]

    cs.LG cs.CR

    Quantifying and Mitigating Privacy Risks for Tabular Generative Models

    Authors: Chaoyi Zhu, Jiayi Tang, Hans Brouwer, Juan F. Pérez, Marten van Dijk, Lydia Y. Chen

    Abstract: Synthetic data from generative models emerges as the privacy-preserving data-sharing solution. Such a synthetic data set shall resemble the original data without revealing identifiable private information. The backbone technology of tabular synthesizers is rooted in image generative models, ranging from Generative Adversarial Networks (GANs) to recent diffusion models. Recent prior work sheds ligh…

    Submitted 12 March, 2024; originally announced March 2024.

  31. arXiv:2402.00823  [pdf, other]

    cs.LG cs.AI cs.RO

    SLIM: Skill Learning with Multiple Critics

    Authors: David Emukpere, Bingbing Wu, Julien Perez, Jean-Michel Renders

    Abstract: Self-supervised skill learning aims to acquire useful behaviors that leverage the underlying dynamics of the environment. Latent variable models, based on mutual information maximization, have been successful in this task but still struggle in the context of robotic manipulation. As it requires impacting a possibly large set of degrees of freedom composing the environment, mutual information maxim…

    Submitted 21 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted at IEEE ICRA 2024

  32. arXiv:2401.16434  [pdf]

    eess.SY cs.LG eess.SP

    A novel ANROA based control approach for grid-tied multi-functional solar energy conversion system

    Authors: Dinanath Prasad, Narendra Kumar, Rakhi Sharma, Hasmat Malik, Fausto Pedro García Márquez, Jesús María Pinar Pérez

    Abstract: An adaptive control approach for a three-phase grid-interfaced solar photovoltaic system based on the new Neuro-Fuzzy Inference System with Rain Optimization Algorithm (ANROA) methodology is proposed and discussed in this manuscript. This method incorporates an Adaptive Neuro-fuzzy Inference System (ANFIS) with a Rain Optimization Algorithm (ROA). The ANFIS controller has excellent maximum trackin…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: The paper was published in Energy Reports journal (ELSEVIER). Cite as: Prasad, D., Kumar, N., Sharma, R., Malik, H., Márquez, F. P. G., & Pinar-Pérez, J. M. (2023). A novel ANROA based control approach for grid-tied multi-functional solar energy conversion system. Energy Reports, 9, 2044-2057

    Journal ref: Energy Reports (2023) Elsevier

  33. arXiv:2401.14763  [pdf, ps, other]

    cs.LO

    Comparing Session Type Systems derived from Linear Logic

    Authors: Bas van den Heuvel, Jorge A. Pérez

    Abstract: Session types are a typed approach to message-passing concurrency, where types describe sequences of intended exchanges over channels. Session type systems have been given strong logical foundations via Curry-Howard correspondences with linear logic, a resource-aware logic that naturally captures structured interactions. These logical foundations provide an elegant framework to specify and (static…

    Submitted 22 August, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Preprint to appear in JLAMP; revised/extended version of https://doi.org/10.4204/EPTCS.314.1

  34. arXiv:2401.08251  [pdf]

    cs.GT econ.GN eess.SY

    A techno-economic model for avoiding conflicts of interest between owners of offshore wind farms and maintenance suppliers

    Authors: Alberto Pliego Marugán, Fausto Pedro García Márquez, Jesús María Pinar Pérez

    Abstract: Currently, wind energy is one of the most important sources of renewable energy. Offshore locations for wind turbines are increasingly exploited because of their numerous advantages. However, offshore wind farms require high investment in maintenance service. Due to its complexity and special requirements, maintenance service is usually outsourced by wind farm owners. In this paper, we propose a n…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Published in Renewable and Sustainable Energy Reviews (ELSEVIER) 10 July 2022. DOI: https://doi.org/10.1016/j.rser.2022.112753 Cite as: Marugán, A. P., Márquez, F. P. G., & Pérez, J. M. P. (2022). A techno-economic model for avoiding conflicts of interest between owners of offshore wind farms and maintenance suppliers. Renewable and Sustainable Energy Reviews, 168, 112753

  35. arXiv:2312.12487  [pdf, other]

    cs.LG cs.AI

    Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

    Authors: Angela Castillo, Jonas Kohler, Juan C. Pérez, Juan Pablo Pérez, Albert Pumarola, Bernard Ghanem, Pablo Arbeláez, Ali Thabet

    Abstract: This paper presents a comprehensive study on the role of Classifier-Free Guidance (CFG) in text-conditioned diffusion models from the perspective of inference efficiency. In particular, we relax the default choice of applying CFG in all diffusion steps and instead search for efficient guidance policies. We formulate the discovery of such policies in the differentiable Neural Architecture Search fr…

    Submitted 19 December, 2023; originally announced December 2023.

  36. Split and Rephrase with Large Language Models

    Authors: David Ponce, Thierry Etchegoyhen, Jesús Calleja Pérez, Harritxu Gete

    Abstract: The Split and Rephrase (SPRP) task, which consists in splitting complex sentences into a sequence of shorter grammatical sentences, while preserving the original meaning, can facilitate the processing of complex texts for humans and machines alike. It is also a valuable testbed to evaluate natural language processing models, as it requires modelling complex grammatical aspects. In this work, we ev…

    Submitted 3 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  37. arXiv:2311.07894  [pdf]

    cs.CR

    Security in Drones

    Authors: Jonathan Morgan, Julio Perez, Jordan Wade, Sundar Krishnan

    Abstract: Drones are used in our everyday world for private, commercial, and government uses. It is important to establish both the cyber threats drone users face and security practices to combat those threats. Privacy will always be the main concern when using drones. Protecting information legally collected on drones and protecting people from the illegal collection of their data are topics that security…

    Submitted 13 November, 2023; originally announced November 2023.

  38. arXiv:2310.19075  [pdf, other]

    cs.LG cs.AI cs.CV

    Bespoke Solvers for Generative Flow Models

    Authors: Neta Shaul, Juan Perez, Ricky T. Q. Chen, Ali Thabet, Albert Pumarola, Yaron Lipman

    Abstract: Diffusion or flow-based models are powerful generative paradigms that are notoriously hard to sample as samples are defined as solutions to high-dimensional Ordinary or Stochastic Differential Equations (ODEs/SDEs) which require a large Number of Function Evaluations (NFE) to approximate well. Existing methods to alleviate the costly sampling process include model distillation and designing dedica…

    Submitted 29 October, 2023; originally announced October 2023.

  39. arXiv:2310.07173  [pdf]

    quant-ph cs.ET

    Unleashing quantum algorithms with Qinterpreter: bridging the gap between theory and practice across leading quantum computing platforms

    Authors: Wilmer Contreras Sepúlveda, Ángel David Torres-Palencia, José Javier Sánchez Mondragón, Braulio Misael Villegas-Martínez, J. Jesús Escobedo-Alatorre, Sandra Gesing, Néstor Lozano-Crisóstomo, Julio César García-Melgarejo, Juan Carlos Sánchez Pérez, Eddie Nelson Palacios-Pérez, Omar PalilleroSandoval

    Abstract: Quantum computing is a rapidly emerging and promising field that has the potential to revolutionize numerous research domains, including drug design, network technologies and sustainable energy. Due to the inherent complexity and divergence from classical computing, several major quantum computing libraries have been developed to implement quantum algorithms, namely IBM Qiskit, Amazon Braket, Cirq…

    Submitted 16 October, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Final article submitted to the PeerJ Computer Science journal

  40. Controlling the Solo12 Quadruped Robot with Deep Reinforcement Learning

    Authors: Michel Aractingi, Pierre-Alexandre Léziart, Thomas Flayols, Julien Perez, Tomi Silander, Philippe Souères

    Abstract: Quadruped robots require robust and general locomotion skills to exploit their mobility potential in complex and challenging environments. In this work, we present the first implementation of a robust end-to-end learning-based controller on the Solo12 quadruped. Our method is based on deep reinforcement learning of joint impedance references. The resulting control policies follow a commanded veloc…

    Submitted 2 August, 2023; originally announced September 2023.

    Report number: Rapport LAAS no 22263

    Journal ref: Scientific Reports, 2023, 13 (11945), pp.12

  41. arXiv:2309.08428  [pdf, other]

    cs.CY

    Virtual Harassment, Real Understanding: Using a Serious Game and Bayesian Networks to Study Cyberbullying

    Authors: Jaime Pérez, Mario Castro, Edmond Awad, Gregorio López

    Abstract: Cyberbullying among minors is a pressing concern in our digital society, necessitating effective prevention and intervention strategies. Traditional data collection methods often intrude on privacy and yield limited insights. This study explores an innovative approach, employing a serious game - designed with purposes beyond entertainment - as a non-intrusive tool for data collection and education…

    Submitted 15 September, 2023; originally announced September 2023.

  42. arXiv:2309.06046  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE

    BatMan-CLR: Making Few-shots Meta-Learners Resilient Against Label Noise

    Authors: Jeroen M. Galjaard, Robert Birke, Juan Perez, Lydia Y. Chen

    Abstract: The negative impact of label noise is well studied in classical supervised learning yet remains an open research question in meta-learning. Meta-learners aim to adapt to unseen learning tasks by learning a good initial model in meta-training and consecutively fine-tuning it according to new tasks during meta-testing. In this paper, we present the first extensive analysis of the impact of varying l…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 10 pages, 3 figures

  43. arXiv:2308.15075  [pdf, other]

    cs.NI

    Benchmarking 5G MEC and Cloud infrastructures for planning IoT messaging of CCAM data

    Authors: Felipe Mogollón, Zaloa Fernández, Josu Pérez, Ángel Martín

    Abstract: Vehicles embed lots of sensors supporting driving and safety. Combined with connectivity, they bring new possibilities for Connected, Cooperative and Automated Mobility (CCAM) services that exploit local and global data for a wide understanding beyond the myopic view of local sensors. Internet of Things (IoT) messaging solutions are ideal for vehicular data as they ship core features like the sepa…

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 6 pages, 5 figures, 6 tables, IEEE International Conference on Intelligent Transportation Systems

  44. arXiv:2308.02976  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Spanish Pre-trained BERT Model and Evaluation Data

    Authors: José Cañete, Gabriel Chaperon, Rodrigo Fuentes, Jou-Hui Ho, Hojin Kang, Jorge Pérez

    Abstract: The Spanish language is one of the top 5 spoken languages in the world. Nevertheless, finding resources to train or evaluate Spanish language models is not an easy task. In this paper we help bridge this gap by presenting a BERT-based language model pre-trained exclusively on Spanish data. As a second contribution, we also compiled several tasks specifically for the Spanish language in a single re…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Published as workshop paper at Practical ML for Developing Countries Workshop @ ICLR 2020

  45. arXiv:2308.02197  [pdf, other]

    cs.NI cs.DB

    Edge Dynamic Map architecture for C-ITS applications

    Authors: Mikel García, Gorka Velez, Josu Pérez, Ángel Martín, Zaloa Fernández, Naiara Aginako

    Abstract: Cooperative Intelligent Transport Systems (C-ITS) create, share and process massive amounts of data which needs to be real-time managed to enable new cooperative and autonomous driving applications. Vehicle-to-Everything (V2X) communications facilitate information exchange among vehicles and infrastructures using various protocols. By providing computer power, data storage, and low latency capabil…

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Accepted in the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)

  46. arXiv:2308.01165  [pdf, ps, other]

    cs.LO

    Termination in Concurrency, Revisited

    Authors: Joseph W. N. Paulus, Jorge A. Pérez, Daniele Nantes-Sobrinho

    Abstract: Termination is a central property in sequential programming models: a term is terminating if all its reduction sequences are finite. Termination is also important in concurrency in general, and for message-passing programs in particular. A variety of type systems that enforce termination by typing have been developed. In this paper, we rigorously compare several type systems for $π$-calculus proce…

    Submitted 2 August, 2023; originally announced August 2023.

  47. arXiv:2306.10985  [pdf, other]

    cs.CL cs.LG cs.RO

    LARG, Language-based Automatic Reward and Goal Generation

    Authors: Julien Perez, Denys Proux, Claude Roux, Michael Niemaz

    Abstract: Goal-conditioned and Multi-Task Reinforcement Learning (GCRL and MTRL) address numerous problems related to robot learning, including locomotion, navigation, and manipulation scenarios. Recent works focusing on language-defined robotic manipulation tasks have led to the tedious production of massive human annotations to create dataset of textual descriptions associated with trajectories. To levera…

    Submitted 19 June, 2023; originally announced June 2023.

  48. arXiv:2306.08904  [pdf, other]

    cs.CV

    Enhancing Neural Rendering Methods with Image Augmentations

    Authors: Juan C. Pérez, Sara Rojas, Jesus Zarzar, Bernard Ghanem

    Abstract: Faithfully reconstructing 3D geometry and generating novel views of scenes are critical tasks in 3D computer vision. Despite the widespread use of image augmentations across computer vision applications, their potential remains underexplored when learning neural rendering methods (NRMs) for 3D scenes. This paper presents a comprehensive analysis of the use of image augmentations in NRMs, where we…

    Submitted 15 June, 2023; originally announced June 2023.

  49. arXiv:2306.04204  [pdf, ps, other]

    cs.PL

    Monitoring Blackbox Implementations of Multiparty Session Protocols

    Authors: Bas van den Heuvel, Jorge A. Pérez, Rares A. Dobre

    Abstract: We present a framework for the distributed monitoring of networks of components that coordinate by message-passing, following multiparty session protocols specified as global types. We improve over prior works by (i) supporting components whose exact specification is unknown ("blackboxes") and (ii) covering protocols that cannot be analyzed by existing techniques. We first give a procedure for syn…

    Submitted 3 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Full version with appendices of our RV'23 paper

  50. Generation of Probabilistic Synthetic Data for Serious Games: A Case Study on Cyberbullying

    Authors: Jaime Pérez, Mario Castro, Edmond Awad, Gregorio López

    Abstract: Synthetic data generation has been a growing area of research in recent years. However, its potential applications in serious games have not been thoroughly explored. Advances in this field could anticipate data modelling and analysis, as well as speed up the development process. To try to fill this gap in the literature, we propose a simulator architecture for generating probabilistic synthetic d…

    Submitted 3 July, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Journal ref: Knowledge-Based Systems, Volume 286, 2024, pp. 111440