Skip to main content

Showing 1–50 of 73 results for author: Lopez, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.15822  [pdf, other

    eess.IV cs.CV cs.LG

    MambaStyle: Efficient StyleGAN Inversion for Real Image Editing with State-Space Models

    Authors: Jhon Lopez, Carlos Hinojosa, Henry Arguello, Bernard Ghanem

    Abstract: The task of inverting real images into StyleGAN's latent space to manipulate their attributes has been extensively studied. However, existing GAN inversion methods struggle to balance high reconstruction quality, effective editability, and computational efficiency. In this paper, we introduce MambaStyle, an efficient single-stage encoder-based approach for GAN inversion and editing that leverages… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  2. arXiv:2505.14074  [pdf, ps, other

    cs.HC cs.SD eess.AS

    Recreating Neural Activity During Speech Production with Language and Speech Model Embeddings

    Authors: Owais Mujtaba Khanday, Pablo Rodroguez San Esteban, Zubair Ahmad Lone, Marc Ouellet, Jose Andres Gonzalez Lopez

    Abstract: Understanding how neural activity encodes speech and language production is a fundamental challenge in neuroscience and artificial intelligence. This study investigates whether embeddings from large-scale, self-supervised language and speech models can effectively reconstruct high-gamma neural activity characteristics, key indicators of cortical processing, recorded during speech production. We le… ▽ More

    Submitted 21 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted for presentation at Interspeech2025

  3. arXiv:2505.03174  [pdf, other

    cs.RO cs.CV cs.LG

    Automated Data Curation Using GPS & NLP to Generate Instruction-Action Pairs for Autonomous Vehicle Vision-Language Navigation Datasets

    Authors: Guillermo Roque, Erika Maquiling, Jose Giovanni Tapia Lopez, Ross Greer

    Abstract: Instruction-Action (IA) data pairs are valuable for training robotic systems, especially autonomous vehicles (AVs), but having humans manually annotate this data is costly and time-inefficient. This paper explores the potential of using mobile application Global Positioning System (GPS) references and Natural Language Processing (NLP) to automatically generate large volumes of IA commands and resp… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  4. arXiv:2505.02874  [pdf, other

    cs.LG cs.AI

    Uncertainty Quantification for Machine Learning in Healthcare: A Survey

    Authors: L. Julián Lechuga López, Shaza Elsharief, Dhiyaa Al Jorf, Firas Darwish, Congbo Ma, Farah E. Shamout

    Abstract: Uncertainty Quantification (UQ) is pivotal in enhancing the robustness, reliability, and interpretability of Machine Learning (ML) systems for healthcare, optimizing resources and improving patient care. Despite the emergence of ML-based clinical decision support tools, the lack of principled quantification of uncertainty in ML models remains a major challenge. Current reviews have a narrow focus… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

    Comments: 46 pages, 3 figures, 2 tables, AHLI Conference on Health, Inference, and Learning (CHIL)

  5. arXiv:2503.15282  [pdf, other

    cs.SE

    SENAI: Towards Software Engineering Native Generative Artificial Intelligence

    Authors: Mootez Saad, José Antonio Hernández López, Boqi Chen, Neil Ernst, Dániel Varró, Tushar Sharma

    Abstract: Large Language Models have significantly advanced the field of code generation, demonstrating the ability to produce functionally correct code snippets. However, advancements in generative AI for code overlook foundational Software Engineering (SE) principles such as modularity, and single responsibility, and concepts such as cohesion and coupling which are critical for creating maintainable, scal… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 5 pages, 1 figure

  6. arXiv:2501.10471  [pdf, other

    cs.LG q-bio.QM stat.ML

    Village-Net Clustering: A Rapid approach to Non-linear Unsupervised Clustering of High-Dimensional Data

    Authors: Aditya Ballal, Esha Datta, Gregory A. DePaul, Erik Carlsson, Ye Chen-Izu, Javier E. López, Leighton T. Izu

    Abstract: Clustering large high-dimensional datasets with diverse variable is essential for extracting high-level latent information from these datasets. Here, we developed an unsupervised clustering algorithm, we call "Village-Net". Village-Net is specifically designed to effectively cluster high-dimension data without priori knowledge on the number of existing clusters. The algorithm operates in two phase… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

    Comments: Software available at https://villagenet.streamlit.app/

  7. arXiv:2411.18157  [pdf, other

    cs.CL cs.AI

    A survey on cutting-edge relation extraction techniques based on language models

    Authors: Jose A. Diaz-Garcia, Julio Amador Diaz Lopez

    Abstract: This comprehensive survey delves into the latest advancements in Relation Extraction (RE), a pivotal task in natural language processing essential for applications across biomedical, financial, and legal sectors. This study highlights the evolution and current state of RE techniques by analyzing 137 papers presented at the Association for Computational Linguistics (ACL) conferences over the past f… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: 50 pages, under review in Artificial Intelligence Review

  8. arXiv:2411.16795  [pdf, other

    cs.SE

    Why do Machine Learning Notebooks Crash? An Empirical Study on Public Python Jupyter Notebooks

    Authors: Yiran Wang, Willem Meijer, José Antonio Hernández López, Ulf Nilsson, Dániel Varró

    Abstract: Jupyter notebooks have become central in data science, integrating code, text and output in a flexible environment. With the rise of machine learning (ML), notebooks are increasingly used for prototyping and data analysis. However, due to their dependence on complex ML libraries and the flexible notebook semantics that allow cells to be run in any order, notebooks are susceptible to software bugs… ▽ More

    Submitted 27 May, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

  9. arXiv:2411.15409  [pdf, other

    cs.AR

    Exploring the Sparsity-Quantization Interplay on a Novel Hybrid SNN Event-Driven Architecture

    Authors: Ilkin Aliyev, Jesus Lopez, Tosiron Adegbija

    Abstract: Spiking Neural Networks (SNNs) offer potential advantages in energy efficiency but currently trail Artificial Neural Networks (ANNs) in versatility, largely due to challenges in efficient input encoding. Recent work shows that direct coding achieves superior accuracy with fewer timesteps than traditional rate coding. However, there is a lack of specialized hardware to fully exploit the potential o… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

    Journal ref: Design, Automation and Test in Europe Conference 2025

  10. arXiv:2411.15368  [pdf, other

    cs.SE cs.LG cs.PL

    The Power of Types: Exploring the Impact of Type Checking on Neural Bug Detection in Dynamically Typed Languages

    Authors: Boqi Chen, José Antonio Hernández López, Gunter Mussbacher, Dániel Varró

    Abstract: Motivation: Automated bug detection in dynamically typed languages such as Python is essential for maintaining code quality. The lack of mandatory type annotations in such languages can lead to errors that are challenging to identify early with traditional static analysis tools. Recent progress in deep neural networks has led to increased use of neural bug detectors. In statically typed languages,… ▽ More

    Submitted 16 January, 2025; v1 submitted 22 November, 2024; originally announced November 2024.

    Comments: Accepted by ICSE'25 Research Track

  11. Omics-driven hybrid dynamic modeling of bioprocesses with uncertainty estimation

    Authors: Sebastián Espinel-Ríos, José Montaño López, José L. Avalos

    Abstract: This work presents an omics-driven modeling pipeline that integrates machine-learning tools to facilitate the dynamic modeling of multiscale biological systems. Random forests and permutation feature importance are proposed to mine omics datasets, guiding feature selection and dimensionality reduction for dynamic modeling. Continuous and differentiable machine-learning functions can be trained to… ▽ More

    Submitted 15 December, 2024; v1 submitted 24 October, 2024; originally announced October 2024.

  12. arXiv:2409.02290  [pdf, other

    cs.RO cs.CV eess.IV

    Unsupervised Welding Defect Detection Using Audio And Video

    Authors: Georg Stemmer, Jose A. Lopez, Juan A. Del Hoyo Ontiveros, Arvind Raju, Tara Thimmanaik, Sovan Biswas

    Abstract: In this work we explore the application of AI to robotic welding. Robotic welding is a widely used technology in many industries, but robots currently do not have the capability to detect welding defects which get introduced due to various reasons in the welding process. We describe how deep-learning methods can be applied to detect weld defects in real-time by recording the welding process with m… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 21 pages

  13. arXiv:2408.11154  [pdf, other

    cs.IT

    Tight entropy bound based on p-quasinorms

    Authors: Juan Pablo Lopez

    Abstract: In the present paper we prove a family of tight upper and lower bounds for the Shannon entropy and von Neumann entropy based on the p-norms. This allows us to have an entropy estimate, a criterion for the finiteness of it and a bound on the difference of entropy, additionally, we did some numerical tests that show the efficiency of our approximations.

    Submitted 20 August, 2024; originally announced August 2024.

  14. ALPINE: An adaptive language-agnostic pruning method for language models for code

    Authors: Mootez Saad, José Antonio Hernández López, Boqi Chen, Dániel Varró, Tushar Sharma

    Abstract: Language models of code have demonstrated state-of-the-art performance across various software engineering and source code analysis tasks. However, their demanding computational resource requirements and consequential environmental footprint remain as significant challenges. This work introduces ALPINE, an adaptive programming language-agnostic pruning technique designed to substantially reduce th… ▽ More

    Submitted 10 February, 2025; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to the The ACM International Conference on the Foundations of Software Engineering (FSE) (FSE 2025)

  15. arXiv:2405.09305  [pdf, other

    cs.LG

    Gradient Boosted Filters For Signal Processing

    Authors: Jose A. Lopez, Georg Stemmer, Hector A. Cordourier

    Abstract: Gradient boosted decision trees have achieved remarkable success in several domains, particularly those that work with static tabular data. However, the application of gradient boosted models to signal processing is underexplored. In this work, we introduce gradient boosted filters for dynamic data, by employing Hammerstein systems in place of decision trees. We discuss the relationship of our app… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 9 pages, 12 figures. Submitted to ICML 2024 and subsequently rejected for insufficient evaluation

  16. Privacy-Preserving Deep Learning Using Deformable Operators for Secure Task Learning

    Authors: Fabian Perez, Jhon Lopez, Henry Arguello

    Abstract: In the era of cloud computing and data-driven applications, it is crucial to protect sensitive information to maintain data privacy, ensuring truly reliable systems. As a result, preserving privacy in deep learning systems has become a critical concern. Existing methods for privacy preservation rely on image encryption or perceptual transformation approaches. However, they often suffer from reduce… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: copyright 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  17. Improvement of Performance in Freezing of Gait detection in Parkinsons Disease using Transformer networks and a single waist worn triaxial accelerometer

    Authors: Luis Sigcha, Luigi Borzì, Ignacio Pavón, Nélson Costa, Susana Costa, Pedro Arezes, Juan-Manuel López, Guillermo De Arcas

    Abstract: Freezing of gait (FOG) is one of the most incapacitating symptoms in Parkinsons disease, affecting more than 50 percent of patients in advanced stages of the disease. The presence of FOG may lead to falls and a loss of independence with a consequent reduction in the quality of life. Wearable technology and artificial intelligence have been used for automatic FOG detection to optimize monitoring. H… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Journal ref: Engineering Applications of Artificial Intelligence Volume 116, November 2022, 105482

  18. arXiv:2404.00777  [pdf, other

    cs.CV cs.AI cs.CR cs.LG eess.IV

    Privacy-preserving Optics for Enhancing Protection in Face De-identification

    Authors: Jhon Lopez, Carlos Hinojosa, Henry Arguello, Bernard Ghanem

    Abstract: The modern surge in camera usage alongside widespread computer vision technology applications poses significant privacy and security concerns. Current artificial intelligence (AI) technologies aid in recognizing relevant events and assisting in daily tasks in homes, offices, hospitals, etc. The need to access or process personal information for these purposes raises privacy concerns. While softwar… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024. Project Website and Code coming soon

  19. Implementation of the Principal Component Analysis onto High-Performance Computer Facilities for Hyperspectral Dimensionality Reduction: Results and Comparisons

    Authors: E. Martel, R. Lazcano, J. Lopez, D. Madroñal, R. Salvador, S. Lopez, E. Juarez, R. Guerra, C. Sanz, R. Sarmiento

    Abstract: Dimensionality reduction represents a critical preprocessing step in order to increase the efficiency and the performance of many hyperspectral imaging algorithms. However, dimensionality reduction algorithms, such as the Principal Component Analysis (PCA), suffer from their computationally demanding nature, becoming advisable for their implementation onto high-performance computer architectures f… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 30 pages, 10 figures

  20. arXiv:2402.05739  [pdf, other

    physics.soc-ph cs.MA q-bio.PE

    Critical mobility in policy making for epidemic containment

    Authors: Jesús A. Moreno López, Sandro Meloni, Jose J. Ramasco

    Abstract: When considering airborne epidemic spreading in social systems, a natural connection arises between mobility and epidemic contacts. As individuals travel, possibilities to encounter new people either at the final destination or during the transportation process appear. Such contacts can lead to new contagion events. In fact, mobility has been a crucial target for early non-pharmaceutical containme… ▽ More

    Submitted 9 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures

  21. arXiv:2401.07930  [pdf, other

    cs.SE

    On Inter-dataset Code Duplication and Data Leakage in Large Language Models

    Authors: José Antonio Hernández López, Boqi Chen, Mootez Saaz, Tushar Sharma, Dániel Varró

    Abstract: Motivation. Large language models (LLMs) have exhibited remarkable proficiency in diverse software engineering (SE) tasks. Handling such tasks typically involves acquiring foundational coding knowledge on large, general-purpose datasets during a pre-training phase, and subsequently refining on smaller, task-specific datasets as part of a fine-tuning phase. Problem statement. While intra-dataset… ▽ More

    Submitted 1 August, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  22. CAD-compatible structural shape optimization with a movable Bézier tetrahedral mesh

    Authors: Jorge López, Cosmin Anitescu, Timon Rabczuk

    Abstract: This paper presents the development of a complete CAD-compatible framework for structural shape optimization in 3D. The boundaries of the domain are described using NURBS while the interior is discretized with Bézier tetrahedra. The tetrahedral mesh is obtained from the mesh generator software Gmsh. A methodology to reconstruct the NURBS surfaces from the triangular faces of the boundary mesh is p… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

    Journal ref: Computer Methods in Applied Mechanics and Engineering, Volume 367, 2020, 113066

  23. arXiv:2312.14053  [pdf, other

    cs.CV

    Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation

    Authors: Rasha Alshawi, Md Tamjidul Hoque, Md Meftahul Ferdaus, Mahdi Abdelguerfi, Kendall Niles, Ken Prathak, Joe Tom, Jordan Klein, Murtada Mousa, Johny Javier Lopez

    Abstract: The proposed architecture, Dual Attentive U-Net with Feature Infusion (DAU-FI Net), addresses challenges in semantic segmentation, particularly on multiclass imbalanced datasets with limited samples. DAU-FI Net integrates multiscale spatial-channel attention mechanisms and feature injection to enhance precision in object localization. The core employs a multiscale depth-separable convolution block… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: under review in IEEE Transactions on Artificial Intelligence

  24. arXiv:2312.00794  [pdf, other

    cs.CV cs.AI cs.CY cs.LG stat.AP

    Informative Priors Improve the Reliability of Multimodal Clinical Data Classification

    Authors: L. Julian Lechuga Lopez, Tim G. J. Rudner, Farah E. Shamout

    Abstract: Machine learning-aided clinical decision support has the potential to significantly improve patient care. However, existing efforts in this domain for principled quantification of uncertainty have largely been limited to applications of ad-hoc solutions that do not consistently improve reliability. In this work, we consider stochastic neural networks and design a tailor-made multimodal data-driven… ▽ More

    Submitted 16 November, 2023; originally announced December 2023.

    Comments: Published in ML4H 2023 Findings Track Collection

  25. arXiv:2309.08033  [pdf, other

    cs.CV

    Depth Estimation from a Single Optical Encoded Image using a Learned Colored-Coded Aperture

    Authors: Jhon Lopez, Edwin Vargas, Henry Arguello

    Abstract: Depth estimation from a single image of a conventional camera is a challenging task since depth cues are lost during the acquisition process. State-of-the-art approaches improve the discrimination between different depths by introducing a binary-coded aperture (CA) in the lens aperture that generates different coded blur patterns at different depths. Color-coded apertures (CCA) can also produce co… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  26. arXiv:2307.11879  [pdf, other

    cs.NI cs.CR

    Software defined networking flow admission and routing under minimal security constraints

    Authors: Jorge López, Charalampos Chatzinakis, Marc Cartigny, Claude Poletti

    Abstract: In recent years, computer networks and telecommunications in general have been shifting paradigms to adopt software-centric approaches. Software Defined Networking (SDN) is one of such paradigms that centralizes control and intelligent applications can be defined on top of this architecture. The latter enables the definition of the network behavior by means of software. In this work, we propose an… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 8 pages, 10 figures, as submitted to TRUSTCOM23

  27. arXiv:2306.17747  [pdf, other

    cs.MA cs.AI math.DS math.OC nlin.AO

    Discriminatory or Samaritan -- which AI is needed for humanity? An Evolutionary Game Theory Analysis of Hybrid Human-AI populations

    Authors: Tim Booker, Manuel Miranda, Jesús A. Moreno López, José María Ramos Fernández, Max Reddel, Valeria Widler, Filippo Zimmaro, Alberto Antonioni, The Anh Han

    Abstract: As artificial intelligence (AI) systems are increasingly embedded in our lives, their presence leads to interactions that shape our behaviour, decision-making, and social interactions. Existing theoretical research has primarily focused on human-to-human interactions, overlooking the unique dynamics triggered by the presence of AI. In this paper, resorting to methods from evolutionary game theory,… ▽ More

    Submitted 3 July, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: This work is the result of the Complexity72h 2023 workshop

  28. OWAdapt: An adaptive loss function for deep learning using OWA operators

    Authors: Sebastián Maldonado, Carla Vairetti, Katherine Jara, Miguel Carrasco, Julio López

    Abstract: In this paper, we propose a fuzzy adaptive loss function for enhancing deep learning performance in classification tasks. Specifically, we redefine the cross-entropy loss to effectively address class-level noise conditions, including the challenging problem of class imbalance. Our approach introduces aggregation operators, leveraging the power of fuzzy logic to improve classification accuracy. The… ▽ More

    Submitted 12 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: 15 pages, 1 figure, published

    Journal ref: Knowledge-based Systems 280, 111022 (2023)

  29. arXiv:2303.18060  [pdf

    cs.LG cs.AI eess.SY

    NOSTROMO: Lessons learned, conclusions and way forward

    Authors: Mayte Cano, Andrés Perillo, Juan Antonio López, Faustino Tello, Javier Poveda, Francisco Câmara, Francisco Antunes, Christoffer Riis, Ian Crook, Abderrazak Tibichte, Sandrine Molton, David Mocholí, Ricardo Herranz, Gérald Gurtner, Tatjana Bolić, Andrew Cook, Jovana Kuljanin, Xavier Prats

    Abstract: This White Paper sets out to explain the value that metamodelling can bring to air traffic management (ATM) research. It will define metamodelling and explore what it can, and cannot, do. The reader is assumed to have basic knowledge of SESAR: the Single European Sky ATM Research project. An important element of SESAR, as the technological pillar of the Single European Sky initiative, is to bring… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: White Paper of the NOSTROMO, an exploratory research project funded by the SESAR Joint Undertaking (SJU) under the European Union's Horizon 2020 research and innovation programme

  30. Privacy-preserving machine learning for healthcare: open challenges and future perspectives

    Authors: Alejandro Guerra-Manzanares, L. Julian Lechuga Lopez, Michail Maniatakos, Farah E. Shamout

    Abstract: Machine Learning (ML) has recently shown tremendous success in modeling various healthcare prediction tasks, ranging from disease diagnosis and prognosis to patient treatment. Due to the sensitive nature of medical data, privacy must be considered along the entire ML pipeline, from model training to inference. In this paper, we conduct a review of recent literature concerning Privacy-Preserving Ma… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: ICLR 2023 Workshop on Trustworthy Machine Learning for Healthcare (TML4H)

    Journal ref: Trustworthy Machine Learning for Healthcare. TML4H 2023. Lecture Notes in Computer Science, vol 13932

  31. arXiv:2302.10669  [pdf

    cs.LG cs.AI cs.RO

    UAV Path Planning Employing MPC- Reinforcement Learning Method Considering Collision Avoidance

    Authors: Mahya Ramezani, Hamed Habibi, Jose luis Sanchez Lopez, Holger Voos

    Abstract: In this paper, we tackle the problem of Unmanned Aerial (UA V) path planning in complex and uncertain environments by designing a Model Predictive Control (MPC), based on a Long-Short-Term Memory (LSTM) network integrated into the Deep Deterministic Policy Gradient algorithm. In the proposed solution, LSTM-MPC operates as a deterministic policy within the DDPG network, and it leverages a predictin… ▽ More

    Submitted 7 March, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  32. Cache Placement in an NDN Based LEO Satellite Network Constellation

    Authors: Miguel Rodríguez-Pérez, Sergio Herrería-Alonso, Andrés Suárez-González, José Carlos López-Ardao, Raúl Rodríguez-Rubio

    Abstract: The efforts to replace the successful, albeit aging, TCP/IP Internet architecture with a better suited one have driving research interest to information-centric alternatives. The Named Data Networking (NDN) architecture is probably one of the main contenders to become the network layer of the future Internet thanks to its inbuilt support for mobility, in-network caching, security and, in general,… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

    Comments: 9 pages, 5 figures

    Journal ref: IEEE Transactions on Aerospace and Electronic Systems, vol. 59, no. 4, August 2023, pp. 3579-3587

  33. arXiv:2211.01226  [pdf, other

    cs.CV

    DEArt: Dataset of European Art

    Authors: Artem Reshetnikov, Maria-Cristina Marinescu, Joaquim More Lopez

    Abstract: Large datasets that were made publicly available to the research community over the last 20 years have been a key enabling factor for the advances in deep learning algorithms for NLP or computer vision. These datasets are generally pairs of aligned image / manually annotated metadata, where images are photographs of everyday life. Scholarly and historical content, on the other hand, treat subjects… ▽ More

    Submitted 3 November, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: VISART VI. Workshop at the European Conference of Computer Vision (ECCV)

  34. arXiv:2210.00508  [pdf, ps, other

    math.CO cs.DM

    The lexicographically least square-free word with a given prefix

    Authors: Siddharth Berera, Andrés Gómez-Colunga, Joey Lakerdas-Gayle, John López, Mauditra Matin, Daniel Roebuck, Eric Rowland, Noam Scully, Juliet Whidden

    Abstract: The lexicographically least square-free infinite word on the alphabet of non-negative integers with a given prefix $p$ is denoted $L(p)$. When $p$ is the empty word, this word was shown by Guay-Paquet and Shallit to be the ruler sequence. For other prefixes, the structure is significantly more complicated. In this paper, we show that $L(p)$ reflects the structure of the ruler sequence for several… ▽ More

    Submitted 2 November, 2022; v1 submitted 2 October, 2022; originally announced October 2022.

    Journal ref: The Electronic Journal of Combinatorics 30 (2023) #P3.11 (43 pages)

  35. arXiv:2206.11719  [pdf, other

    cs.CL cs.AI cs.LG cs.PL cs.SE

    AST-Probe: Recovering abstract syntax trees from hidden representations of pre-trained language models

    Authors: José Antonio Hernández López, Martin Weyssow, Jesús Sánchez Cuadrado, Houari Sahraoui

    Abstract: The objective of pre-trained language models is to learn contextual representations of textual data. Pre-trained language models have become mainstream in natural language processing and code modeling. Using probes, a technique to study the linguistic properties of hidden vector spaces, previous works have shown that these pre-trained language models encode simple linguistic properties in their hi… ▽ More

    Submitted 10 September, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

  36. arXiv:2205.10469  [pdf, other

    cs.CV cs.AI cs.LG

    Masterful: A Training Platform for Computer Vision Models

    Authors: Samuel Wookey, Yaoshiang Ho, Tom Rikert, Juan David Gil Lopez, Juan Manuel Muñoz Beancur, Santiago Cortes, Ray Tawil, Aaron Sabin, Jack Lynch, Travis Harper, Nikhil Gajendrakumar

    Abstract: Masterful is a software platform to train deep learning computer vision models. Data and model architecture are inputs to the platform, and the output is a trained model. The platform's primary goal is to maximize a trained model's accuracy, which it achieves through its regularization and semi-supervised learning implementations. The platform's secondary goal is to minimize the amount of manual e… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  37. EndoMapper dataset of complete calibrated endoscopy procedures

    Authors: Pablo Azagra, Carlos Sostres, Ángel Ferrandez, Luis Riazuelo, Clara Tomasini, Oscar León Barbed, Javier Morlana, David Recasens, Victor M. Batlle, Juan J. Gómez-Rodríguez, Richard Elvira, Julia López, Cristina Oriol, Javier Civera, Juan D. Tardós, Ana Cristina Murillo, Angel Lanas, José M. M. Montiel

    Abstract: Computer-assisted systems are becoming broadly used in medicine. In endoscopy, most research focuses on the automatic detection of polyps or other pathologies, but localization and navigation of the endoscope are completely performed manually by physicians. To broaden this research and bring spatial Artificial Intelligence to endoscopies, data from complete procedures is needed. This paper introdu… ▽ More

    Submitted 10 October, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 17 pages, 14 figures, 8 tables

    Journal ref: Sci Data 10, 671 (2023)

  38. Survey on Federated Learning Threats: concepts, taxonomy on attacks and defences, experimental study and challenges

    Authors: Nuria Rodríguez-Barroso, Daniel Jiménez López, M. Victoria Luzón, Francisco Herrera, Eugenio Martínez-Cámara

    Abstract: Federated learning is a machine learning paradigm that emerges as a solution to the privacy-preservation demands in artificial intelligence. As machine learning, federated learning is threatened by adversarial attacks against the integrity of the learning model and the privacy of data via a distributed approach to tackle local and global learning. This weak point is exacerbated by the inaccessibil… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Journal ref: Information Fusion (2022)

  39. arXiv:2108.11220  [pdf, ps, other

    cs.SE cs.LG

    Toward Formal Data Set Verification for Building Effective Machine Learning Models

    Authors: Jorge López, Maxime Labonne, Claude Poletti

    Abstract: In order to properly train a machine learning model, data must be properly collected. To guarantee a proper data collection, verifying that the collected data set holds certain properties is a possible solution. For example, guaranteeing that the data set contains samples across the whole input space, or that the data set is balanced w.r.t. different classes. We present a formal approach for verif… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: Preprint submitted to IC3K 2021

  40. arXiv:2107.07630  [pdf, other

    cs.AI cs.HC

    Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi

    Authors: Ho Chit Siu, Jaime D. Pena, Edenna Chen, Yutai Zhou, Victor J. Lopez, Kyle Palko, Kimberlee C. Chang, Ross E. Allen

    Abstract: Deep reinforcement learning has generated superhuman AI in competitive games such as Go and StarCraft. Can similar learning techniques create a superior AI teammate for human-machine collaborative games? Will humans prefer AI teammates that improve objective team performance or those that improve subjective metrics of trust? In this study, we perform a single-blind evaluation of teams of humans an… ▽ More

    Submitted 21 October, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at NeurIPS 2021

  41. arXiv:2107.07217  [pdf, other

    cs.NI cs.SE

    Dynamic Link Network Emulation: a Model-based Design

    Authors: Erick Petersen, Jorge López, Natalia Kushik, Claude Poletti, Djamal Zeghlache

    Abstract: This paper presents the design and architecture of a network emulator whose links' parameters (such as delay and bandwidth) vary at different time instances. The emulator can thus be used in order to test and evaluate novel solutions for such networks, before their final deployment. To achieve this goal, different existing technologies are carefully combined to emulate link dynamicity, automatic t… ▽ More

    Submitted 4 March, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: 8 pages, as submitted to the 17th International Conference on Evaluation of Novel Approaches to Software Engineering

  42. arXiv:2105.14276  [pdf, other

    cs.CY cs.AI cs.CL cs.LG

    Correcting public opinion trends through Bayesian data assimilation

    Authors: Robin Hendrickx, Rossella Arcucci, Julio Amador Dıaz Lopez, Yi-Ke Guo, Mark Kennedy

    Abstract: Measuring public opinion is a key focus during democratic elections, enabling candidates to gauge their popularity and alter their campaign strategies accordingly. Traditional survey polling remains the most popular estimation technique, despite its cost and time intensity, measurement errors, lack of real-time capabilities and lagged representation of public opinion. In recent years, Twitter opin… ▽ More

    Submitted 29 May, 2021; originally announced May 2021.

  43. arXiv:2105.14144  [pdf

    nlin.AO cs.CY

    A mechanism of Individualistic Indirect Reciprocity with internal and external dynamics

    Authors: Mario Ignacio González Silva, Ricardo Armando González Silva, Héctor Alfonso Juárez López, Antonio Aguilera Ontiveros

    Abstract: The cooperation mechanism of indirect reciprocity has been studied by making multiple variations of its parts. This research proposes a new variant of Nowak and Sigmund model, focused on agents' attitude; it is called Individualistic Indirect Reciprocity. In our model, an agent reinforces its strategy to the extent to which it makes a profit. We also include conditions related to the environment,… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: 24 pages

    MSC Class: I.6

  44. arXiv:2101.05339  [pdf, other

    cond-mat.mtrl-sci cs.AI cs.LG

    Accelerating amorphous polymer electrolyte screening by learning to reduce errors in molecular dynamics simulated properties

    Authors: Tian Xie, Arthur France-Lanord, Yanming Wang, Jeffrey Lopez, Michael Austin Stolberg, Megan Hill, Graham Michael Leverick, Rafael Gomez-Bombarelli, Jeremiah A. Johnson, Yang Shao-Horn, Jeffrey C. Grossman

    Abstract: Polymer electrolytes are promising candidates for the next generation lithium-ion battery technology. Large scale screening of polymer electrolytes is hindered by the significant cost of molecular dynamics (MD) simulation in amorphous systems: the amorphous structure of polymers requires multiple, repeated sampling to reduce noise and the slow relaxation requires long simulation time for convergen… ▽ More

    Submitted 15 March, 2022; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: 29 pages, 6 figures + supplementary information

    Journal ref: Nature communications 13.1 (2022): 1-10

  45. Short-Term Flow-Based Bandwidth Forecasting using Machine Learning

    Authors: Maxime Labonne, Jorge López, Claude Poletti, Jean-Baptiste Munier

    Abstract: This paper proposes a novel framework to predict traffic flows' bandwidth ahead of time. Modern network management systems share a common issue: the network situation evolves between the moment the decision is made and the moment when actions (countermeasures) are applied. This framework converts packets from real-life traffic into flows containing relevant features. Machine learning models, inclu… ▽ More

    Submitted 3 December, 2020; v1 submitted 29 November, 2020; originally announced November 2020.

    Comments: 4 pages, 1 figure 3 tables

  46. arXiv:2011.02559  [pdf, ps, other

    cs.LG cs.AI

    Adaptive Stress Testing of Trajectory Predictions in Flight Management Systems

    Authors: Robert J. Moss, Ritchie Lee, Nicholas Visser, Joachim Hochwarth, James G. Lopez, Mykel J. Kochenderfer

    Abstract: To find failure events and their likelihoods in flight-critical systems, we investigate the use of an advanced black-box stress testing approach called adaptive stress testing. We analyze a trajectory predictor from a developmental commercial flight management system which takes as input a collection of lateral waypoints and en-route environmental conditions. Our aim is to search for failure event… ▽ More

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: 10 pages, 10 figures, 6 algorithms. Digital Avionics Systems Conference (DASC) 2020

  47. arXiv:2010.10618  [pdf, other

    cs.LG cs.AI eess.SY

    Runtime Safety Assurance Using Reinforcement Learning

    Authors: Christopher Lazarus, James G. Lopez, Mykel J. Kochenderfer

    Abstract: The airworthiness and safety of a non-pedigreed autopilot must be verified, but the cost to formally do so can be prohibitive. We can bypass formal verification of non-pedigreed components by incorporating Runtime Safety Assurance (RTSA) as mechanism to ensure safety. RTSA consists of a meta-controller that observes the inputs and outputs of a non-pedigreed component and verifies formally specifie… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Journal ref: 2020 IEEE/AIAA 39th Digital Avionics Systems Conference (DASC)

  48. arXiv:2009.10822  [pdf, other

    cs.NI

    Priority Flow Admission and Routing in SDN: Exact and Heuristic Approaches

    Authors: Jorge López, Maxime Labonne, Claude Poletti, Dallal Belabed

    Abstract: This paper proposes a novel admission and routing scheme which takes into account arbitrarily assigned priorities for network flows. The presented approach leverages the centralized Software Defined Networking (SDN) capabilities in order to do so. Exact and heuristic approaches to the stated Priority Flow Admission and Routing (PFAR) problem are provided. The exact approach which provides an optim… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: As submitted to NCA 2020

  49. arXiv:2009.10051  [pdf, ps, other

    cs.SE

    On using SMT-solvers for Modeling and Verifying Dynamic Network Emulators

    Authors: Erick Petersen, Jorge López, Natalia Kushik, Claude Poletti, Djamal Zeghlache

    Abstract: A novel model-based approach to verify dynamic networks is proposed; the approach consists in formally describing the network topology and dynamic link parameters. A many sorted first order logic formula is constructed to check the model with respect to a set of properties. The network consistency is verified using an SMT-solver, and the formula is used for the run-time network verification when a… ▽ More

    Submitted 13 October, 2020; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: New version with experimental results submitted to NCA 2020 as a work in progress paper

  50. arXiv:2008.11858  [pdf, other

    cs.SE cs.IR

    MAR: A structure-based search engine for models

    Authors: José Antonio Hernández López, Jesús Sánchez Cuadrado

    Abstract: The availability of shared software models provides opportunities for reusing, adapting and learning from them. Public models are typically stored in a variety of locations, including model repositories, regular source code repositories, web pages, etc. To profit from them developers need effective search mechanisms to locate the models relevant for their tasks. However, to date, there has been li… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.