Skip to main content

Showing 1–12 of 12 results for author: Alonso, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.10192  [pdf, other

    cs.SE cs.CL

    Red Teaming Contemporary AI Models: Insights from Spanish and Basque Perspectives

    Authors: Miguel Romero-Arjona, Pablo Valle, Juan C. Alonso, Ana B. Sánchez, Miriam Ugarte, Antonia Cazalilla, Vicente Cambrón, José A. Parejo, Aitor Arrieta, Sergio Segura

    Abstract: The battle for AI leadership is on, with OpenAI in the United States and DeepSeek in China as key contenders. In response to these global trends, the Spanish government has proposed ALIA, a public and transparent AI infrastructure incorporating small language models designed to support Spanish and co-official languages such as Basque. This paper presents the results of Red Teaming sessions, where… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

  2. arXiv:2410.10609  [pdf, other

    cs.LG stat.ML

    Lambda-Skip Connections: the architectural component that prevents Rank Collapse

    Authors: Federico Arangath Joseph, Jerome Sieber, Melanie N. Zeilinger, Carmen Amo Alonso

    Abstract: Rank collapse, a phenomenon where embedding vectors in sequence models rapidly converge to a uniform token or equilibrium state, has recently gained attention in the deep learning literature. This phenomenon leads to reduced expressivity and potential training instabilities due to vanishing gradients. Empirical evidence suggests that architectural components like skip connections, LayerNorm, and M… ▽ More

    Submitted 13 February, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

  3. arXiv:2408.11841  [pdf, other

    cs.CY cs.AI cs.CL

    Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants

    Authors: Beatriz Borges, Negar Foroutan, Deniz Bayazit, Anna Sotnikova, Syrielle Montariol, Tanya Nazaretzky, Mohammadreza Banaei, Alireza Sakhaeirad, Philippe Servant, Seyed Parsa Neshaei, Jibril Frej, Angelika Romanou, Gail Weiss, Sepideh Mamooler, Zeming Chen, Simin Fan, Silin Gao, Mete Ismayilzada, Debjit Paul, Alexandre Schöpfer, Andrej Janchevski, Anja Tiede, Clarence Linden, Emanuele Troiani, Francesco Salvi , et al. (65 additional authors not shown)

    Abstract: AI assistants are being increasingly used by students enrolled in higher education institutions. While these tools provide opportunities for improved teaching and education, they also pose significant challenges for assessment and learning outcomes. We conceptualize these challenges through the lens of vulnerability, the potential for university assessments and learning outcomes to be impacted by… ▽ More

    Submitted 27 November, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: 20 pages, 8 figures

    Journal ref: PNAS (2024) Vol. 121 | No. 49

  4. arXiv:2405.15731  [pdf, other

    cs.LG cs.AI eess.SY

    Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks

    Authors: Jerome Sieber, Carmen Amo Alonso, Alexandre Didier, Melanie N. Zeilinger, Antonio Orvieto

    Abstract: Softmax attention is the principle backbone of foundation models for various artificial intelligence applications, yet its quadratic complexity in sequence length can limit its inference throughput in long-context settings. To address this challenge, alternative architectures such as linear attention, State Space Models (SSMs), and Recurrent Neural Networks (RNNs) have been considered as more effi… ▽ More

    Submitted 8 December, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024

  5. arXiv:2405.15454  [pdf, other

    cs.CL eess.SY

    Linearly Controlled Language Generation with Performative Guarantees

    Authors: Emily Cheng, Marco Baroni, Carmen Amo Alonso

    Abstract: The increasing prevalence of Large Language Models (LMs) in critical applications highlights the need for controlled language generation strategies that are not only computationally efficient but that also enjoy performance guarantees. To achieve this, we use a common model of concept semantics as linearly represented in an LM's latent space. In particular, we take the view that natural language g… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2403.16899  [pdf, other

    eess.SY cs.CL cs.LG

    State Space Models as Foundation Models: A Control Theoretic Overview

    Authors: Carmen Amo Alonso, Jerome Sieber, Melanie N. Zeilinger

    Abstract: In recent years, there has been a growing interest in integrating linear state-space models (SSM) in deep neural network architectures of foundation models. This is exemplified by the recent success of Mamba, showing better performance than the state-of-the-art Transformer architectures in language tasks. Foundation models, like e.g. GPT-4, aim to encode sequential data into a latent space in orde… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  7. arXiv:2403.10762  [pdf, other

    cs.RO

    NARRATE: Versatile Language Architecture for Optimal Control in Robotics

    Authors: Seif Ismail, Antonio Arbues, Ryan Cotterell, René Zurbrügg, Carmen Amo Alonso

    Abstract: The impressive capabilities of Large Language Models (LLMs) have led to various efforts to enable robots to be controlled through natural language instructions, opening exciting possibilities for human-robot interaction The goal is for the motor-control task to be performed accurately, efficiently and safely while also enjoying the flexibility imparted by LLMs to specify and adjust the task throug… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  8. arXiv:2306.08004  [pdf

    cs.LG cs.AI

    Detection and classification of faults aimed at preventive maintenance of PV systems

    Authors: Edgar Hernando Sepúlveda Oviedo, Louise Travé-Massuyès, Audine Subias, Marko Pavlov, Corinne Alonso

    Abstract: Diagnosis in PV systems aims to detect, locate and identify faults. Diagnosing these faults is vital to guarantee energy production and extend the useful life of PV power plants. In the literature, multiple machine learning approaches have been proposed for this purpose. However, few of these works have paid special attention to the detection of fine faults and the specialized process of extractio… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Journal ref: XI Congreso Internacional de Ingenier{í}a Mec{á}nica, Mecatr{ó}nica y Automatizaci{ó}n 2023, Universidad Nacional de Colombia, Apr 2023, Carthag{è}ne, Colombia

  9. arXiv:2306.08003  [pdf

    cs.LG cs.AI

    DTW k-means clustering for fault detection in photovoltaic modules

    Authors: Edgar Hernando Sepúlveda Oviedo, Louise Travé-Massuyès, Audine Subias, Marko Pavlov, Corinne Alonso

    Abstract: The increase in the use of photovoltaic (PV) energy in the world has shown that the useful life and maintenance of a PV plant directly depend on theability to quickly detect severe faults on a PV plant. To solve this problem of detection, data based approaches have been proposed in the literature.However, these previous solutions consider only specific behavior of one or few faults. Most of these… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Journal ref: XI Congreso Internacional de Ingenier{í}a Mec{á}nica, Mecatr{ó}nica y Automatizaci{ó}n 2023, Apr 2023, Carthag{è}ne, Colombia

  10. arXiv:2206.13342  [pdf, other

    cs.CV cs.CL cs.IR cs.LG

    Open Set Classification of Untranscribed Handwritten Documents

    Authors: José Ramón Prieto, Juan José Flores, Enrique Vidal, Alejandro H. Toselli, David Garrido, Carlos Alonso

    Abstract: Huge amounts of digital page images of important manuscripts are preserved in archives worldwide. The amounts are so large that it is generally unfeasible for archivists to adequately tag most of the documents with the required metadata so as to low proper organization of the archives and effective exploration by scholars and the general public. The class or ``typology'' of a document is perhaps t… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  11. arXiv:2103.14990  [pdf, other

    cs.DC eess.SY

    Effective GPU Parallelization of Distributed and Localized Model Predictive Control

    Authors: Carmen Amo Alonso, Shih-Hao Tseng

    Abstract: To effectively control large-scale distributed systems online, model predictive control (MPC) has to swiftly solve the underlying high-dimensional optimization. There are multiple techniques applied to accelerate the solving process in the literature, mainly attributed to software-based algorithmic advancements and hardware-assisted computation enhancements. However, those methods focus on arithme… ▽ More

    Submitted 27 March, 2021; originally announced March 2021.

    Comments: Submitted to 2021 Control and Decision Conference

  12. Novel Common Vehicle Information Model (CVIM) for Future Automotive Vehicle Big Data Marketplaces

    Authors: Johannes Pillmann, Christian Wietfeld, Adrian Zarcula, Thomas Raugust, Daniel Calvo Alonso

    Abstract: Even though connectivity services have been introduced in many of the most recent car models, access to vehicle data is currently limited due to its proprietary nature. The European project AutoMat has therefore developed an open Marketplace providing a single point of access for brand-independent vehicle data. Thereby, vehicle sensor data can be leveraged for the design and implementation of enti… ▽ More

    Submitted 21 February, 2018; originally announced February 2018.

    Journal ref: Intelligent Vehicles Symposium (IV), 2017 IEEE