
Showing 1–50 of 153 results for author: Martinez, M

Searching in archive cs.

  1. arXiv:2506.18213

    cs.AI

    A Conceptual Framework for AI Capability Evaluations

    Authors: María Victoria Carro, Denise Alejandra Mester, Francisca Gauna Selasco, Luca Nicolás Forziati Gangi, Matheo Sandleris Musa, Lola Ramos Pereyra, Mario Leiva, Juan Gustavo Corvalan, María Vanina Martinez, Gerardo Simari

    Abstract: As AI systems advance and integrate into society, well-designed and transparent evaluations are becoming essential tools in AI governance, informing decisions by providing evidence about system capabilities and risks. Yet there remains a lack of clarity on how to perform these assessments both comprehensively and reliably. To address this gap, we propose a conceptual framework for analyzing AI cap…

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: text overlap with arXiv:2306.04181 by other authors

  2. arXiv:2506.17857

    q-bio.BM cs.LG

    AbRank: A Benchmark Dataset and Metric-Learning Framework for Antibody-Antigen Affinity Ranking

    Authors: Chunan Liu, Aurelien Pelissier, Yanjun Shao, Lilian Denzler, Andrew C. R. Martin, Brooks Paige, Mariia Rodriguez Martinez

    Abstract: Accurate prediction of antibody-antigen (Ab-Ag) binding affinity is essential for therapeutic design and vaccine development, yet the performance of current models is limited by noisy experimental labels, heterogeneous assay conditions, and poor generalization across the vast antibody and antigen sequence space. We introduce AbRank, a large-scale benchmark and evaluation framework that reframes af…

    Submitted 21 June, 2025; originally announced June 2025.

  3. arXiv:2506.17208

    cs.SE cs.AI cs.CL

    Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems

    Authors: Matias Martinez, Xavier Franch

    Abstract: The rapid progress in Automated Program Repair (APR) has been driven by advances in AI, particularly large language models (LLMs) and agent-based systems. SWE-Bench is a recent benchmark designed to evaluate LLM-based repair systems using real issues and pull requests mined from 12 popular open-source Python repositories. Its public leaderboards, SWE-Bench Lite and SWE-Bench Verified, have become…

    Submitted 20 June, 2025; originally announced June 2025.

  4. arXiv:2506.09977

    cs.AI

    How Do People Revise Inconsistent Beliefs? Examining Belief Revision in Humans with User Studies

    Authors: Stylianos Loukas Vasileiou, Antonio Rago, Maria Vanina Martinez, William Yeoh

    Abstract: Understanding how humans revise their beliefs in light of new information is crucial for developing AI systems which can effectively model, and thus align with, human reasoning. While theoretical belief revision frameworks rely on a set of principles that establish how these operations are performed, empirical evidence from cognitive psychology suggests that people may follow different patterns wh…

    Submitted 11 June, 2025; originally announced June 2025.

  5. arXiv:2504.19042

    physics.ins-det cs.AI cs.LG hep-ex nucl-ex

    Generative Models for Fast Simulation of Cherenkov Detectors at the Electron-Ion Collider

    Authors: James Giroux, Michael Martinez, Cristiano Fanelli

    Abstract: The integration of Deep Learning (DL) into experimental nuclear and particle physics has driven significant progress in simulation and reconstruction workflows. However, traditional simulation frameworks such as Geant4 remain computationally intensive, especially for Cherenkov detectors, where simulating optical photon transport through complex geometries and reflective surfaces introduces a major…

    Submitted 26 April, 2025; originally announced April 2025.

    Comments: 45 pages, 27 figures

  6. arXiv:2504.06324

    cs.CY cs.AI

    From Stability to Inconsistency: A Study of Moral Preferences in LLMs

    Authors: Monika Jotautaite, Mary Phuong, Chatrik Singh Mangat, Maria Angelica Martinez

    Abstract: As large language models (LLMs) increasingly integrate into our daily lives, it becomes crucial to understand their implicit biases and moral tendencies. To address this, we introduce a Moral Foundations LLM dataset (MFD-LLM) grounded in Moral Foundations Theory, which conceptualizes human morality through six core foundations. We propose a novel evaluation method that captures the full spectrum o…

    Submitted 8 April, 2025; originally announced April 2025.

  7. arXiv:2504.03624

    cs.CL cs.AI cs.LG

    Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

    Authors: NVIDIA: Aaron Blakeman, Aarti Basant, Abhinav Khattar, Adithya Renduchintala, Akhiad Bercovich, Aleksander Ficek, Alexis Bjorlin, Ali Taghibakhshi, Amala Sanjay Deshmukh, Ameya Sunil Mahabaleshwarkar, Andrew Tao, Anna Shors, Ashwath Aithal, Ashwin Poojary, Ayush Dattagupta, Balaram Buddharaju, Bobby Chen, Boris Ginsburg, Boxin Wang, Brandon Norick, Brian Butterfield, Bryan Catanzaro, Carlo del Mundo, et al. (176 additional authors not shown)

    Abstract: As inference-time scaling becomes critical for enhanced reasoning capabilities, it is increasingly becoming important to build models that are efficient to infer. We introduce Nemotron-H, a family of 8B and 56B/47B hybrid Mamba-Transformer models designed to reduce inference cost for a given accuracy level. To achieve this goal, we replace the majority of self-attention layers in the common Transf…

    Submitted 15 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

  8. arXiv:2503.17952

    cs.CL

    SLIDE: Sliding Localized Information for Document Extraction

    Authors: Divyansh Singh, Manuel Nunez Martinez, Bonnie J. Dorr, Sonja Schmer Galunder

    Abstract: Constructing accurate knowledge graphs from long texts and low-resource languages is challenging, as large language models (LLMs) experience degraded performance with longer input chunks. This problem is amplified in low-resource settings where data scarcity hinders accurate entity and relationship extraction. Contextual retrieval methods, while improving retrieval accuracy, struggle with long doc…

    Submitted 23 March, 2025; originally announced March 2025.

  9. arXiv:2501.13706

    cs.CE physics.comp-ph

    Analysis of Eccentric Coaxial Waveguides Filled with Lossy Anisotropic Media via Finite Difference

    Authors: Raul O. Ribeiro, Maria A. Martinez, Guilherme S. Rosa, Rafael A. Penchel

    Abstract: This study presents a finite difference method (FDM) to model the electromagnetic field propagation in eccentric coaxial waveguides filled with lossy uniaxially anisotropic media. The formulation utilizes conformal transformation to map the eccentric circular waveguide into an equivalent concentric one. In the concentric problem, we introduce a novel normalized Helmholtz equation to decouple TM an…

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: This work was presented at the SBMO 2024 - XXI Brazilian Symposium on Microwaves and Optoelectronics. For more information about the conference, please visit https://www.sbmo.org.br/sbmo/2024/home

  10. arXiv:2412.15441

    cs.SE cs.AI cs.LG

    Energy consumption of code small language models serving with runtime engines and execution providers

    Authors: Francisco Durán, Matias Martinez, Patricia Lago, Silverio Martínez-Fernández

    Abstract: Background. The rapid growth of Language Models (LMs), particularly in code generation, requires substantial computational resources, raising concerns about energy consumption and environmental impact. Optimizing LMs inference for energy efficiency is crucial, and Small Language Models (SLMs) offer a promising solution to reduce resource demands. Aim. Our goal is to analyze the impact of deep le…

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: 26 pages, submitted to journal

  11. arXiv:2412.10509

    cs.AI cs.CL cs.LG

    Do Large Language Models Show Biases in Causal Learning?

    Authors: Maria Victoria Carro, Francisca Gauna Selasco, Denise Alejandra Mester, Margarita Gonzales, Mario A. Leiva, Maria Vanina Martinez, Gerardo I. Simari

    Abstract: Causal learning is the cognitive process of developing the capability of making causal inferences based on available information, often guided by normative principles. This process is prone to errors and biases, such as the illusion of causality, in which people perceive a causal relationship between two variables despite lacking supporting evidence. This cognitive bias has been proposed to underl…

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: 15 pages, 6 figures

  12. arXiv:2412.03982

    cs.CV cs.AI cs.LG eess.IV

    Exploring Fully Convolutional Networks for the Segmentation of Hyperspectral Imaging Applied to Advanced Driver Assistance Systems

    Authors: Jon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe, M. Victoria Martínez, Inés del Campo

    Abstract: Advanced Driver Assistance Systems (ADAS) are designed with the main purpose of increasing the safety and comfort of vehicle occupants. Most of current computer vision-based ADAS perform detection and tracking tasks quite successfully under regular conditions, but are not completely reliable, particularly under adverse weather and changing lighting conditions, neither in complex situations with ma…

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: arXiv admin note: text overlap with arXiv:2411.19274

    Journal ref: Design and Architecture for Signal and Image Processing (DASIP 2022)

  13. arXiv:2411.19274

    cs.CV cs.AI cs.LG eess.IV

    On-chip Hyperspectral Image Segmentation with Fully Convolutional Networks for Scene Understanding in Autonomous Driving

    Authors: Jon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe, M. Victoria Martínez, Unai Martínez-Corral, Óscar Mata Carballeira, Inés del Campo

    Abstract: Most of current computer vision-based advanced driver assistance systems (ADAS) perform detection and tracking of objects quite successfully under regular conditions. However, under adverse weather and changing lighting conditions, and in complex situations with many overlapping objects, these systems are not completely reliable. The spectral reflectance of the different objects in a driving scene…

    Submitted 28 November, 2024; originally announced November 2024.

    Journal ref: 2023 Journal of Systems Architecture (JSA)

  14. arXiv:2411.17543

    cs.CV cs.AI cs.AR cs.LG eess.IV

    Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving

    Authors: Jon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe, Óscar Mata-Carballeira, M. Victoria Martínez

    Abstract: The article discusses the use of low cost System-On-Module (SOM) platforms for the implementation of efficient hyperspectral imaging (HSI) processors for application in autonomous driving. The work addresses the challenges of shaping and deploying multiple layer fully convolutional networks (FCN) for low-latency, on-board image semantic segmentation using resource- and power-constrained processing…

    Submitted 26 November, 2024; originally announced November 2024.

    Journal ref: 2023 30th IEEE International Conference on Electronics, Circuits and Systems (ICECS)

  15. arXiv:2411.17530

    cs.CV cs.AI cs.LG eess.IV

    HSI-Drive v2.0: More Data for New Challenges in Scene Understanding for Autonomous Driving

    Authors: Jon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe, M. Victoria Martínez, Unai Martínez-Corral

    Abstract: We present the updated version of the HSI-Drive dataset aimed at developing automated driving systems (ADS) using hyperspectral imaging (HSI). The v2.0 version includes new annotated images from videos recorded during winter and fall in real driving scenarios. Added to the spring and summer images included in the previous v1.1 version, the new dataset contains 752 images covering the four seasons.…

    Submitted 26 November, 2024; originally announced November 2024.

    Journal ref: 2023 IEEE Symposium Series on Computational Intelligence (SSCI)

  16. arXiv:2411.04328

    cs.CL

    Balancing Transparency and Accuracy: A Comparative Analysis of Rule-Based and Deep Learning Models in Political Bias Classification

    Authors: Manuel Nunez Martinez, Sonja Schmer-Galunder, Zoey Liu, Sangpil Youm, Chathuri Jayaweera, Bonnie J. Dorr

    Abstract: The unchecked spread of digital information, combined with increasing political polarization and the tendency of individuals to isolate themselves from opposing political viewpoints, has driven researchers to develop systems for automatically detecting political bias in media. This trend has been further fueled by discussions on social media. We explore methods for categorizing bias in US news art…

    Submitted 6 November, 2024; originally announced November 2024.

  17. arXiv:2410.14627

    cs.SE cs.AI cs.CL

    CELI: Controller-Embedded Language Model Interactions

    Authors: Jan-Samuel Wagner, Dave DeCaprio, Abishek Chiffon Muthu Raja, Jonathan M. Holman, Lauren K. Brady, Sky C. Cheung, Hosein Barzekar, Eric Yang, Mark Anthony Martinez II, David Soong, Sriram Sridhar, Han Si, Brandon W. Higgs, Hisham Hamadeh, Scott Ogden

    Abstract: We introduce Controller-Embedded Language Model Interactions (CELI), a framework that integrates control logic directly within language model (LM) prompts, facilitating complex, multi-stage task execution. CELI addresses limitations of existing prompt engineering and workflow optimization techniques by embedding control logic directly within the operational context of language models, enabling dyn…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 26 pages, 2 figures

    MSC Class: 68T50; 68Q32; 68N19 ACM Class: I.2.6; I.2.7; D.2.2

  18. arXiv:2410.10252

    cs.DM

    The formula for the completion time of project networks

    Authors: Manuel Castejón-Limas, Gabriel Medina Martínez, Virginia Riego del Castillo, Laura Fernández-Robles

    Abstract: This paper formulates the completion time $\tau$ of a project network as $\tau = \|\mathbf{R}\mathbf{t}\|_\infty$ where the rows of $\mathbf{R}$ are simple paths of the network and $\mathbf{t}$ is a column vector representing the duration of the activities. Considering this product as a linear transformation leads to interesting findings on the topological relevance of both paths and activities using…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 20 pages, 2 figures

  19. arXiv:2410.06491

    cs.AI cs.LG

    Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack

    Authors: Leo McKee-Reid, Christoph Sträter, Maria Angelica Martinez, Joe Needham, Mikita Balesni

    Abstract: Previous work has shown that training "helpful-only" LLMs with reinforcement learning on a curriculum of gameable environments can lead models to generalize to egregious specification gaming, such as editing their own reward function or modifying task checklists to appear more successful. We show that gpt-4o, gpt-4o-mini, o1-preview, and o1-mini - frontier models trained to be helpful, harmless, a…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 20 pages, 9 figures

  20. arXiv:2409.15813

    cs.CV cs.AI cs.MM

    Layer-wise Model Merging for Unsupervised Domain Adaptation in Segmentation Tasks

    Authors: Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Jose M Martínez

    Abstract: Merging parameters of multiple models has resurfaced as an effective strategy to enhance task performance and robustness, but prior work is limited by the high costs of ensemble creation and inference. In this paper, we leverage the abundance of freely accessible trained models to introduce a cost-free approach to model merging. It focuses on a layer-wise integration of merged models, aiming to ma…

    Submitted 24 September, 2024; originally announced September 2024.

  21. arXiv:2409.02281

    cs.CV cs.LG

    K-Origins: Better Colour Quantification for Neural Networks

    Authors: Lewis Mason, Mark Martinez

    Abstract: K-Origins is a neural network layer designed to improve image-based network performances when learning colour, or intensities, is beneficial. Over 250 encoder-decoder convolutional networks are trained and tested on 16-bit synthetic data, demonstrating that K-Origins improves semantic segmentation accuracy in two scenarios: object detection with low signal-to-noise ratios, and segmenting multiple…

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 16 pages, 13 figures, 1 table

  22. arXiv:2408.06875

    cs.AI

    Advancing Interactive Explainable AI via Belief Change Theory

    Authors: Antonio Rago, Maria Vanina Martinez

    Abstract: As AI models become ever more complex and intertwined in humans' daily lives, greater levels of interactivity of explainable AI (XAI) methods are needed. In this paper, we propose the use of belief change theory as a formal foundation for operators that model the incorporation of new information, i.e. user feedback in interactive XAI, to logical representations of data-driven classifiers. We argue…

    Submitted 14 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: 9 pages. To be published at KR 2024

  23. arXiv:2408.01050

    cs.SE cs.CL cs.LG

    The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines

    Authors: Matias Martinez

    Abstract: The recent surge of open-source large language models (LLMs) enables developers to create AI-based solutions while maintaining control over aspects such as privacy and compliance, thereby providing governance and ownership of the model deployment process. To utilize these LLMs, inference engines are needed. These engines load the model's weights onto available resources, such as GPUs, and process…

    Submitted 2 August, 2024; originally announced August 2024.

  24. arXiv:2406.11704

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia: Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek, et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be…

    Submitted 6 August, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  25. arXiv:2405.14020

    cs.LG cs.AI

    Unlearning Information Bottleneck: Machine Unlearning of Systematic Patterns and Biases

    Authors: Ling Han, Hao Huang, Dustin Scheinost, Mary-Anne Hartley, María Rodríguez Martínez

    Abstract: Effective adaptation to distribution shifts in training data is pivotal for sustaining robustness in neural networks, especially when removing specific biases or outdated information, a process known as machine unlearning. Traditional approaches typically assume that data variations are random, which makes it difficult to adjust the model parameters accurately to remove patterns and characteristic…

    Submitted 22 May, 2024; originally announced May 2024.

  26. Optimization of resources for digital radio transmission over IBOC FM through max-min fairness

    Authors: Mónica Rico Martínez, Juan Carlos Vesga Ferreira, Joel Carroll Vargas, María Consuelo Rodríguez Niño, Andrés Alejandro Diaz Toro, William Alexander Cuevas Carrero

    Abstract: The equitable distribution of resources in a network is a complex process, considering that not all nodes have the same requirements, and the In-Band On-Channel (IBOC) hybrid transmission system is no exception. The IBOC system utilizes a hybrid in-band transmission to simultaneously broadcast analog and digital audio over the FM band. This article proposes the use of a Max-Min Fairness (MMF) algo…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 10 pages, 3 table

  27. arXiv:2403.14291

    cs.CV

    Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models

    Authors: Pablo Marcos-Manchón, Roberto Alcover-Couso, Juan C. SanMiguel, Jose M. Martínez

    Abstract: Diffusion models represent a new paradigm in text-to-image generation. Beyond generating high-quality images from text prompts, models such as Stable Diffusion have been successfully extended to the joint generation of semantic segmentation pseudo-masks. However, current extensions primarily rely on extracting attentions linked to prompt words used for image synthesis. This approach limits the gen…

    Submitted 21 March, 2024; originally announced March 2024.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)

  28. arXiv:2402.16819

    cs.CL cs.AI cs.LG

    Nemotron-4 15B Technical Report

    Authors: Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Mostofa Patwary, Sandeep Subramanian, Dan Su, Chen Zhu, Deepak Narayanan, Aastha Jhunjhunwala, Ayush Dattagupta, Vibhu Jawa, Jiwei Liu, Ameya Mahabaleshwarkar, Osvald Nitski, Annika Brundyn, James Maki, Miguel Martinez, Jiaxuan You, John Kamalu, Patrick LeGresley, Denys Fridman, Jared Casper, Ashwath Aithal, Oleksii Kuchaiev, Mohammad Shoeybi, et al. (2 additional authors not shown)

    Abstract: We introduce Nemotron-4 15B, a 15-billion-parameter large multilingual language model trained on 8 trillion text tokens. Nemotron-4 15B demonstrates strong performance when assessed on English, multilingual, and coding tasks: it outperforms all existing similarly-sized open models on 4 out of 7 downstream evaluation areas and achieves competitive performance to the leading open models in the remai…

    Submitted 27 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  29. arXiv:2402.09265

    cs.DB cs.AI cs.LO

    Computational Complexity of Preferred Subset Repairs on Data-Graphs

    Authors: Nina Pardal, Santiago Cifuentes, Edwin Pin, Maria Vanina Martinez, Sergio Abriola

    Abstract: Preferences are a pivotal component in practical reasoning, especially in tasks that involve decision-making over different options or courses of action that could be pursued. In this work, we focus on repairing and querying inconsistent knowledge bases in the form of graph databases, which involves finding a way to solve conflicts in the knowledge base and considering answers that are entailed fr…

    Submitted 27 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Appendix

    MSC Class: 68P15; 68T27; 03B70; 68T37

  30. Identifying architectural design decisions for achieving green ML serving

    Authors: Francisco Durán, Silverio Martínez-Fernández, Matias Martinez, Patricia Lago

    Abstract: The growing use of large machine learning models highlights concerns about their increasing computational demands. While the energy consumption of their training phase has received attention, fewer works have considered the inference phase. For ML inference, the binding of ML models to the ML system for user access, known as ML serving, is a critical yet understudied step for achieving efficiency…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted for publication as short paper in Conference on AI Engineering Software Engineering for AI (CAIN 2024)

  31. arXiv:2401.12731

    cs.AI cs.LG cs.LO

    The Distributional Uncertainty of the SHAP score in Explainable Machine Learning

    Authors: Santiago Cifuentes, Leopoldo Bertossi, Nina Pardal, Sergio Abriola, Maria Vanina Martinez, Miguel Romero

    Abstract: Attribution scores reflect how important the feature values in an input entity are for the output of a machine learning model. One of the most popular attribution scores is the SHAP score, which is an instantiation of the general Shapley value used in coalition game theory. The definition of this score relies on a probability distribution on the entity population. Since the exact distribution is g…

    Submitted 13 August, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: In ECAI 2024 proceedings

    MSC Class: 68T37; 68T27

  32. arXiv:2311.17393

    cs.AI

    Comparison of metaheuristics for the firebreak placement problem: a simulation-based optimization approach

    Authors: David Palacios-Meneses, Jaime Carrasco, Sebastián Dávila, Maximiliano Martínez, Rodrigo Mahaluf, Andrés Weintraub

    Abstract: The problem of firebreak placement is crucial for fire prevention, and its effectiveness at landscape scale will depend on their ability to impede the progress of future wildfires. To provide an adequate response, it is therefore necessary to consider the stochastic nature of fires, which are highly unpredictable from ignition to extinction. Thus, the placement of firebreaks can be considered a st…

    Submitted 29 November, 2023; originally announced November 2023.

  33. Beyond Certificates: 6G-ready Access Control for the Service-Based Architecture with Decentralized Identifiers and Verifiable Credentials

    Authors: Sandro Rodriguez Garzon, Hai Dinh Tuan, Maria Mora Martinez, Axel Küpper, Hans Joachim Einsiedler, Daniela Schneider

    Abstract: Next generation mobile networks are poised to transition from monolithic structures owned and operated by single mobile network operators into multi-stakeholder networks where various parties contribute with infrastructure, resources, and services. However, a federation of networks and services brings along a crucial challenge: Guaranteeing secure and trustworthy access control among network entit…

    Submitted 23 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: This work has been submitted to the IEEE for possible publication

    Journal ref: 2024 Joint European Conference on Networks and Communications & 6G Summit (EuCNC/6G Summit), 2024, pp. 830-835

  34. On the Feasibility of Cross-Language Detection of Malicious Packages in npm and PyPI

    Authors: Piergiorgio Ladisa, Serena Elisa Ponta, Nicola Ronzoni, Matias Martinez, Olivier Barais

    Abstract: Current software supply chains heavily rely on open-source packages hosted in public repositories. Given the popularity of ecosystems like npm and PyPI, malicious users started to spread malware by publishing open-source packages containing malicious code. Recent works apply machine learning techniques to detect malicious packages in the npm ecosystem. However, the scarcity of samples poses a chal…

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: Proceedings of Annual Computer Security Applications Conference (ACSAC '23), December 4--8, 2023, Austin, TX, USA

  35. arXiv:2310.06177

    cs.LG

    DockGame: Cooperative Games for Multimeric Rigid Protein Docking

    Authors: Vignesh Ram Somnath, Pier Giuseppe Sessa, Maria Rodriguez Martinez, Andreas Krause

    Abstract: Protein interactions and assembly formation are fundamental to most biological processes. Predicting the assembly structure from constituent proteins -- referred to as the protein docking task -- is thus a crucial step in protein design applications. Most traditional and deep learning methods for docking have focused mainly on binary docking, following either a search-based, regression-based, or g…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Under Review

  36. arXiv:2309.03797

    cs.LG

    Conformal Autoregressive Generation: Beam Search with Coverage Guarantees

    Authors: Nicolas Deutschmann, Marvin Alberts, María Rodríguez Martínez

    Abstract: We introduce two new extensions to the beam search algorithm based on conformal predictions (CP) to produce sets of sequences with theoretical coverage guarantees. The first method is very simple and proposes dynamically-sized subsets of beam search results but, unlike typical CP procedures, has an upper bound on the achievable guarantee depending on a post-hoc calibration measure. Our second algo…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures

  37. arXiv:2308.00619

    quant-ph cs.ET hep-ex

    A quantum algorithm for track reconstruction in the LHCb vertex detector

    Authors: Davide Nicotra, Miriam Lucio Martinez, Jacco Andreas de Vries, Marcel Merk, Kurt Driessens, Ronald Leonard Westra, Domenica Dibenedetto, Daniel Hugo Cámpora Pérez

    Abstract: High-energy physics is facing increasingly computational challenges in real-time event reconstruction for the near-future high-luminosity era. Using the LHCb vertex detector as a use-case, we explore a new algorithm for particle track reconstruction based on the minimisation of an Ising-like Hamiltonian with a linear algebra approach. The use of a classical matrix inversion technique results in tr…

    Submitted 17 October, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: 23 pages, 10 figures

  38. The Hitchhiker's Guide to Malicious Third-Party Dependencies

    Authors: Piergiorgio Ladisa, Merve Sahin, Serena Elisa Ponta, Marco Rosa, Matias Martinez, Olivier Barais

    Abstract: The increasing popularity of certain programming languages has spurred the creation of ecosystem-specific package repositories and package managers. Such repositories (e.g., npm, PyPI) serve as public databases that users can query to retrieve packages for various functionalities, whereas package managers automatically handle dependency resolution and package installation on the client side. These…

    Submitted 6 October, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Proceedings of the 2023 Workshop on Software Supply Chain Offensive Research and Ecosystem Defenses (SCORED '23), November 30, 2023, Copenhagen, Denmark

  39. arXiv:2307.07076

    cs.HC cs.CL

    An Analysis of Dialogue Repair in Virtual Voice Assistants

    Authors: Matthew Carson Galbraith, Mireia Gómez i Martínez

    Abstract: Language speakers often use what are known as repair initiators to mend fundamental disconnects that occur between them during verbal communication. Previous research in this field has mainly focused on the human-to-human use of repair initiator. We proposed an examination of dialogue repair structure wherein the dialogue initiator is human and the party that initiates or responds to the repair is…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 2021, non-print, unpublished version

  40. arXiv:2305.19901  [pdf, other]

    cs.LG stat.ML

    Adaptive Conformal Regression with Jackknife+ Rescaled Scores

    Authors: Nicolas Deutschmann, Mattia Rigotti, Maria Rodriguez Martinez

    Abstract: Conformal regression provides prediction intervals with global coverage guarantees, but often fails to capture local error distributions, leading to non-homogeneous coverage. We address this with a new adaptive method based on rescaling conformal scores with an estimate of local score distribution, inspired by the Jackknife+ method, which enables the use of calibration data in conformal scores wit…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 24 pages, 7 figures

  41. arXiv:2304.05200  [pdf, other]

    cs.CR cs.SE

    Journey to the Center of Software Supply Chain Attacks

    Authors: Piergiorgio Ladisa, Serena Elisa Ponta, Antonino Sabetta, Matias Martinez, Olivier Barais

    Abstract: This work discusses open-source software supply chain attacks and proposes a general taxonomy describing how attackers conduct them. We then provide a list of safeguards to mitigate such attacks. We present our tool "Risk Explorer for Software Supply Chains" to explore such information and we discuss its industrial use-cases.

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2204.04008

  42. arXiv:2302.13961  [pdf, other]

    cs.CV

    Soft labelling for semantic segmentation: Bringing coherence to label down-sampling

    Authors: Roberto Alcover-Couso, Marcos Escudero-Vinolo, Juan C. SanMiguel, Jose M. Martinez

    Abstract: In semantic segmentation, training data down-sampling is commonly performed due to limited resources, the need to adapt image size to the model input, or improve data augmentation. This down-sampling typically employs different strategies for the image data and the annotated labels. Such discrepancy leads to mismatches between the down-sampled color and label images. Hence, the training performanc…

    Submitted 19 February, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

  43. arXiv:2302.11419  [pdf, other]

    cs.LG q-bio.QM

    Aligned Diffusion Schrödinger Bridges

    Authors: Vignesh Ram Somnath, Matteo Pariset, Ya-Ping Hsieh, Maria Rodriguez Martinez, Andreas Krause, Charlotte Bunne

    Abstract: Diffusion Schrödinger bridges (DSB) have recently emerged as a powerful framework for recovering stochastic dynamics via their marginal observations at different time points. Despite numerous successful applications, existing algorithms for solving DSBs have so far failed to utilize the structure of aligned data, which naturally arises in many biological phenomena. In this paper, we propose a nove…

    Submitted 28 April, 2024; v1 submitted 22 February, 2023; originally announced February 2023.

  44. arXiv:2302.02205  [pdf, other]

    cs.OH

    Automating Crochet Patterns for Surfaces of Revolution

    Authors: Megan Martinez, Amanda Taylor Lipnicki

    Abstract: A surface of revolution is created by taking a curve in the $xy$-plane and rotating it about some axis. We develop a program which automatically generates crochet patterns for surfaces by revolution when they are obtained by rotating about the $x$-axis. In order to accomplish this, we invoke the arclength integral to determine where to take measurements for each row. In addition, a distance measur…

    Submitted 31 January, 2023; originally announced February 2023.

    MSC Class: 00A66

  45. arXiv:2302.00967  [pdf, other]

    cs.LG cs.AI cs.SE

    Energy Efficiency of Training Neural Network Architectures: An Empirical Study

    Authors: Yinlena Xu, Silverio Martínez-Fernández, Matias Martinez, Xavier Franch

    Abstract: The evaluation of Deep Learning models has traditionally focused on criteria such as accuracy, F1 score, and related measures. The increasing availability of high computational power environments allows the creation of deeper and more complex models. However, the computations needed to train such models entail a large carbon footprint. In this work, we study the relations between DL model architec…

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: Accepted in HICSS 2023. For its published version refer to the Proceedings of the 56th Hawaii International Conference on System Sciences; URI https://hdl.handle.net/10125/102727

    ACM Class: D.2; I.2

    Journal ref: Proceedings of the 56th Hawaii International Conference on System Sciences, pp. 781-790 (2023)

  46. arXiv:2211.15538  [pdf, other]

    cs.CV

    Graph Convolutional Network for Multi-Target Multi-Camera Vehicle Tracking

    Authors: Elena Luna, Juan Carlos San Miguel, José María Martínez, Marcos Escudero-Viñolo

    Abstract: This letter focuses on the task of Multi-Target Multi-Camera vehicle tracking. We propose to associate single-camera trajectories into multi-camera global trajectories by training a Graph Convolutional Network. Our approach simultaneously processes all cameras providing a global solution, and it is also robust to large cameras unsynchronizations. Furthermore, we design a new loss function to deal…

    Submitted 28 November, 2022; originally announced November 2022.

  47. Energy Consumption of Automated Program Repair

    Authors: Matias Martinez, Silverio Martínez-Fernández, Xavier Franch

    Abstract: Automated program repair (APR) aims to automatize the process of repairing software bugs in order to reduce the cost of maintaining software programs. Moreover, the success (given by the accuracy metric) of APR approaches has increased in recent years. However, no previous work has considered the energy impact of repairing bugs automatically using APR. The field of green software research aims to…

    Submitted 5 February, 2024; v1 submitted 22 November, 2022; originally announced November 2022.

    Journal ref: 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion Proceedings (ICSE-Companion '24), April 14--20, 2024, Lisbon, Portugal

  48. arXiv:2210.03998  [pdf, other]

    cs.CR

    Towards the Detection of Malicious Java Packages

    Authors: Piergiorgio Ladisa, Henrik Plate, Matias Martinez, Olivier Barais, Serena Elisa Ponta

    Abstract: Open-source software supply chain attacks aim at infecting downstream users by poisoning open-source packages. The common way of consuming such artifacts is through package repositories and the development of vetting strategies to detect such attacks is ongoing research. Despite its popularity, the Java ecosystem is the less explored one in the context of supply chain attacks. In this paper we p…

    Submitted 8 October, 2022; originally announced October 2022.

  49. arXiv:2210.03180  [pdf, other]

    eess.IV cs.CV cs.LG

    A ResNet is All You Need? Modeling A Strong Baseline for Detecting Referable Diabetic Retinopathy in Fundus Images

    Authors: Tomás Castilla, Marcela S. Martínez, Mercedes Leguía, Ignacio Larrabide, José Ignacio Orlando

    Abstract: Deep learning is currently the state-of-the-art for automated detection of referable diabetic retinopathy (DR) from color fundus photographs (CFP). While the general interest is put on improving results through methodological innovations, it is not clear how good these approaches perform compared to standard deep classification models trained with the appropriate settings. In this paper we propose…

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at the 18th International Symposium on Medical Information Processing and Analysis (SIPAIM 2022)

  50. arXiv:2209.04933  [pdf, other]

    cs.LG math.DG stat.CO

    Dimensionality Reduction using Elastic Measures

    Authors: J. Derek Tucker, Matthew T. Martinez, Jose M. Laborde

    Abstract: With the recent surge in big data analytics for hyper-dimensional data there is a renewed interest in dimensionality reduction techniques for machine learning applications. In order for these methods to improve performance gains and understanding of the underlying data, a proper metric needs to be identified. This step is often overlooked and metrics are typically chosen without consideration of t…

    Submitted 19 January, 2023; v1 submitted 7 September, 2022; originally announced September 2022.