Skip to main content

Showing 1–50 of 127 results for author: Del, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.05147  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn cs.LG

    Pseudo-likelihood produces associative memories able to generalize, even for asymmetric couplings

    Authors: Francesco D'Amico, Dario Bocchi, Luca Maria Del Bono, Saverio Rossi, Matteo Negri

    Abstract: Energy-based probabilistic models learned by maximizing the likelihood of the data are limited by the intractability of the partition function. A widely used workaround is to maximize the pseudo-likelihood, which replaces the global normalization with tractable local normalizations. Here we show that, in the zero-temperature limit, a network trained to maximize pseudo-likelihood naturally implemen… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

  2. arXiv:2505.23500  [pdf, ps, other

    cs.SE cs.CL cs.DL

    Identity resolution of software metadata using Large Language Models

    Authors: Eva Martín del Pico, Josep Lluís Gelpí, Salvador Capella-Gutiérrez

    Abstract: Software is an essential component of research. However, little attention has been paid to it compared with that paid to research data. Recently, there has been an increase in efforts to acknowledge and highlight the importance of software in research activities. Structured metadata from platforms like bio.tools, Bioconductor, and Galaxy ToolShed offers valuable insights into research software i… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  3. arXiv:2505.22598  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech cs.AI cs.LG physics.comp-ph

    On the performance of machine-learning-assisted Monte Carlo in sampling from simple statistical physics models

    Authors: Luca Maria Del Bono, Federico Ricci-Tersenghi, Francesco Zamponi

    Abstract: Recent years have seen a rise in the application of machine learning techniques to aid the simulation of hard-to-sample systems that cannot be studied using traditional methods. Despite the introduction of many different architectures and procedures, a wide theoretical understanding is still lacking, with the risk of suboptimal implementations. As a first step to address this gap, we provide here… ▽ More

    Submitted 15 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: 17 pages, 10 figures

  4. arXiv:2505.19467  [pdf, other

    cs.DC

    GPU acceleration of non-equilibrium Green's function calculation using OpenACC and CUDA FORTRAN

    Authors: Jia Yin, Khaled Z. Ibrahim, Mauro Del Ben, Jack Deslippe, Yang-hao Chan, Chao Yang

    Abstract: The numerical solution of the Kadanoff-Baym nonlinear integro-differential equations, which yields the non-equilibrium Green's functions (NEGFs) of quantum many-body systems, poses significant computational challenges due to its high computational complexity. In this work, we present efficient implementations of a numerical method for solving these equations on distributed-memory architectures, in… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 14 pages, 20 figures

    MSC Class: 68W10

  5. arXiv:2505.13302  [pdf, ps, other

    cs.CL

    I'll believe it when I see it: Images increase misinformation sharing in Vision-Language Models

    Authors: Alice Plebe, Timothy Douglas, Diana Riazi, R. Maria del Rio-Chanona

    Abstract: Large language models are increasingly integrated into news recommendation systems, raising concerns about their role in spreading misinformation. In humans, visual content is known to boost credibility and shareability of information, yet its effect on vision-language models (VLMs) remains unclear. We present the first study examining how images influence VLMs' propensity to reshare news content,… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  6. arXiv:2505.07457  [pdf, ps, other

    econ.GN cs.AI

    Can Generative AI agents behave like humans? Evidence from laboratory market experiments

    Authors: R. Maria del Rio-Chanona, Marco Pangallo, Cars Hommes

    Abstract: We explore the potential of Large Language Models (LLMs) to replicate human behavior in economic market experiments. Compared to previous studies, we focus on dynamic feedback between LLM agents: the decisions of each LLM impact the market price at the current step, and so affect the decisions of the other LLMs at the next step. We compare LLM behavior to market dynamics observed in laboratory set… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  7. arXiv:2502.18620  [pdf, other

    cs.CV cs.AI cs.LG

    Diffusion Models for conditional MRI generation

    Authors: Miguel Herencia García del Castillo, Ricardo Moya Garcia, Manuel Jesús Cerezo Mazón, Ekaitz Arriola Garcia, Pablo Menéndez Fernández-Miranda

    Abstract: In this article, we present a Latent Diffusion Model (LDM) for the generation of brain Magnetic Resonance Imaging (MRI), conditioning its generation based on pathology (Healthy, Glioblastoma, Sclerosis, Dementia) and acquisition modality (T1w, T1ce, T2w, Flair, PD). To evaluate the quality of the generated images, the Fréchet Inception Distance (FID) and Multi-Scale Structural Similarity Index (… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

  8. arXiv:2502.13791  [pdf, ps, other

    cs.CL

    From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

    Authors: Nathanaël Carraz Rakotonirina, Mohammed Hamdy, Jon Ander Campos, Lucas Weber, Alberto Testoni, Marzieh Fadaee, Sandro Pezzelle, Marco Del Tredici

    Abstract: Large Language Models (LLMs) are increasingly used in working environments for a wide range of tasks, excelling at solving individual problems in isolation. However, are they also able to effectively collaborate over long-term interactions? To investigate this, we introduce MemoryCode, a synthetic multi-session dataset designed to test LLMs' ability to track and execute simple coding instructions… ▽ More

    Submitted 6 June, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Published as conference paper at ACL 2025

  9. arXiv:2502.01436  [pdf, other

    cs.CL cs.AI

    Towards Safer Chatbots: A Framework for Policy Compliance Evaluation of Custom GPTs

    Authors: David Rodriguez, William Seymour, Jose M. Del Alamo, Jose Such

    Abstract: Large Language Models (LLMs) have gained unprecedented prominence, achieving widespread adoption across diverse domains and integrating deeply into society. The capability to fine-tune general-purpose LLMs, such as Generative Pre-trained Transformers (GPT), for specific tasks has facilitated the emergence of numerous Custom GPTs. These tailored models are increasingly made available through dedica… ▽ More

    Submitted 14 April, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    ACM Class: I.2.1; I.2.7

  10. Real-Time Brain Tumor Detection in Intraoperative Ultrasound Using YOLO11: From Model Training to Deployment in the Operating Room

    Authors: Santiago Cepeda, Olga Esteban-Sinovas, Roberto Romero, Vikas Singh, Prakash Shetty, Aliasgar Moiyadi, Ilyess Zemmoura, Giuseppe Roberto Giammalva, Massimiliano Del Bene, Arianna Barbotti, Francesco DiMeco, Timothy R. West, Brian V. Nahed, Ignacio Arrese, Roberto Hornero, Rosario Sarabia

    Abstract: Intraoperative ultrasound (ioUS) is a valuable tool in brain tumor surgery due to its versatility, affordability, and seamless integration into the surgical workflow. However, its adoption remains limited, primarily because of the challenges associated with image interpretation and the steep learning curve required for effective use. This study aimed to enhance the interpretability of ioUS images… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

    Journal ref: Computers in Biology and Medicine, Vol. 170, 110481, 2025

  11. arXiv:2501.14249  [pdf, other

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  12. arXiv:2501.10822  [pdf, other

    cs.LG cs.AI

    Addressing Multilabel Imbalance with an Efficiency-Focused Approach Using Diffusion Model-Generated Synthetic Samples

    Authors: Francisco Charte, Miguel Ángel Dávila, María Dolores Pérez-Godoy, María José del Jesus

    Abstract: Predictive models trained on imbalanced data tend to produce biased results. This problem is exacerbated when there is not just one output label, but a set of them. This is the case for multilabel learning (MLL) algorithms used to classify patterns, rank labels, or learn the distribution of outputs. Many solutions have been proposed in the literature. The one that can be applied universally, indep… ▽ More

    Submitted 18 January, 2025; originally announced January 2025.

    Comments: 22 pages, 8 figures, 10 tables

  13. arXiv:2412.12108  [pdf

    cs.CY cs.AI

    Responsible AI Governance: A Response to UN Interim Report on Governing AI for Humanity

    Authors: Sarah Kiden, Bernd Stahl, Beverley Townsend, Carsten Maple, Charles Vincent, Fraser Sampson, Geoff Gilbert, Helen Smith, Jayati Deshmukh, Jen Ross, Jennifer Williams, Jesus Martinez del Rincon, Justyna Lisinska, Karen O'Shea, Márjory Da Costa Abreu, Nelly Bencomo, Oishi Deb, Peter Winter, Phoebe Li, Philip Torr, Pin Lean Lau, Raquel Iniesta, Gopal Ramchurn, Sebastian Stein, Vahid Yazdanpanah

    Abstract: This report presents a comprehensive response to the United Nation's Interim Report on Governing Artificial Intelligence (AI) for Humanity. It emphasizes the transformative potential of AI in achieving the Sustainable Development Goals (SDGs) while acknowledging the need for robust governance to mitigate associated risks. The response highlights opportunities for promoting equitable, secure, and i… ▽ More

    Submitted 31 December, 2024; v1 submitted 29 November, 2024; originally announced December 2024.

    Comments: Submitted to United Nations. 23 pages. All the Authors Contributed Equally

  14. arXiv:2410.00453  [pdf, other

    cs.NI cs.CY cs.SI

    The NetMob2024 Dataset: Population Density and OD Matrices from Four LMIC Countries

    Authors: Wenlan Zhang, Miguel Nunez del Prado, Vincent Gauthier, Sveta Milusheva

    Abstract: The NetMob24 dataset offers a unique opportunity for researchers from a range of academic fields to access comprehensive spatiotemporal data sets spanning four countries (India, Mexico, Indonesia, and Colombia) over the course of two years (2019 and 2020). This dataset, developed in collaboration with Cuebiq (Also referred to as Spectus), comprises privacy-preserving aggregated data sets derived f… ▽ More

    Submitted 2 October, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

  15. Quantification of stylistic differences in human- and ASR-produced transcripts of African American English

    Authors: Annika Heuser, Tyler Kendall, Miguel del Rio, Quinten McNamara, Nishchal Bhandari, Corey Miller, Migüel Jetté

    Abstract: Common measures of accuracy used to assess the performance of automatic speech recognition (ASR) systems, as well as human transcribers, conflate multiple sources of error. Stylistic differences, such as verbatim vs non-verbatim, can play a significant role in ASR performance evaluation when differences exist between training and test datasets. The problem is compounded for speech from underrepres… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Published in Interspeech 2024 Proceedings, 5 pages excluding references, 5 figures

  16. arXiv:2409.02792  [pdf, other

    cs.LG cs.CV

    UnLearning from Experience to Avoid Spurious Correlations

    Authors: Jeff Mitchell, Jesús Martínez del Rincón, Niall McLaughlin

    Abstract: While deep neural networks can achieve state-of-the-art performance in many tasks, these models are more fragile than they appear. They are prone to learning spurious correlations in their training data, leading to surprising failure cases. In this paper, we propose a new approach that addresses the issue of spurious correlations: UnLearning from Experience (ULE). Our method is based on using two… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: 10 pages

  17. arXiv:2408.14340  [pdf, other

    cs.SD cs.AI cs.CL cs.LG eess.AS

    Foundation Models for Music: A Survey

    Authors: Yinghao Ma, Anders Øland, Anton Ragni, Bleiz MacSen Del Sette, Charalampos Saitis, Chris Donahue, Chenghua Lin, Christos Plachouras, Emmanouil Benetos, Elona Shatri, Fabio Morreale, Ge Zhang, György Fazekas, Gus Xia, Huan Zhang, Ilaria Manco, Jiawen Huang, Julien Guinot, Liwei Lin, Luca Marinelli, Max W. Y. Lam, Megha Sharma, Qiuqiang Kong, Roger B. Dannenberg, Ruibin Yuan , et al. (17 additional authors not shown)

    Abstract: In recent years, foundation models (FMs) such as large language models (LLMs) and latent diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This comprehensive review examines state-of-the-art (SOTA) pre-trained models and foundation models in music, spanning from representation learning, generative learning and multimodal learning. We first contextualise the signifi… ▽ More

    Submitted 3 September, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

  18. arXiv:2406.05577  [pdf, other

    cs.DC cs.MS

    Flexible Multi-Dimensional FFTs for Plane Wave Density Functional Theory Codes

    Authors: Doru Thom Popovici, Mauro del Ben, Osni Marques, Andrew Canning

    Abstract: Multi-dimensional Fourier transforms are key mathematical building blocks that appear in a wide range of applications from materials science, physics, chemistry and even machine learning. Over the past years, a multitude of software packages targeting distributed multi-dimensional Fourier transforms have been developed. Most variants attempt to offer efficient implementations for single transforms… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 17 pages, 9 figures

    MSC Class: 68W15 ACM Class: G.4

  19. arXiv:2405.20900  [pdf, other

    cs.CL cs.CY

    Large Language Models: A New Approach for Privacy Policy Analysis at Scale

    Authors: David Rodriguez, Ian Yang, Jose M. Del Alamo, Norman Sadeh

    Abstract: The number and dynamic nature of web and mobile applications presents significant challenges for assessing their compliance with data protection laws. In this context, symbolic and statistical Natural Language Processing (NLP) techniques have been employed for the automated analysis of these systems' privacy policies. However, these techniques typically require labor-intensive and potentially erro… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  20. arXiv:2404.08399  [pdf, other

    cs.CV

    Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT

    Authors: Miguel Ortiz del Castillo, Jonathan Morgan, Jack McRobbie, Clint Therakam, Zaher Joukhadar, Robert Mearns, Simon Barraclough, Richard Sinnott, Andrew Woods, Chris Bayliss, Kris Ehinger, Ben Rubinstein, James Bailey, Airlie Chapman, Michele Trenti

    Abstract: Artificial intelligence (AI) and autonomous edge computing in space are emerging areas of interest to augment capabilities of nanosatellites, where modern sensors generate orders of magnitude more data than can typically be transmitted to mission control. Here, we present the hardware and software design of an onboard AI subsystem hosted on SpIRIT. The system is optimised for on-board computer vis… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: AI4Space 2024, 3rd Workshop on AI for Space, CVPR 2024

  21. arXiv:2404.03743  [pdf, other

    cs.CV

    Test Time Training for Industrial Anomaly Segmentation

    Authors: Alex Costanzino, Pierluigi Zama Ramirez, Mirko Del Moro, Agostino Aiezzo, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano

    Abstract: Anomaly Detection and Segmentation (AD&S) is crucial for industrial quality control. While existing methods excel in generating anomaly scores for each pixel, practical applications require producing a binary segmentation to identify anomalies. Due to the absence of labeled anomalies in many real scenarios, standard practices binarize these maps based on some statistics derived from a validation s… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted at VAND 2.0, CVPRW 2024

  22. arXiv:2403.08131  [pdf, other

    cs.DC cs.LG

    Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality

    Authors: Adrian Perez Dieguez, Min Choi, Mahmut Okyay, Mauro Del Ben, Bryan M. Wong, Khaled Z. Ibrahim

    Abstract: Tuning searches are pivotal in High-Performance Computing (HPC), addressing complex optimization challenges in computational applications. The complexity arises not only from finely tuning parameters within routines but also potential interdependencies among them, rendering traditional optimization methods inefficient. Instead of scrutinizing interdependencies among parameters and routines, practi… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  23. arXiv:2403.07718  [pdf, other

    cs.LG cs.AI

    WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

    Authors: Alexandre Drouin, Maxime Gasse, Massimo Caccia, Issam H. Laradji, Manuel Del Verme, Tom Marty, Léo Boisvert, Megh Thakkar, Quentin Cappart, David Vazquez, Nicolas Chapados, Alexandre Lacoste

    Abstract: We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on measuring the agents' ability to perform tasks that span the typical daily work of knowledge workers utilizing enterprise software systems. To this end, we propose WorkArena, a remote-hosted benchmark of 33 tasks based on the widely-used ServiceNow platform. We also… ▽ More

    Submitted 23 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 21 pages, 11 figures, preprint

  24. arXiv:2403.06915  [pdf, other

    cs.NI

    Monitoring the Venice Lagoon: an IoT Cloud-Based Sensor Nerwork Approach

    Authors: Filippo Campagnaro, Matin Ghalkhani, Riccardo Tumiati, Federico Marin, Matteo Del Grande, Alessandro Pozzebon, Davide De Battisti, Roberto Francescon, Michele Zorzi

    Abstract: Monitoring the coastal area of the Venice Lagoon is of significant importance. While the impact of global warming is felt worldwide, coastal and littoral regions bear the brunt more prominently. These areas not only face the threat of rising sea levels but also contend with the escalating occurrence of seaquakes and floods. Additionally, the intricate ecosystems of rivers, seas, and lakes undergo… ▽ More

    Submitted 29 April, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 12 pages

  25. arXiv:2403.05493  [pdf, other

    cs.CL

    To Err Is Human, but Llamas Can Learn It Too

    Authors: Agnes Luhtaru, Taido Purason, Martin Vainikko, Maksym Del, Mark Fishel

    Abstract: This study explores enhancing grammatical error correction (GEC) through artificial error generation (AEG) using language models (LMs). Specifically, we fine-tune Llama 2-based LMs for error generation and find that this approach yields synthetic errors akin to human errors. Next, we train GEC Llama models with the help of these artificial errors and outperform previous state-of-the-art error corr… ▽ More

    Submitted 4 October, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  26. Efficient anytime algorithms to solve the bi-objective Next Release Problem

    Authors: Miguel Ángel Domínguez-Ríos, Francisco Chicano, Enrique Alba, Isabel María del Águila, José del Sagrado

    Abstract: The Next Release Problem consists in selecting a subset of requirements to develop in the next release of a software product. The selection should be done in a way that maximizes the satisfaction of the stakeholders while the development cost is minimized and the constraints of the requirements are fulfilled. Recent works have solved the problem using exact methods based on Integer Linear Programm… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Journal ref: J. Sys. Soft. 156: 217-231 (2019)

  27. Stability prediction of the software requirements specification

    Authors: J. del Sagrado, I. M. del Águila

    Abstract: Complex decision-making is a prominent aspect of Requirements Engineering. This work presents the Bayesian network Requisites that predicts whether the requirements specification documents have to be revised. We show how to validate Requisites by means of metrics obtained from a large complex software project. Besides, this Bayesian network has been integrated into a software tool by defining a co… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Journal ref: Software Qual J 26 (2018) 585-605

  28. Assisted Requirements Selection by Clustering

    Authors: José del Sagrado, Isabel M del Águila

    Abstract: Requirements selection is a decision-making process that enables project managers to focus on the deliverables that add most value to the project outcome. This task is performed to define which features or requirements will be developed in the next release. It is a complex multi-criteria decision process that has been focused by many research works because a balance between business profits and in… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Journal ref: Requirements Engineering 26 (2021) 167-184

  29. arXiv:2312.14812  [pdf, other

    cs.CV cs.LG

    PARDINUS: Weakly supervised discarding of photo-trapping empty images based on autoencoders

    Authors: David de la Rosa, Antonio J Rivera, María J del Jesus, Francisco Charte

    Abstract: Photo-trapping cameras are widely employed for wildlife monitoring. Those cameras take photographs when motion is detected to capture images where animals appear. A significant portion of these images are empty - no wildlife appears in the image. Filtering out those images is not a trivial task since it requires hours of manual work from biologists. Therefore, there is a notable interest in automa… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  30. arXiv:2311.10837  [pdf, other

    cs.SI physics.soc-ph

    Evaluating the Relationship Between News Source Sharing and Political Beliefs

    Authors: Sofía M del Pozo, Sebastián Pinto, Matteo Serafino, Federico Moss, Tomás Cicchini, Hernán A Makse, Pablo Balenzuela

    Abstract: In an era marked by an abundance of news sources, access to information significantly influences public opinion. Notably, the bias of news sources often serves as an indicator of individuals' political leanings. This study explores this hypothesis by examining the news sharing behavior of politically active social media users, whose political ideologies were identified in a previous study. Using c… ▽ More

    Submitted 15 October, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  31. arXiv:2310.08701  [pdf, other

    cs.SI physics.soc-ph

    Analyzing User Ideologies and Shared News During the 2019 Argentinian Elections

    Authors: Sofía M del Pozo, Sebastián Pinto, Matteo Serafino, Lucio Garcia, Hernán A Makse, Pablo Balenzuela

    Abstract: The extensive data generated on social media platforms allow us to gain insights over trending topics and public opinions. Additionally, it offers a window into user behavior, including their content engagement and news sharing habits. In this study, we analyze the relationship between users' political ideologies and the news they share during Argentina's 2019 election period. Our findings reveal… ▽ More

    Submitted 25 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  32. arXiv:2309.14284  [pdf, other

    math.OC cs.RO

    Navigation with shadow prices to optimize multi-commodity flow rates

    Authors: Ignacio Boero, Igor Spasojevic, Mariana del Castillo, George Pappas, Vijay Kumar, Alejandro Ribeiro

    Abstract: We propose a method for providing communication network infrastructure in autonomous multi-agent teams. In particular, we consider a set of communication agents that are placed alongside regular agents from the system in order to improve the rate of information transfer between the latter. In order to find the optimal positions to place such agents, we define a flexible performance function that a… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: (c) 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  33. Saving temporary exhibitions in virtual environments: the Digital Renaissance of Ulisse Aldrovandi -- acquisition and digitisation of cultural heritage objects

    Authors: Roberto Balzani, Sebastian Barzaghi, Gabriele Bitelli, Federica Bonifazi, Alice Bordignon, Luca Cipriani, Simona Colitti, Federica Collina, Marilena Daquino, Francesca Fabbri, Bruno Fanini, Filippo Fantini, Daniele Ferdani, Giulia Fiorini, Elena Formia, Anna Forte, Federica Giacomini, Valentina Alena Girelli, Bianca Gualandi, Ivan Heibi, Alessandro Iannucci, Rachele Manganelli Del Fà, Arcangelo Massari, Arianna Moretti, Silvio Peroni , et al. (8 additional authors not shown)

    Abstract: As per the objectives of Project CHANGES, particularly its thematic sub-project on the use of virtual technologies for museums and art collections, our goal was to obtain a digital twin of the temporary exhibition on Ulisse Aldrovandi called "The Other Renaissance", and make it accessible to users online. After a preliminary study of the exhibition, focussing on acquisition constraints and related… ▽ More

    Submitted 27 December, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  34. arXiv:2308.03905  [pdf, other

    cs.CL cs.AI cs.LG

    Intelligent Assistant Language Understanding On Device

    Authors: Cecilia Aas, Hisham Abdelsalam, Irina Belousova, Shruti Bhargava, Jianpeng Cheng, Robert Daland, Joris Driesen, Federico Flego, Tristan Guigue, Anders Johannsen, Partha Lal, Jiarui Lu, Joel Ruben Antony Moniz, Nathan Perkins, Dhivya Piraviperumal, Stephen Pulman, Diarmuid Ó Séaghdha, David Q. Sun, John Torr, Marco Del Vecchio, Jay Wacker, Jason D. Williams, Hong Yu

    Abstract: It has recently become feasible to run personal digital assistants on phones and other personal devices. In this paper we describe a design for a natural language understanding system that runs on device. In comparison to a server-based assistant, this system is more private, more reliable, faster, more expressive, and more accurate. We describe what led to key choices about architecture and techn… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  35. Are Large Language Models a Threat to Digital Public Goods? Evidence from Activity on Stack Overflow

    Authors: Maria del Rio-Chanona, Nadzeya Laurentsyeva, Johannes Wachs

    Abstract: Large language models like ChatGPT efficiently provide users with information about various topics, presenting a potential substitute for searching the web and asking people for help online. But since users interact privately with the model, these models may drastically reduce the amount of publicly available human-generated data and knowledge resources. This substitution can present a significant… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  36. arXiv:2306.09247  [pdf, other

    cs.CR cs.AI cs.LG

    ATLAS: Automatically Detecting Discrepancies Between Privacy Policies and Privacy Labels

    Authors: Akshath Jain, David Rodriguez, Jose M. del Alamo, Norman Sadeh

    Abstract: Privacy policies are long, complex documents that end-users seldom read. Privacy labels aim to ameliorate these issues by providing succinct summaries of salient data practices. In December 2020, Apple began requiring that app developers submit privacy labels describing their apps' data practices. Yet, research suggests that app developers often struggle to do so. In this paper, we automatically i… ▽ More

    Submitted 24 May, 2023; originally announced June 2023.

    Comments: 14 pages, 13 figures

  37. arXiv:2306.08737  [pdf, other

    cs.RO cs.IT math.OC

    A Networked Multi-Agent System for Mobile Wireless Infrastructure on Demand

    Authors: Miguel Calvo-Fullana, Mikhail Gerasimenko, Daniel Mox, Leopoldo Agorio, Mariana del Castillo, Vijay Kumar, Alejandro Ribeiro, Juan Andres Bazerque

    Abstract: Despite the prevalence of wireless connectivity in urban areas around the globe, there remain numerous and diverse situations where connectivity is insufficient or unavailable. To address this, we introduce mobile wireless infrastructure on demand, a system of UAVs that can be rapidly deployed to establish an ad-hoc wireless network. This network has the capability of reconfiguring itself dynamica… ▽ More

    Submitted 16 September, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

  38. mldr.resampling: Efficient Reference Implementations of Multilabel Resampling Algorithms

    Authors: Antonio J. Rivera, Miguel A. Dávila, David Elizondo, María J. del Jesus, Francisco Charte

    Abstract: Resampling algorithms are a useful approach to deal with imbalanced learning in multilabel scenarios. These methods have to deal with singularities in the multilabel data, such as the occurrence of frequent and infrequent labels in the same instance. Implementations of these methods are sometimes limited to the pseudocode provided by their authors in a paper. This Original Software Publication pre… ▽ More

    Submitted 30 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  39. arXiv:2305.13402  [pdf, other

    cs.DS cs.LG stat.ML

    Error-Tolerant Exact Query Learning of Finite Set Partitions with Same-Cluster Oracle

    Authors: Adela Frances DePavia, Olga Medrano Martín del Campo, Erasmo Tani

    Abstract: This paper initiates the study of active learning for exact recovery of partitions exclusively through access to a same-cluster oracle in the presence of bounded adversarial error. We first highlight a novel connection between learning partitions and correlation clustering. Then we use this connection to build a Rényi-Ulam style analytical framework for this problem, and prove upper and lower boun… ▽ More

    Submitted 16 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 28 pages, 2 figures

  40. An Estimation of Distribution Algorithm based on interactions between requirements to solve the bi-objective Next Release Problem

    Authors: Jose del Sagrado, Jose Antonio Sierra Ibanez, Isabel M. del Aguila

    Abstract: Selecting the appropriate requirements to develop in the next release of an open market software product under evolution, is a compulsory step of each software development project. This selection should be done by maximizing stakeholders' satisfaction and minimizing development costs, while keeping constraints. In this work we investigate what is the requirements interactions impact when searching… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 34 pages, 8 Figures, 6 tables. Preprint submitted to Journal of Systems and Software

    Journal ref: The Journal of Systems & Software (2023)

  41. arXiv:2301.06047  [pdf, other

    cs.NE cs.AI cs.LG

    EvoAAA: An evolutionary methodology for automated \neural autoencoder architecture search

    Authors: Francisco Charte, Antonio J. Rivera, Francisco Martínez, María J. del Jesus

    Abstract: Machine learning models work better when curated features are provided to them. Feature engineering methods have been usually used as a preprocessing step to obtain or build a proper feature set. In late years, autoencoders (a specific type of symmetrical neural network) have been widely used to perform representation learning, proving their competitiveness against classical feature engineering al… ▽ More

    Submitted 15 January, 2023; originally announced January 2023.

    Comments: Paper submited to Integrated Computer-Aided Engineering

  42. arXiv:2212.10114  [pdf, other

    cs.CL

    True Detective: A Deep Abductive Reasoning Benchmark Undoable for GPT-3 and Challenging for GPT-4

    Authors: Maksym Del, Mark Fishel

    Abstract: Large language models (LLMs) have demonstrated solid zero-shot reasoning capabilities, which is reflected in their performance on the current test tasks. This calls for a more challenging benchmark requiring highly advanced reasoning ability to be solved. In this paper, we introduce such a benchmark, consisting of 191 long-form (1200 words on average) mystery narratives constructed as detective pu… ▽ More

    Submitted 1 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 5 pages, to appear at *SEM

  43. arXiv:2212.06691  [pdf, other

    quant-ph cs.ET cs.LG

    Quantum Clustering with k-Means: a Hybrid Approach

    Authors: Alessandro Poggiali, Alessandro Berti, Anna Bernasconi, Gianna M. Del Corso, Riccardo Guidotti

    Abstract: Quantum computing is a promising paradigm based on quantum theory for performing fast computations. Quantum algorithms are expected to surpass their classical counterparts in terms of computational complexity for certain tasks, including machine learning. In this paper, we design, implement, and evaluate three hybrid quantum k-Means algorithms, exploiting different degree of parallelism. Indeed, e… ▽ More

    Submitted 15 December, 2022; v1 submitted 13 December, 2022; originally announced December 2022.

    Report number: 2212.06691

    Journal ref: Theoretical Computer Science 2024

  44. arXiv:2212.01924  [pdf, other

    cs.CL cs.AI cs.LG

    Cross-lingual Similarity of Multilingual Representations Revisited

    Authors: Maksym Del, Mark Fishel

    Abstract: Related works used indexes like CKA and variants of CCA to measure the similarity of cross-lingual representations in multilingual language models. In this paper, we argue that assumptions of CKA/CCA align poorly with one of the motivating goals of cross-lingual learning analysis, i.e., explaining zero-shot cross-lingual transfer. We highlight what valuable aspects of cross-lingual similarity thes… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: Accepted at AACL 2022

    Journal ref: AACL-IJCNLP 2022 Volume 1 Long Papers

  45. arXiv:2209.00993  [pdf, other

    eess.SP cs.AI cs.HC

    Data Fusion in Neuromarketing: Multimodal Analysis of Biosignals, Lifecycle Stages, Current Advances, Datasets, Trends, and Challenges

    Authors: Mario Quiles Pérez, Enrique Tomás Martínez Beltrán, Sergio López Bernal, Eduardo Horna Prat, Luis Montesano Del Campo, Lorenzo Fernández Maimó, Alberto Huertas Celdrán

    Abstract: The primary goal of any company is to increase its profits by improving both the quality of its products and how they are advertised. In this context, neuromarketing seeks to enhance the promotion of products and generate a greater acceptance on potential buyers. Traditionally, neuromarketing studies have relied on a single biosignal to obtain feedback from presented stimuli. However, thanks to ne… ▽ More

    Submitted 21 August, 2023; v1 submitted 30 August, 2022; originally announced September 2022.

    Comments: 26 pages, 15 figures

  46. arXiv:2208.08865  [pdf, other

    cs.CV astro-ph.IM

    Lessons from a Space Lab -- An Image Acquisition Perspective

    Authors: Leo Pauly, Michele Lynn Jamrozik, Miguel Ortiz Del Castillo, Olivia Borgue, Inder Pal Singh, Mohatashem Reyaz Makhdoomi, Olga-Orsalia Christidi-Loumpasefski, Vincent Gaudilliere, Carol Martinez, Arunkumar Rathinam, Andreas Hein, Miguel Olivares-Mendez, Djamila Aouada

    Abstract: The use of Deep Learning (DL) algorithms has improved the performance of vision-based space applications in recent years. However, generating large amounts of annotated data for training these DL algorithms has proven challenging. While synthetically generated images can be used, the DL models trained on synthetic data are often susceptible to performance degradation, when tested in real-world env… ▽ More

    Submitted 6 December, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

    Journal ref: International Journal of Aerospace Engineering, vol. 2023, Article ID 9944614, 16 pages, 2023

  47. arXiv:2208.07926  [pdf, other

    econ.GN cs.SI

    Mental health concerns prelude the Great Resignation: Evidence from Social Media

    Authors: R. Maria del Rio-Chanona, Alejandro Hermida-Carrillo, Melody Sepahpour-Fard, Luning Sun, Renata Topinkova, Ljubica Nedelkoska

    Abstract: To study the causes of the 2021 Great Resignation, we use text analysis to investigate the changes in work- and quit-related posts between 2018 and 2021 on Reddit. We find that the Reddit discourse evolution resembles the dynamics of the U.S. quit and layoff rates. Furthermore, when the COVID-19 pandemic started, conversations related to working from home, switching jobs, work-related distress, an… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

  48. arXiv:2208.03197  [pdf, other

    cs.CL

    Low-Resource Dense Retrieval for Open-Domain Question Answering: A Comprehensive Survey

    Authors: Xiaoyu Shen, Svitlana Vakulenko, Marco del Tredici, Gianni Barlacchi, Bill Byrne, Adrià de Gispert

    Abstract: Dense retrieval (DR) approaches based on powerful pre-trained language models (PLMs) achieved significant advances and have become a key component for modern open-domain question-answering systems. However, they require large amounts of manual annotations to perform competitively, which is infeasible to scale. To address this, a growing body of research works have recently focused on improving DR… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

  49. arXiv:2204.09495  [pdf

    cs.CR cs.LG

    ROI: A method for identifying organizations receiving personal data

    Authors: David Rodriguez, Jose M. Del Alamo, Miguel Cozar, Boni Garcia

    Abstract: Many studies have exposed the massive collection of personal data in the digital ecosystem through, for instance, websites, mobile apps, or smart devices. This fact goes unnoticed by most users, who are also unaware that the collectors are sharing their personal data with many different organizations around the globe. This paper assesses techniques available in the state of the art to identify the… ▽ More

    Submitted 25 July, 2023; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: 23 pages, 10 figures

    ACM Class: I.7.0; D.4.6

  50. arXiv:2204.03930  [pdf, other

    cs.CL

    From Rewriting to Remembering: Common Ground for Conversational QA Models

    Authors: Marco Del Tredici, Xiaoyu Shen, Gianni Barlacchi, Bill Byrne, Adrià de Gispert

    Abstract: In conversational QA, models have to leverage information in previous turns to answer upcoming questions. Current approaches, such as Question Rewriting, struggle to extract relevant information as the conversation unwinds. We introduce the Common Ground (CG), an approach to accumulate conversational information as it emerges and select the relevant information at every turn. We show that CG offer… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted at NLP for ConvAI