Skip to main content

Showing 1–50 of 1,790 results for author: De, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.02839  [pdf, ps, other

    cs.IR cs.AI

    DeepShop: A Benchmark for Deep Research Shopping Agents

    Authors: Yougang Lyu, Xiaoyu Zhang, Lingyong Yan, Maarten de Rijke, Zhaochun Ren, Xiuying Chen

    Abstract: Web agents for online shopping have shown great promise in automating user interactions across e-commerce platforms. Benchmarks for assessing such agents do not reflect the complexity of real-world shopping scenarios, as they often consist of overly simple queries with deterministic paths, such as "Find iPhone 15." Real shopping scenarios are inherently more layered, involving multi-dimensional pr… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2506.01450  [pdf, ps, other

    cs.LG cs.AI

    ShaTS: A Shapley-based Explainability Method for Time Series Artificial Intelligence Models applied to Anomaly Detection in Industrial Internet of Things

    Authors: Manuel Franco de la Peña, Ángel Luis Perales Gómez, Lorenzo Fernández Maimó

    Abstract: Industrial Internet of Things environments increasingly rely on advanced Anomaly Detection and explanation techniques to rapidly detect and mitigate cyberincidents, thereby ensuring operational safety. The sequential nature of data collected from these environments has enabled improvements in Anomaly Detection using Machine Learning and Deep Learning models by processing time windows rather than t… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: 22 pages;16 figures;Submitted to Elsevier (Information Fusion)

  3. arXiv:2506.00308  [pdf, ps, other

    cs.CY cs.AI cs.CL cs.HC

    MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform

    Authors: Hayoung Jung, Shravika Mittal, Ananya Aatreya, Navreet Kaur, Munmun De Choudhury, Tanushree Mitra

    Abstract: Understanding the prevalence of misinformation in health topics online can inform public health policies and interventions. However, measuring such misinformation at scale remains a challenge, particularly for high-stakes but understudied topics like opioid-use disorder (OUD)--a leading cause of death in the U.S. We present the first large-scale study of OUD-related myths on YouTube, a widely-used… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

    Comments: 34 pages, 14 figures, 21 tables. In submission

  4. arXiv:2506.00279  [pdf, ps, other

    cs.AI cs.LG

    Sleep Brain and Cardiac Activity Predict Cognitive Flexibility and Conceptual Reasoning Using Deep Learning

    Authors: Boshra Khajehpiri, Eric Granger, Massimiliano de Zambotti, Fiona C. Baker, Mohamad Forouzanfar

    Abstract: Despite extensive research on the relationship between sleep and cognition, the connection between sleep microstructure and human performance across specific cognitive domains remains underexplored. This study investigates whether deep learning models can predict executive functions, particularly cognitive adaptability and conceptual reasoning from physiological processes during a night's sleep. T… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

    Comments: This work was accepted for publication in IEEE EMBC 2025

  5. arXiv:2505.24619  [pdf, ps, other

    cs.CL cs.LG

    Interpretable phenotyping of Heart Failure patients with Dutch discharge letters

    Authors: Vittorio Torri, Machteld J. Boonstra, Marielle C. van de Veerdonk, Deborah N. Kalkman, Alicia Uijl, Francesca Ieva, Ameen Abu-Hanna, Folkert W. Asselbergs, Iacer Calixto

    Abstract: Objective: Heart failure (HF) patients present with diverse phenotypes affecting treatment and prognosis. This study evaluates models for phenotyping HF patients based on left ventricular ejection fraction (LVEF) classes, using structured and unstructured data, assessing performance and interpretability. Materials and Methods: The study analyzes all HF hospitalizations at both Amsterdam UMC hosp… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: 43 pages, 8 figures

    MSC Class: 68T50 ACM Class: I.2.7; J.3

  6. arXiv:2505.24451  [pdf, other

    cs.CR cs.AI

    LPASS: Linear Probes as Stepping Stones for vulnerability detection using compressed LLMs

    Authors: Luis Ibanez-Lissen, Lorena Gonzalez-Manzano, Jose Maria de Fuentes, Nicolas Anciaux

    Abstract: Large Language Models (LLMs) are being extensively used for cybersecurity purposes. One of them is the detection of vulnerable codes. For the sake of efficiency and effectiveness, compression and fine-tuning techniques are being developed, respectively. However, they involve spending substantial computational efforts. In this vein, we analyse how Linear Probes (LPs) can be used to provide an estim… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  7. arXiv:2505.24279  [pdf, ps, other

    cs.IR

    On the Scaling of Robustness and Effectiveness in Dense Retrieval

    Authors: Yu-An Liu, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng

    Abstract: Robustness and Effectiveness are critical aspects of developing dense retrieval models for real-world applications. It is known that there is a trade-off between the two. Recent work has addressed scaling laws of effectiveness in dense retrieval, revealing a power-law relationship between effectiveness and the size of models and data. Does robustness follow scaling laws too? If so, can scaling imp… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  8. arXiv:2505.24216  [pdf, ps, other

    cs.CV

    Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain Adaptation

    Authors: Prasanna Reddy Pulakurthi, Majid Rabbani, Jamison Heard, Sohail Dianat, Celso M. de Melo, Raghuveer Rao

    Abstract: This work investigates Source-Free Domain Adaptation (SFDA), where a model adapts to a target domain without access to source data. A new augmentation technique, Shuffle PatchMix (SPM), and a novel reweighting strategy are introduced to enhance performance. SPM shuffles and blends image patches to generate diverse and challenging augmentations, while the reweighting strategy prioritizes reliable p… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: 6 pages, 3 figures, 5 tables, Accepted to IEEE ICIP 2025

  9. arXiv:2505.22941  [pdf, ps, other

    math.CO cs.DM

    The only Class 0 Flower snark is the smallest

    Authors: Guilherme Adamatti Bridi, André Luis Alves Martins, Franklin de Lima Marquezino, Celina Miraglia Herrera de Figueiredo

    Abstract: Graph pebbling is a game played on graphs with pebbles on their vertices. A pebbling move removes two pebbles from one vertex and places one pebble on an adjacent vertex. The pebbling number is the smallest $t$ so that from any initial configuration of $t$ pebbles it is possible, after a sequence of pebbling moves, to place a pebble on any given target vertex. Graphs whose pebbling number is equal… ▽ More

    Submitted 2 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: 8 pages, 5 figures. Corrected typos, and added final remarks. Supplementary software available at https://github.com/gabridi/pebbling_unsolvability

  10. arXiv:2505.22848  [pdf, ps, other

    cs.CL

    LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language Inference

    Authors: Pingjun Hong, Beiduo Chen, Siyao Peng, Marie-Catherine de Marneffe, Barbara Plank

    Abstract: There is increasing evidence of Human Label Variation (HLV) in Natural Language Inference (NLI), where annotators assign different labels to the same premise-hypothesis pair. However, within-label variation--cases where annotators agree on the same label but provide divergent reasoning--poses an additional and mostly overlooked challenge. Several NLI datasets contain highlighted words in the NLI i… ▽ More

    Submitted 3 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: 21 pages, 6 figures

  11. arXiv:2505.21550  [pdf, ps, other

    cs.NI cs.AI cs.MA

    Collaborative Agentic AI Needs Interoperability Across Ecosystems

    Authors: Rishi Sharma, Martijn de Vos, Pradyumna Chari, Ramesh Raskar, Anne-Marie Kermarrec

    Abstract: Collaborative agentic AI is projected to transform entire industries by enabling AI-powered agents to autonomously perceive, plan, and act within digital environments. Yet, current solutions in this field are all built in isolation, and we are rapidly heading toward a landscape of fragmented, incompatible ecosystems. In this position paper, we argue that interoperability, achieved by the adoption… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  12. arXiv:2505.21507  [pdf

    q-bio.NC cs.LG eess.SP

    Automatic detection of abnormal clinical EEG: comparison of a finetuned foundation model with two deep learning models

    Authors: Aurore Bussalb, François Le Gac, Guillaume Jubien, Mohamed Rahmouni, Ruggero G. Bettinardi, Pedro Marinho R. de Oliveira, Phillipe Derambure, Nicolas Gaspard, Jacques Jonas, Louis Maillard, Laurent Vercueil, Hervé Vespignani, Philippe Laval, Laurent Koessler, Ulysse Gimenez

    Abstract: Electroencephalography (EEG) is commonly used by physicians for the diagnosis of numerous neurological disorders. Due to the large volume of EEGs requiring interpretation and the specific expertise involved, artificial intelligence-based tools are being developed to assist in their visual analysis. In this paper, we compare two deep learning models (CNN-LSTM and Transformer-based) with BioSerenity… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 20 pages, 7 figures

  13. arXiv:2505.20688  [pdf, ps, other

    stat.ML cs.CV cs.LG stat.ME

    A False Discovery Rate Control Method Using a Fully Connected Hidden Markov Random Field for Neuroimaging Data

    Authors: Taehyo Kim, Qiran Jia, Mony J. de Leon, Hai Shu

    Abstract: False discovery rate (FDR) control methods are essential for voxel-wise multiple testing in neuroimaging data analysis, where hundreds of thousands or even millions of tests are conducted to detect brain regions associated with disease-related changes. Classical FDR control methods (e.g., BH, q-value, and LocalFDR) assume independence among tests and often lead to high false non-discovery rates (F… ▽ More

    Submitted 29 May, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

  14. arXiv:2505.20201  [pdf, other

    cs.CL

    Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations

    Authors: Mohit Chandra, Siddharth Sriraman, Harneet Singh Khanuja, Yiqiao Jin, Munmun De Choudhury

    Abstract: Limited access to mental healthcare, extended wait times, and increasing capabilities of Large Language Models (LLMs) has led individuals to turn to LLMs for fulfilling their mental health needs. However, examining the multi-turn mental health conversation capabilities of LLMs remains under-explored. Existing evaluation frameworks typically focus on diagnostic accuracy and win-rates and often over… ▽ More

    Submitted 28 May, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: 34 pages, 5 figures, 30 tables

  15. arXiv:2505.20128  [pdf, other

    cs.CL

    Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers

    Authors: Zhengliang Shi, Lingyong Yan, Dawei Yin, Suzan Verberne, Maarten de Rijke, Zhaochun Ren

    Abstract: Large language models (LLMs) have been widely integrated into information retrieval to advance traditional techniques. However, effectively enabling LLMs to seek accurate knowledge in complex tasks remains a challenge due to the complexity of multi-hop queries as well as the irrelevant retrieved content. To address these limitations, we propose EXSEARCH, an agentic search framework, where the LLM… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Working in process

  16. arXiv:2505.19356  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval

    Authors: Kidist Amde Mekonnen, Yosef Worku Alemneh, Maarten de Rijke

    Abstract: Neural retrieval methods using transformer-based pre-trained language models have advanced multilingual and cross-lingual retrieval. However, their effectiveness for low-resource, morphologically rich languages such as Amharic remains underexplored due to data scarcity and suboptimal tokenization. We address this gap by introducing Amharic-specific dense retrieval models based on pre-trained Amhar… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 10 pages (excluding references and appendix), 10 figures. Accepted to ACL 2025 Findings. Public release includes dataset, code, and trained models: https://github.com/kidist-amde/amharic-ir-benchmarks

    MSC Class: 68T50 (Primary); 68T05 (Secondary) ACM Class: H.3.3; H.3.1; I.2.7

  17. arXiv:2505.18591  [pdf, ps, other

    cs.LG stat.ML

    Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks

    Authors: Joery A. de Vries, Jinke He, Mathijs M. de Weerdt, Matthijs T. J. Spaan

    Abstract: Meta-reinforcement learning trains a single reinforcement learning agent on a distribution of tasks to quickly generalize to new tasks outside of the training set at test time. From a Bayesian perspective, one can interpret this as performing amortized variational inference on the posterior distribution over training tasks. Among the various meta-reinforcement learning approaches, a common method… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  18. arXiv:2505.18583  [pdf, ps, other

    cs.IR

    The Silent Saboteur: Imperceptible Adversarial Attacks against Black-Box Retrieval-Augmented Generation Systems

    Authors: Hongru Song, Yu-an Liu, Ruqing Zhang, Jiafeng Guo, Jianming Lv, Maarten de Rijke, Xueqi Cheng

    Abstract: We explore adversarial attacks against retrieval-augmented generation (RAG) systems to identify their vulnerabilities. We focus on generating human-imperceptible adversarial examples and introduce a novel imperceptible retrieve-to-generate attack against RAG. This task aims to find imperceptible perturbations that retrieve a target document, originally excluded from the initial top-$k$ candidate s… ▽ More

    Submitted 28 May, 2025; v1 submitted 24 May, 2025; originally announced May 2025.

    Comments: 18 pages,accepted by ACL25 findings

  19. arXiv:2505.18276  [pdf, other

    stat.ML cs.LG math.NA

    Preconditioned Langevin Dynamics with Score-Based Generative Models for Infinite-Dimensional Linear Bayesian Inverse Problems

    Authors: Lorenzo Baldassari, Josselin Garnier, Knut Solna, Maarten V. de Hoop

    Abstract: Designing algorithms for solving high-dimensional Bayesian inverse problems directly in infinite-dimensional function spaces - where such problems are naturally formulated - is crucial to ensure stability and convergence as the discretization of the underlying problem is refined. In this paper, we contribute to this line of work by analyzing a widely used sampler for linear inverse problems: Lange… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    MSC Class: 62F15; 65N21; 68Q32; 60Hxx; 65C05; 82C31; 28C20; 60G15; 60J60

  20. arXiv:2505.18048  [pdf, ps, other

    cs.CV

    SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios

    Authors: Simon Malzard, Nitish Mital, Richard Walters, Victoria Nockles, Raghuveer Rao, Celso M. De Melo

    Abstract: Computer vision (CV) models for detection, prediction or classification tasks operate on video data-streams that are often degraded in the real world, due to deployment in real-time or on resource-constrained hardware. It is therefore critical that these models are robust to degraded data, but state of the art (SoTA) models are often insufficiently assessed with these real-world constraints in min… ▽ More

    Submitted 27 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

    Comments: 19 pages, 2 images, updated acknowledgements versus previous versions to be compliant with funders

  21. arXiv:2505.17747  [pdf, ps, other

    cs.CL

    Discriminating Form and Meaning in Multilingual Models with Minimal-Pair ABX Tasks

    Authors: Maureen de Seyssel, Jie Chi, Skyler Seto, Maartje ter Hoeve, Masha Fedzechkina, Natalie Schluter

    Abstract: We introduce a set of training-free ABX-style discrimination tasks to evaluate how multilingual language models represent language identity (form) and semantic content (meaning). Inspired from speech processing, these zero-shot tasks measure whether minimal differences in representation can be reliably detected. This offers a flexible and interpretable alternative to probing. Applied to XLM-R (Con… ▽ More

    Submitted 2 June, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  22. arXiv:2505.17051  [pdf, ps, other

    cs.CL cs.AI

    Embedding-to-Prefix: Parameter-Efficient Personalization for Pre-Trained Large Language Models

    Authors: Bernd Huber, Ghazal Fazelnia, Andreas Damianou, Sebastian Peleato, Max Lefarov, Praveen Ravichandran, Marco De Nadai, Mounia Lalmas-Roellke, Paul N. Bennett

    Abstract: Large language models (LLMs) excel at generating contextually relevant content. However, tailoring these outputs to individual users for effective personalization is a significant challenge. While rich user-specific information often exists as pre-existing user representations, such as embeddings learned from preferences or behaviors, current methods to leverage these for LLM personalization typic… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  23. arXiv:2505.16050  [pdf, ps, other

    cs.DM

    A Weight Function Lemma Heuristic for Graph Pebbling

    Authors: G. A. Bridi, F. L. Marquezino, C. M. H. de Figueiredo

    Abstract: Graph pebbling is a problem in which pebbles are distributed across the vertices of a graph and moved according to a specific rule: two pebbles are removed from a vertex to place one on an adjacent vertex. The goal is to determine the minimum number of pebbles required to ensure that any target vertex can be reached, known as the pebbling number. Computing the pebbling number lies beyond NP in the… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 18 pages, 8 figures, 5 tables

  24. arXiv:2505.13565  [pdf, other

    cs.CY cs.AI cs.HC

    Aligning Trustworthy AI with Democracy: A Dual Taxonomy of Opportunities and Risks

    Authors: Oier Mentxaka, Natalia Díaz-Rodríguez, Mark Coeckelbergh, Marcos López de Prado, Emilia Gómez, David Fernández Llorca, Enrique Herrera-Viedma, Francisco Herrera

    Abstract: Artificial Intelligence (AI) poses both significant risks and valuable opportunities for democratic governance. This paper introduces a dual taxonomy to evaluate AI's complex relationship with democracy: the AI Risks to Democracy (AIRD) taxonomy, which identifies how AI can undermine core democratic principles such as autonomy, fairness, and trust; and the AI's Positive Contributions to Democracy… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 26 pages, 5 figures

  25. arXiv:2505.13188  [pdf, ps, other

    cs.LG cs.AI stat.ML

    When a Reinforcement Learning Agent Encounters Unknown Unknowns

    Authors: Juntian Zhu, Miguel de Carvalho, Zhouwang Yang, Fengxiang He

    Abstract: An AI agent might surprisingly find she has reached an unknown state which she has never been aware of -- an unknown unknown. We mathematically ground this scenario in reinforcement learning: an agent, after taking an action calculated from value functions $Q$ and $V$ defined on the {\it {aware domain}}, reaches a state out of the domain. To enable the agent to handle this scenario, we propose an… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  26. arXiv:2505.12106  [pdf, ps, other

    cs.CR

    MalVis: A Large-Scale Image-Based Framework and Dataset for Advancing Android Malware Classification

    Authors: Saleh J. Makkawy, Michael J. De Lucia, Kenneth E. Barner

    Abstract: As technology advances, Android malware continues to pose significant threats to devices and sensitive data. The open-source nature of the Android OS and the availability of its SDK contribute to this rapid growth. Traditional malware detection techniques, such as signature-based, static, and dynamic analysis, struggle to detect obfuscated threats that use encryption, packing, or compression. Whil… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  27. The Effects of Demographic Instructions on LLM Personas

    Authors: Angel Felipe Magnossão de Paula, J. Shane Culpepper, Alistair Moffat, Sachin Pathiyan Cherumanal, Falk Scholer, Johanne Trippas

    Abstract: Social media platforms must filter sexist content in compliance with governmental regulations. Current machine learning approaches can reliably detect sexism based on standardized definitions, but often neglect the subjective nature of sexist language and fail to consider individual users' perspectives. To address this gap, we adopt a perspectivist approach, retaining diverse annotations rather th… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Accepted at SIGIR'25, Padua, Italy

  28. arXiv:2505.11640  [pdf, ps, other

    cs.CV

    BandRC: Band Shifted Raised Cosine Activated Implicit Neural Representations

    Authors: Pandula Thennakoon, Avishka Ranasinghe, Mario De Silva, Buwaneka Epakanda, Roshan Godaliyadda, Parakrama Ekanayake, Vijitha Herath

    Abstract: In recent years, implicit neural representations(INRs) have gained popularity in the computer vision community. This is mainly due to the strong performance of INRs in many computer vision tasks. These networks can extract a continuous signal representation given a discrete signal representation. In previous studies, it has been repeatedly shown that INR performance has a strong correlation with t… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Submitted as a conference paper to ICCV 2025

  29. arXiv:2505.10292  [pdf, ps, other

    cs.CV cs.CL

    StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation

    Authors: Daniel A. P. Oliveira, David Martins de Matos

    Abstract: Visual storytelling systems struggle to maintain character identity across frames and link actions to appropriate subjects, frequently leading to referential hallucinations. These issues can be addressed through grounding of characters, objects, and other entities on the visual elements. We propose StoryReasoning, a dataset containing 4,178 stories derived from 52,016 movie images, with both struc… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 31 pages, 14 figures

    ACM Class: I.2.10; I.2.7

  30. arXiv:2505.09619  [pdf, ps, other

    stat.OT cs.AI

    Machine Learning Solutions Integrated in an IoT Healthcare Platform for Heart Failure Risk Stratification

    Authors: Pietro Cassieri, Aiman Faiz, Anna Maria De Roberto, Claudio Pascarelli, Gianvito Mitrano, Gianluca Fimiani, Marina Garofano, Genoveffa Tortora, Mariangela Lazoi, Claudio Passino, Alessia Bramanti, Giuseppe Scanniello

    Abstract: The management of chronic Heart Failure (HF) presents significant challenges in modern healthcare, requiring continuous monitoring, early detection of exacerbations, and personalized treatment strategies. In this paper, we present a predictive model founded on Machine Learning (ML) techniques to identify patients at HF risk. This model is an ensemble learning approach, a modified stacking techniqu… ▽ More

    Submitted 22 May, 2025; v1 submitted 7 April, 2025; originally announced May 2025.

  31. arXiv:2505.08143  [pdf, ps, other

    cs.HC cs.AI

    Communication Styles and Reader Preferences of LLM and Human Experts in Explaining Health Information

    Authors: Jiawei Zhou, Kritika Venkatachalam, Minje Choi, Koustuv Saha, Munmun De Choudhury

    Abstract: With the wide adoption of large language models (LLMs) in information assistance, it is essential to examine their alignment with human communication styles and values. We situate this study within the context of fact-checking health information, given the critical challenge of rectifying conceptions and building trust. Recent studies have explored the potential of LLM for health communication, bu… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  32. arXiv:2505.04796  [pdf, ps, other

    cs.LG

    Robust ML Auditing using Prior Knowledge

    Authors: Jade Garcia Bourrée, Augustin Godinot, Martijn De Vos, Milos Vujasinovic, Sayan Biswas, Gilles Tredan, Erwan Le Merrer, Anne-Marie Kermarrec

    Abstract: Among the many technical challenges to enforcing AI regulations, one crucial yet underexplored problem is the risk of audit manipulation. This manipulation occurs when a platform deliberately alters its answers to a regulator to pass an audit without modifying its answers to other users. In this paper, we introduce a novel approach to manipulation-proof auditing by taking into account the auditor'… ▽ More

    Submitted 22 May, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

    Comments: Accepted to the 42nd International Conference on Machine Learning ICML25

  33. arXiv:2505.03770  [pdf, other

    cs.AI

    Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of Mind

    Authors: Mouad Abrini, Omri Abend, Dina Acklin, Henny Admoni, Gregor Aichinger, Nitay Alon, Zahra Ashktorab, Ashish Atreja, Moises Auron, Alexander Aufreiter, Raghav Awasthi, Soumya Banerjee, Joe M. Barnby, Rhea Basappa, Severin Bergsmann, Djallel Bouneffouf, Patrick Callaghan, Marc Cavazza, Thierry Chaminade, Sonia Chernova, Mohamed Chetouan, Moumita Choudhury, Axel Cleeremans, Jacek B. Cywinski, Fabio Cuzzolin , et al. (83 additional authors not shown)

    Abstract: This volume includes a selection of papers presented at the Workshop on Advancing Artificial Intelligence through Theory of Mind held at AAAI 2025 in Philadelphia US on 3rd March 2025. The purpose of this volume is to provide an open access and curated anthology for the ToM and AI research community.

    Submitted 28 April, 2025; originally announced May 2025.

    Comments: workshop proceedings

  34. arXiv:2505.03590  [pdf, other

    stat.ML cs.LG eess.SP q-bio.QM

    Physics-Informed Sylvester Normalizing Flows for Bayesian Inference in Magnetic Resonance Spectroscopy

    Authors: Julian P. Merkofer, Dennis M. J. van de Sande, Alex A. Bhogal, Ruud J. G. van Sloun

    Abstract: Magnetic resonance spectroscopy (MRS) is a non-invasive technique to measure the metabolic composition of tissues, offering valuable insights into neurological disorders, tumor detection, and other metabolic dysfunctions. However, accurate metabolite quantification is hindered by challenges such as spectral overlap, low signal-to-noise ratio, and various artifacts. Traditional methods like linear-… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Preprint submitted to IEEE MLSP 2025

  35. arXiv:2505.03075  [pdf, other

    cs.IR

    Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models

    Authors: Zhengliang Shi, Lingyong Yan, Weiwei Sun, Yue Feng, Pengjie Ren, Xinyu Ma, Shuaiqiang Wang, Dawei Yin, Maarten de Rijke, Zhaochun Ren

    Abstract: Retrieval-augmented generation (RAG) integrates large language models ( LLM s) with retrievers to access external knowledge, improving the factuality of LLM generation in knowledge-grounded tasks. To optimize the RAG performance, most previous work independently fine-tunes the retriever to adapt to frozen LLM s or trains the LLMs to use documents retrieved by off-the-shelf retrievers, lacking end-… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  36. arXiv:2505.02519  [pdf

    cs.CY

    Deaf in AI: AI language technologies and the erosion of linguistic rights

    Authors: Maartje De Meulder

    Abstract: This paper explores the interplay of AI language technologies, sign language interpreting, and linguistic access, highlighting the complex interdependencies shaping access frameworks and the tradeoffs these technologies bring. While AI tools promise innovation, they also perpetuate biases, reinforce technoableism, and deepen inequalities through systemic and design flaws. The historical and contem… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  37. arXiv:2505.02501  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Corr2Distrib: Making Ambiguous Correspondences an Ally to Predict Reliable 6D Pose Distributions

    Authors: Asma Brazi, Boris Meden, Fabrice Mayran de Chamisso, Steve Bourgeois, Vincent Lepetit

    Abstract: We introduce Corr2Distrib, the first correspondence-based method which estimates a 6D camera pose distribution from an RGB image, explaining the observations. Indeed, symmetries and occlusions introduce visual ambiguities, leading to multiple valid poses. While a few recent methods tackle this problem, they do not rely on local correspondences which, according to the BOP Challenge, are currently t… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 8 pages, 5 figures

  38. arXiv:2505.00891  [pdf

    quant-ph cs.ET

    Quantum Computing in Industrial Environments: Where Do We Stand and Where Are We Headed?

    Authors: Eneko Osaba, Iñigo Perez Delgado, Alejandro Mata Ali, Pablo Miranda-Rodriguez, Aitor Moreno Fdez de Leceta, Luka Carmona Rivas

    Abstract: This article explores the current state and future prospects of quantum computing in industrial environments. Firstly, it describes three main paradigms in this field of knowledge: gate-based quantum computers, quantum annealers, and tensor networks. The article also examines specific industrial applications, such as bin packing, job shop scheduling, and route planning for robots and vehicles. The… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: This paper is the English version of the work published in https://doi.org/10.52152/D11380

  39. arXiv:2505.00788  [pdf, ps, other

    cs.CV

    SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models

    Authors: Wufei Ma, Luoxin Ye, Nessa McWeeney, Celso M de Melo, Jieneng Chen, Alan Yuille

    Abstract: Humans naturally understand 3D spatial relationships, enabling complex reasoning like predicting collisions of vehicles from different directions. Current large multimodal models (LMMs), however, lack of this capability of 3D spatial reasoning. This limitation stems from the scarcity of 3D training data and the bias in current model designs toward 2D data. In this paper, we systematically study th… ▽ More

    Submitted 2 June, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

    Comments: CVPR 2025 highlight

  40. arXiv:2505.00571  [pdf, ps, other

    stat.ML cs.LG

    Hypothesis-free discovery from epidemiological data by automatic detection and local inference for tree-based nonlinearities and interactions

    Authors: Giorgio Spadaccini, Marjolein Fokkema, Mark A. van de Wiel

    Abstract: In epidemiological settings, Machine Learning (ML) is gaining popularity for hypothesis-free discovery of risk (or protective) factors. Although ML is strong at discovering non-linearities and interactions, this power is currently compromised by a lack of reliable inference. Although local measures of feature effect can be combined with tree ensembles, uncertainty quantifications for these measure… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: Main body: 29 pages, 7 figures; Supplementary material: 39 pages, 14 figures

  41. arXiv:2505.00369  [pdf

    cs.CV

    Automated segmenta-on of pediatric neuroblastoma on multi-modal MRI: Results of the SPPIN challenge at MICCAI 2023

    Authors: M. A. D. Buser, D. C. Simons, M. Fitski, M. H. W. A. Wijnen, A. S. Littooij, A. H. ter Brugge, I. N. Vos, M. H. A. Janse, M. de Boer, R. ter Maat, J. Sato, S. Kido, S. Kondo, S. Kasai, M. Wodzinski, H. Muller, J. Ye, J. He, Y. Kirchhoff, M. R. Rokkus, G. Haokai, S. Zitong, M. Fernández-Patón, D. Veiga-Canuto, D. G. Ellis , et al. (5 additional authors not shown)

    Abstract: Surgery plays an important role within the treatment for neuroblastoma, a common pediatric cancer. This requires careful planning, often via magnetic resonance imaging (MRI)-based anatomical 3D models. However, creating these models is often time-consuming and user dependent. We organized the Surgical Planning in Pediatric Neuroblastoma (SPPIN) challenge, to stimulate developments on this topic, a… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 23 pages, 6 figures

  42. arXiv:2505.00340  [pdf, ps, other

    cs.CR

    Vehicular Communication Security: Multi-Channel and Multi-Factor Authentication

    Authors: Marco De Vincenzi, Shuyang Sun, Chen Bo Calvin Zhang, Manuel Garcia, Shaozu Ding, Chiara Bodei, Ilaria Matteucci, Sanjay E. Sarma, Dajiang Suo

    Abstract: Secure and reliable communications are crucial for Intelligent Transportation Systems (ITSs), where Vehicle-to-Infrastructure (V2I) communication plays a key role in enabling mobility-enhancing and safety-critical services. Current V2I authentication relies on credential-based methods over wireless Non-Line-of-Sight (NLOS) channels, leaving them exposed to remote impersonation and proximity attack… ▽ More

    Submitted 8 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  43. arXiv:2504.21549  [pdf, other

    cs.NI

    Online Experimental Design for Network Tomography

    Authors: Xuchuang Wang, Yu-Zhen Janice Chen, Matheus Guedes de Andrade, Mohammad Hajiesmaili, John C. S. Lui, Ting He, Don Towsley

    Abstract: How to efficiently perform network tomography is a fundamental problem in network management and monitoring. A network tomography task usually consists of applying multiple probing experiments, e.g., across different paths or via different casts (including unicast and multicast). We study how to optimize the network tomography process through online sequential decision-making. From the methodology… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

  44. RecGaze: The First Eye Tracking and User Interaction Dataset for Carousel Interfaces

    Authors: Santiago de Leon-Martinez, Jingwei Kang, Robert Moro, Maarten de Rijke, Branislav Kveton, Harrie Oosterhuis, Maria Bielikova

    Abstract: Carousel interfaces are widely used in e-commerce and streaming services, but little research has been devoted to them. Previous studies of interfaces for presenting search and recommendation results have focused on single ranked lists, but it appears their results cannot be extrapolated to carousels due to the added complexity. Eye tracking is a highly informative approach to understanding how us… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: Accepted to Resource & Reproducibility Track SIGIR '25

  45. arXiv:2504.18541  [pdf, ps, other

    cs.IT

    Optimal tables for asymmetric numeral systems

    Authors: Raphael S. Steiner, Mirko De Vita, Endri Bezati

    Abstract: We present several algorithms to generate tables for asymmetric numeral systems and prove that they are optimal in terms of discrepancy. In turn, this gives rise to the strongest proven bound on entropy loss. We further give improved theoretical bounds for the entropy loss in tabled asymmetric numeral systems and a brief empirical evaluation of the stream variant.

    Submitted 8 May, 2025; v1 submitted 24 March, 2025; originally announced April 2025.

    Comments: 20 pages

  46. arXiv:2504.18413  [pdf, other

    cs.IR

    An Empirical Study of Evaluating Long-form Question Answering

    Authors: Ning Xian, Yixing Fan, Ruqing Zhang, Maarten de Rijke, Jiafeng Guo

    Abstract: \Ac{LFQA} aims to generate lengthy answers to complex questions. This scenario presents great flexibility as well as significant challenges for evaluation. Most evaluations rely on deterministic metrics that depend on string or n-gram matching, while the reliability of large language model-based evaluations for long-form answers remains relatively unexplored. We address this gap by conducting an i… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

  47. arXiv:2504.17519  [pdf, other

    cs.IR

    Replication and Exploration of Generative Retrieval over Dynamic Corpora

    Authors: Zhen Zhang, Xinyu Ma, Weiwei Sun, Pengjie Ren, Zhumin Chen, Shuaiqiang Wang, Dawei Yin, Maarten de Rijke, Zhaochun Ren

    Abstract: Generative retrieval (GR) has emerged as a promising paradigm in information retrieval (IR). However, most existing GR models are developed and evaluated using a static document collection, and their performance in dynamic corpora where document collections evolve continuously is rarely studied. In this paper, we first reproduce and systematically evaluate various representative GR approaches over… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: Accepted at SIGIR 2025 (Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval)

  48. Adaptive Orchestration of Modular Generative Information Access Systems

    Authors: Mohanna Hoveyda, Harrie Oosterhuis, Arjen P. de Vries, Maarten de Rijke, Faegheh Hasibi

    Abstract: Advancements in large language models (LLMs) have driven the emergence of complex new systems to provide access to information, that we will collectively refer to as modular generative information access (GenIA) systems. They integrate a broad and evolving range of specialized components, including LLMs, retrieval models, and a heterogeneous set of sources and tools. While modularity offers flexib… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: Accepted at SIGIR 2025 Perspective Paper Track

  49. Generative AI for Research Data Processing: Lessons Learnt From Three Use Cases

    Authors: Modhurita Mitra, Martine G. de Vos, Nicola Cortinovis, Dawa Ometto

    Abstract: There has been enormous interest in generative AI since ChatGPT was launched in 2022. However, there are concerns about the accuracy and consistency of the outputs of generative AI. We have carried out an exploratory study on the application of this new technology in research data processing. We identified tasks for which rule-based or traditional machine learning approaches were difficult to appl… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 10 pages, 4 figures, 6 tables. Published in Proceedings of the 2024 IEEE 20th International Conference on e-Science (e-Science), Osaka, Japan

    MSC Class: 68T50 ACM Class: I.2.7

  50. arXiv:2504.15285  [pdf, other

    physics.med-ph cs.CE math.NA

    AneuPy: An open source Python tool for creating simulation-ready geometries of abdominal aortic aneurysms

    Authors: Mario de Lucio, Jacobo Diaz, Alberto de Castro, Luis E. Romera

    Abstract: Abdominal aortic aneurysms (AAAs) are localized dilatations of the abdominal aorta that can lead to life-threatening rupture if left untreated. AAAs primarily affect older individuals, with high mortality rates following rupture, so early diagnosis and risk assessment are critical. The geometrical characteristics of an AAA, such as its maximum diameter, asymmetry, and wall thickness, are extremely… ▽ More

    Submitted 15 May, 2025; v1 submitted 13 March, 2025; originally announced April 2025.

    Comments: 14 pages, 5 figures